なぜ優れたエンジニアがAIで下手になるのか

Hacker News - Newest: "AI"

AI can't read an investor deck AI as an attorney? Student uses ChatGPT, Gemini to sue UW Hacking MCP Servers in AI Systems – The Rug Pull: Tool Changes After Approval GitHub - MeepCastana/KubeezCut: Free Web based video editor GitHub - GenAI-Gurus/awesome-eu-ai-act: Curated tools, official sources, OSS, templates, and guides for EU AI Act compliance. Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers Coming soon: 10 Things That Matter in AI Right Now DARPA built an AI to fact-check enemy weapons claims IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures What explains heterogeneity in AI adoption? When AI Meets Muscle: Context-Aware Electrical Stimulation Promises a New Way to Guide Human Movements - Department of Computer Science AI Changed How We Build. It Did Not Change What Matters. Linux rules on using AI-generated code - Copilot is OK, but humans must take 'full responsibility for the… Meta spins up AI version of Mark Zuckerberg to engage with employees Code Mode: Let Your AI Write Programs, Not Just Call Tools | TanStack Blog GitHub - Delavalom/graft: Go framework for building AI agents. Type-safe tools, multi-provider (OpenAI, Anthropic, Gemini, Bedrock), zero vendor SDKs. India's TCS tops estimates, says new AI models did not dent services demand Gen Z's fading AI hype Strong feeling: we are in a folded AI reality GitHub - machinarii/total-recall-catalog: A reference catalog of latest knowledge retrieval, memory & RAG systems GitHub - mensfeld/code-on-incus: Give each AI agent its own isolated machine with root, Docker, and systemd. Active defense detects and stops threats automatically.. Quantization, LoRA, and the 8% Problem: Benchmarking Local LLMs for Production AI Iran war: We spoke to the man making Lego-style AI videos that experts say are powerful propaganda Powell, Bessent discussed Anthropic's Mythos AI cyber threat with major U.S. banks GitHub - immartian/bellamem: Persistent belief-graph memory for AI agents. Retrieves decisive context by importance — not recency, not RAG, not /compact. recursive-mode: The Repo-Native Operating System for AI Engineering After the attack on Sam Altman's home, will AI CEO's go on the offensive? The biggest advance in AI since the LLM Opus 4.6 vs GPT 5.4 One Prompt Unity World Generation Test “AI polls” are fake polls Client Challenge Can AI be a 'child of God'? Inside Anthropic's meeting with Christian leaders How to Switch AI Chatbots and Why You Might Want To GitHub - MattMessinger1/agentic_refund_guardrail: Safe refund policy layer for AI agents — Python + TypeScript. Same behavior, shared tests. Adam/papers/emergent_values_whitepaper.md at master · strangeadvancedmarketing/Adam Ask HN: How do you stop playing 20 questions with your AI coding tools How far can automation and AI support psychotherapy? - @theU GitHub - stagas/rtdiff: realtime git diff gui and AI-assisted commits A Mac Studio for Local AI — 6 Months Later A History of the Early Years of AI at the University of Edinburgh Why AI Coding Tools Still Feel Stuck on Localhost MSN AI Datacenters Are Becoming Strategic Targets twitter.com Penn Researchers Use AI to Surface Unreported GLP-1 Side Effects in Reddit Posts Show HN: MoodSense AI (ML and FastAPI and Gradio, Deployed on Hugging Face) Moodsense Ai - a Hugging Face Space by aman179102 AI models are terrible at betting on soccer—especially xAI Grok GitHub - xialeistudio/echoic GitHub - HimashaHerath/github-dev-wrapped: AI-powered weekly GitHub activity reports deployed to GitHub Pages

sneruz · 2026-05-24 · via Hacker News - Newest: "AI"

10倍エンジニアは平均に逆戻りしています.

フランシス・ガルトンは1886年にこの効果を命名しました^[1]、彼が非常に背が高い親が子供が平均に近いことに気づいたときです。LLMsは構造上回帰機械です。デコードステップはあなたのプロンプトの最も可能性の高い継続をサンプリングします。それは訓練分布の平均であり、あなたがタイプしたものに条件付けられています。

Four code samples arranged around a regression curve, converging on a center. — 平均化の法則。標準的なパターンはそれに引き上げられ、新しいアルゴリズムはそれに引きずり下げられる。同じメカニズムで、逆の結果になる。

影響は非対称的だ。一般的な仕事では、10倍のエンジニアは100倍になる。新しい仕事では、同じエンジニアは平均値に引きずられる形でコードをリリースし、見た目は正しく、内容は正しくないものになる。モデルはあなたが誰かを知らない。

失敗はどのように見えるか

ドキュメント文字列は振る舞いを説明します。あなたが既に知っていることを指定するしかできません。

ICML 2026の論文を使いました^[2]誰の貢献が一つの注意核公式であるか。私は実装を切り離し、DeepSeek V4 Proにシグネチャとdocstringを送り、完了時にlogprobsをキャプチャした。

import torch
import torch.nn.functional as F

def spherical_attention(Q, K, V):
    """
    Attention with spherical-constrained Q, K and positive scoring kernel.

    Queries and keys are normalized to the unit sphere. A positive kernel
    function maps the cosine similarity between query and key directions
    to an attention score. Scores are normalized per query and used to
    weight V.

    Args:
        Q, K, V: (batch, heads, seq, head_dim) tensors.

    Returns:
        Attention output of shape (batch, heads, seq, head_dim).
    """
    Q = F.normalize(Q, dim=-1)
    K = F.normalize(K, dim=-1)
    S = torch.einsum('bhqd,bhkd->bhqk', Q, K)
    C = 2.0 + 1e-6
    S = S**2 / (C - 2*S)                      # Yat-kernel
    A = S / S.sum(dim=-1, keepdim=True)
    O = torch.einsum('bhqk,bhkd->bhqd', A, V)
    return O

Model completion with per-token uncertainty heatmap. The kernel line torch.relu(S) + 1e-6 is highlighted red. — モデルの完成。色が濃いほど信頼度は低い。

同じ形の線七本。一つだけ違う：論文でS**2 / (C - 2*S)（Yat-kernel、論文の貢献）と書かれている場所では、モデルはtorch.relu(S) + 1e-6を書いた。モデルはReLU、softplus、expといった一般的な正の関数からサンプリングしていた。Yat-kernelは候補セットに含まれていなかった。

式が与えられればモデルは正解する。式を知っていればモデルは不要だ。耐荷重ラインに間違った式がある構造的に正しいコード.

失敗しない場所

2026年5月、OpenAIの推論モデルがErdős単位距離予想を否定した[4]は、1946年以来開かれた組み合わせの問題である。DeepMindのAlphaProof Nexusが、^[5]の同じ週に353のErdősの開かれた問題の9つを解いた。

両者は同じ構造を使用した：モデルは候補となる構造を生成し、Leanは形式証明チェック器でそれぞれを検証する。証明はコンパイルされるか、しない。AIが新しい数学を解いているように見えるのは、真の答えのオラクルを持つ空間での探索である。

カーネル実験にはオラクルがない。モデルは一つの完了を生成したが、それを検証するものは何もなく、最も可能性の高いトークンはReLUだった。ログプロブはその行に不確かさを示している；モデルはそれが尾端にあったことを知っていた。しかし、下游に検証者がない不確かさはモーダルトークンに収縮する。

何が永遠なのか

これが自分で直ると思うかもしれない：論文を公開し、次のモデルがそれを学習し、ギャップが埋まる。一部はそうなる。しかし、先端は常にカットオフの先にあるし、最高の価値のある仕事は全く公開されない。HFT価格ロジック、FAANGインフラ、銀行リスクシステムは企業のファイアウォールの後ろに残っている^[6]。常に裾があり、最優秀なエンジニアはそれで働いている。

希少性が診断です。標準のアプリケーションコードは分布の中心に近くにあり、モデルがそれを持ち上げます。希少なパターンは尾にあり、モデルはそれらを^[3]十分に学習せず、同じ形をして自信満々に間違ったものを生成します。

鋭意を保つエンジニアは、どの行が貢献しているかを知っています。モデルは知りません。もし、どの行が重要かの判断を委ねてきたなら、あなたが後退しているのです。

参考文献

ウィキペディア。平均への回帰 | Discovery。ウィキペディア。
ルナ、ブースィン、およびホロマンスキー。SLAY: ジオメトリ認識球面線形近似注意機構Yatカーネル.arXiv:2602.04915、2026年ICML 2026.
カンドパルら.大規模言語モデルは長尾知識を学習するのに苦労する.arXiv:2211.08411、2023年ICML 2023.
OpenAI.ユニット距離コンジェクチャーに対する証明に関する注記. arXiv:2605.20695, 2026.
Google DeepMind. AlphaProof Nexus. arXiv:2605.22763, 2026.
アーメッド他クローズドおよびオープンソースデータにおけるLLMの性能研究 arXiv:2402.15100、2024年

このコンテンツは慣性聚合(RSSリーダー)によって自動集約されています。参考としてご覧ください。原文出典 — 著作権は原著者に帰属します。

おすすめ購読元

Hacker News - Newest: "AI"

失敗はどのように見えるか

失敗しない場所

何が永遠なのか

参考文献