惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
CERT Recently Published Vulnerability Notes
www.infosecurity-magazine.com
www.infosecurity-magazine.com
I
Intezer
Malwarebytes
Malwarebytes
V
V2EX - 技术
P
Proofpoint News Feed
Google Online Security Blog
Google Online Security Blog
C
Cybersecurity and Infrastructure Security Agency CISA
GbyAI
GbyAI
Cyberwarzone
Cyberwarzone
A
Arctic Wolf
博客园 - Franky
C
CXSECURITY Database RSS Feed - CXSecurity.com
Cisco Talos Blog
Cisco Talos Blog
腾讯CDC
F
Fox-IT International blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
T
Threat Research - Cisco Blogs
Hacker News: Ask HN
Hacker News: Ask HN
WordPress大学
WordPress大学
Attack and Defense Labs
Attack and Defense Labs
Security Latest
Security Latest
D
Docker
Google DeepMind News
Google DeepMind News
Simon Willison's Weblog
Simon Willison's Weblog
H
Hacker News: Front Page
小众软件
小众软件
酷 壳 – CoolShell
酷 壳 – CoolShell
爱范儿
爱范儿
MyScale Blog
MyScale Blog
L
LangChain Blog
T
True Tiger Recordings
aimingoo的专栏
aimingoo的专栏
T
The Exploit Database - CXSecurity.com
博客园 - 司徒正美
Latest news
Latest news
Jina AI
Jina AI
U
Unit 42
Application and Cybersecurity Blog
Application and Cybersecurity Blog
Hugging Face - Blog
Hugging Face - Blog
Martin Fowler
Martin Fowler
T
ThreatConnect
Blog — PlanetScale
Blog — PlanetScale
S
SegmentFault 最新的问题
SecWiki News
SecWiki News
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
The Cloudflare Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
量子位
The Hacker News
The Hacker News

Hacker News - Newest: "AI"

Gen Z is not booing AI. It is booing its own job market AI #169: New Knowledge AI as a Design Medium Frontier labs don’t use most AI compute (yet) It's 2026...where are all the AI NPCs? Ask HN: Do people lie about why they hate AI writing on social media? CoreMem - Your context, any AI agent Sundar Pichai discusses AI search VICTORY: POLITICO agrees to shut down both AI tools at center of landmark arbitration AI's Plummeting Prices Are a Software Story, Not a Hardware One The Invisible Cliff: AI Development and Architectural Debt Show HN: AI-Mirror - Self-optimising ranking engine for modern web applications. How do AI chips work? [video] Navigating the New Frontier: AI's Role in Revolutionizing Mathematics and the Quest for Ethical Science Trump's unsigned AI executive order Mdview.io – a Markdown viewer for AI era documentation Anti-"doomer" feedback derails Trump's AI executive order Agents League: The Esports-Inspired Hackathon Where AI Agents Battle for Glory The AI Superstars Who Say a 'Vibe Slop' Crisis Is Coming Show HN: Lilo – An open source personal AI assistant that lives in Telegram Cannes Film Cost $500k to Make. $400k Was AI Compute Costs Where to buy anything AI Powered Search Everyone is an AI Cop Now: What Happens When an AI-Generated Story Wins a Prestigious Prize On AI Coding Assistants | Winston Cooke China’s AI optimism isn’t what it seems AI errno(2) values Believe It Or Not, The Government Is Adopting AI to Make Your Life Easier Google plans to win the AI war Anime AI Studio | Create AI Anime Dramas & Videos from Ideas HN isn't swamped yet, just obsessed with AI · mahl.me OneHundredBiz — Financial Business Ideas with AI Tools An AI system to help scientists write expert-level empirical software Ask HN: We need a standard way to say how much AI was used in a PR Anthropic, Microsoft in talks for AI chip deal after $5 billion investment Idea: Subreddits as curator blogs for the AI era The elephant in the room • Josh W. Comeau What Happens When AI Edits a Classical Chinese Academic Paper: What Happens When AI Edits a Classical Chinese Academic Paper / 当AI修改古汉语学术论文时发生了什么 China's AI optimism isn't what it seems Ask HN: How much AI is in your writing? wwwatch · AI intel for builders Diia - Ukraine gov app launched AI agent based on Google Gemini The IPO wave will enshrine the AI gods' control over the future We shipped 30 tools to our agent. The most-used one just reads our documentation. - kapa.ai - Instant AI answers to technical questions How we work: AI skills - Easy Cyber Protection Governor Newsom signs first-of-its-kind executive order to prepare workers and businesses for potential AI disruption | Governor of California Another California tech company lays off thousands - Los Angeles Times How the AI backlash could cost investors AI Has a Memory. It Just Doesn't Know What to Remember The Companies Cutting Headcount for AI Will Lose to the Ones Who Didn't Ask HN: Is there a better and more affordable AI coding tool than Claude? Food for Agile Thought #545: R/L Agentic Chaos, AI Killed the Agile Industry The current AI pricing was always going to go away A top K-drama star faces explosive backlash over AI-manipulated voice evidence Clickup mocks employees over AI 8 days before layoff Automated Expert Extraction: Behavioural Telemetry of Nyx Wave Ban on Authors Who Submit AI Content “Welcome but Unenforceable” Hollywood in the 60s and the Good AI Future — Joel Dueck Proton Pass for AI Agents Baby Magic-AI Baby Image & Video Generator Online Interactive AI Chat - Chrome 应用商店 Google I/O showed how the path for AI-driven science is shifting Google makes Gemini 3.5 Flash the default AI model for billions of users - Tech Three Dots AI didn't kill your junior pipeline. You did | Andrew Murphy Adobe, Canva, CapCut Are Coming to Gemini to Help You Edit AI Creations "Erase," an AI tool that can remove unwanted objects from images Steve Wozniak cheered after telling students they have AI – actual intelligence AI-Assisted Engineering Habits Worth Stealing (Week 2 Roundup) The best engineers in 2026 aren't the best coders. They're the best at not trusting AI code. GitHub - Woodman97/lucy-agent: AI agent for writing, research, code, DeFi & blockchain. Pay per task in USDC on Base or Solana. A2A + MCP + x402 protocols. $200/month per developer on AI tools. Most companies can't explain what they're getting. Spotify and UMG Announce Licensing Deal to Allow for AI Covers and Remixes CodeAlta After Automation Acrisure layoffs to number 2,250, attributed to AI advancements Report Alleges Chinese Influence Behind AI Data Center Pushback in the U.S. Pressure from Silicon Valley helped block Trump’s expected order on AI AI may be inflationary before it becomes productive Cisco used AI to write security incident reports, with mixed results PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications GitHub - ai-mf/media-engine Ask HN: What the Best AI for Coding? Meet Hell Grind, The First Feature Film "Created Entirely On The Higgsfield AI Platform" Navigating AI with paper maps The Unsustainable Subsidy An Uncharitable Taxonomy of the AI Discourse ReCardEx — AI Product Photography for Marketplaces White House yanked AI order after David Sacks raised industry concerns Best Practices to Produce Maintainable Code with AI [video] AI Slop & the Vulnerability Treadmill Crypto and AI-Funded Super PACs Are Metastasizing The AI Bubble — No One's Happy Lam Research focused on adding AI to chipmaking tools as it eyes US expansion Donald Trump abruptly postpones AI order after White House infighting Tell HN: I'm tired of AI-generated answers Design prompting: describe the world, not the widget AI Local Recorder App - App Store erlang_python — erlang_python v3.0.0 Outlier AI is paying cardiologists to review ECGs and train AI models (referral) Agentic Engineering Memory — A Memco Field Guide Igor Babuschkin Seeks Up To $1 Billion For River AI
GitHub - GitMonsters/13-Impossible-ARC-Tasks-SOLVED: 13 ARC-AGI-2 tasks with 0% AI solve rate — solved by TranscendPlexity. NVARC, GPT-4, Claude, Gemini: 0/13. We got 13/13. Verified, deterministic Python solvers.
wormsWorld · 2026-05-23 · via Hacker News - Newest: "AI"

13 Impossible Tasks — All Solved

TranscendPlexity: 13/13 Everyone Else: 0/13
13 verified solvers ARC Explainer Full Catalog MIT License

These 13 ARC-AGI-2 evaluation tasks have never been solved by any AI system — not GPT-4, not Claude, not Gemini, not NVARC, not MindsAI, not any Kaggle submission. They have a 0% AI solve rate across all publicly tracked attempts.

TranscendPlexity solved all 13.

The Scoreboard

System Tasks Solved (of 13) Overall ARC-AGI-2 Score
TranscendPlexity 13 / 13 120 / 120 (100%)
NVARC (Kaggle 1st) 0 / 13 24%
The ARChitects (2nd) 0 / 13 16.5%
MindsAI (3rd) 0 / 13 12.6%
GPT-4o 0 / 13 9%
Claude 3.5 Sonnet 0 / 13 21%
Gemini 1.5 0 / 13 8%

Source: ARC Explainer — Unsolved Puzzles

The 13 Tasks

Task ID Solver Lines Train/Test Pairs Status
abc82100 239 4 / 1 ✅ Solved
21897d95 525 4 / 2 ✅ Solved
e12f9a14 348 4 / 2 ✅ Solved
a32d8b75 303 3 / 2 ✅ Solved
9bbf930d 274 3 / 1 ✅ Solved
4e34c42c 269 2 / 2 ✅ Solved
88bcf3b4 259 5 / 2 ✅ Solved
13e47133 190 3 / 2 ✅ Solved
8b7bacbf 168 4 / 2 ✅ Solved
62593bfd 166 2 / 2 ✅ Solved
88e364bc 153 3 / 2 ✅ Solved
2b83f449 151 2 / 1 ✅ Solved
269e22fb 93 5 / 2 ✅ Solved

Total: 3,138 lines of deterministic solver code.

Verify It Yourself

git clone https://github.com/GitMonsters/13-Impossible-ARC-Tasks-SOLVED.git
cd 13-Impossible-ARC-Tasks-SOLVED
python3 verify_all.py

Every solver is a standalone Python function — no dependencies, no ML models, no LLMs at inference time. Clone it, run it, verify it.

Run a single solver:

python3 -c "
import json, importlib.util

task_id = 'abc82100'
with open(f'dataset/tasks/{task_id}.json') as f:
    task = json.load(f)

spec = importlib.util.spec_from_file_location('solver', f'solves/{task_id}/solver.py')
mod = importlib.util.module_from_spec(spec)
spec.loader.exec_module(mod)

for pair in task['test']:
    result = mod.solve(pair['input'])
    assert result == pair['output'], 'Mismatch!'
    print(f'{task_id}: ✅ PASS')
"

Visual Showcase

Open 13_Impossible_Tasks_SOLVED.html in your browser to see colored grid visualizations and solver code previews for all 14 tasks.

Methodology

Each solver was synthesized using LLM-guided program synthesis (Claude Opus 4.6):

  1. The model analyzes input/output training examples
  2. Hypothesizes the transformation rule
  3. Writes a solve(grid) function
  4. Tests against training pairs, iterates until correct
  5. Independently verified against held-out test pairs

The result: readable, deterministic Python code that encodes the discovered rule. No black boxes.

Full Catalog

These 14 are the hardest of the hard. For all 540 solved tasks (400 AGI-1 + 120 AGI-2 + 20 AGI-3), see:

👉 GitMonsters/SOLVED-540-of-540

License

MIT

Contact

Evan Pieserepieser@protonmail.com

Built with TranscendPlexity