GitHub - GitMonsters/13-Impossible-ARC-Tasks-SOLVED: 13 ARC-AGI-2 tasks with 0% AI solve rate — solved by TranscendPlexity. NVARC, GPT-4, Claude, Gemini: 0/13. We got 13/13. Verified, deterministic Python solvers.

Before this work, no publicly tracked AI system had solved any of these 13 ARC-AGI-2 evaluation tasks: not GPT-4, not Claude, not Gemini, not NVARC, not MindsAI, and not any Kaggle submission. Across all publicly tracked attempts, their AI solve rate was 0%.

TranscendPlexity solved all 13.

The Scoreboard

System	Tasks Solved (of 13)	Overall ARC-AGI-2 Score
TranscendPlexity	13 / 13 ✅	120 / 120 (100%)
NVARC (Kaggle 1st)	0 / 13	24%
The ARChitects (2nd)	0 / 13	16.5%
MindsAI (3rd)	0 / 13	12.6%
GPT-4o	0 / 13	9%
Claude 3.5 Sonnet	0 / 13	21%
Gemini 1.5	0 / 13	8%

Source: ARC Explainer — Unsolved Puzzles

The 13 Tasks

Task ID	Solver Lines	Train/Test Pairs	Status
`abc82100`	239	4 / 1	✅ Solved
`21897d95`	525	4 / 2	✅ Solved
`e12f9a14`	348	4 / 2	✅ Solved
`a32d8b75`	303	3 / 2	✅ Solved
`9bbf930d`	274	3 / 1	✅ Solved
`4e34c42c`	269	2 / 2	✅ Solved
`88bcf3b4`	259	5 / 2	✅ Solved
`13e47133`	190	3 / 2	✅ Solved
`8b7bacbf`	168	4 / 2	✅ Solved
`62593bfd`	166	2 / 2	✅ Solved
`88e364bc`	153	3 / 2	✅ Solved
`2b83f449`	151	2 / 1	✅ Solved
`269e22fb`	93	5 / 2	✅ Solved

Total: 3,138 lines of deterministic solver code.

Verify It Yourself

git clone https://github.com/GitMonsters/13-Impossible-ARC-Tasks-SOLVED.git
cd 13-Impossible-ARC-Tasks-SOLVED
python3 verify_all.py

Every solver is a standalone Python function — no dependencies, no ML models, no LLMs at inference time. Clone it, run it, verify it.

Run a single solver:

python3 -c "
import json, importlib.util

task_id = 'abc82100'
with open(f'dataset/tasks/{task_id}.json') as f:
    task = json.load(f)

spec = importlib.util.spec_from_file_location('solver', f'solves/{task_id}/solver.py')
mod = importlib.util.module_from_spec(spec)
spec.loader.exec_module(mod)

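# Check every held-out test pair and report each one separately
for i, pair in enumerate(task['test']):
    result = mod.solve(pair['input'])
    assert result == pair['output'], f'{task_id}: test pair {i} mismatch'
    print(f'{task_id} test pair {i}: ✅ PASS')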
"
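The single-solver check above can be generalized to every task. The following is a minimal sketch, not the repository's actual verify_all.py: it assumes the same layout as the snippet above (`dataset/tasks/<task_id>.json` and `solves/<task_id>/solver.py`, each solver exposing `solve(grid)`), and the helper names `load_solver`, `check_pairs`, and `verify_all` are hypothetical.

```python
import importlib.util
import json
from pathlib import Path


def load_solver(solver_path):
    """Import a solver module from a file path and return its solve function."""
    spec = importlib.util.spec_from_file_location("solver", solver_path)
    mod = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(mod)
    return mod.solve


def check_pairs(solve, pairs):
    """Count the pairs whose predicted grid equals the expected grid exactly."""
    return sum(1 for pair in pairs if solve(pair["input"]) == pair["output"])


def verify_all(root="."):
    """Check each solves/<task_id>/solver.py against that task's test pairs.

    Assumed layout (taken from the single-solver snippet, not confirmed
    against the repository's own verify_all.py).
    """
    root = Path(root)
    results = {}
    for solver_path in sorted(root.glob("solves/*/solver.py")):
        task_id = solver_path.parent.name
        with open(root / "dataset" / "tasks" / f"{task_id}.json") as f:
            task = json.load(f)
        passed = check_pairs(load_solver(solver_path), task["test"])
        results[task_id] = (passed, len(task["test"]))
    return results


if __name__ == "__main__":
    for task_id, (passed, total) in verify_all().items():
        status = "PASS" if passed == total else "FAIL"
        print(f"{task_id}: {passed}/{total} {status}")
```

Grids are compared with plain list equality, so a solver passes a pair only if every cell and the grid dimensions match.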

Visual Showcase

Open 13_Impossible_Tasks_SOLVED.html in your browser to see colored grid visualizations and solver code previews for all 13 tasks.

Methodology

Each solver was synthesized using LLM-guided program synthesis (Claude Opus 4.6):

1. The model analyzes the input/output training examples.
2. It hypothesizes the transformation rule.
3. It writes a solve(grid) function.
4. It tests the function against the training pairs and iterates until every pair passes.
5. The final solver is independently verified against the held-out test pairs.
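The test-and-verify part of this loop can be sketched as follows. This is an illustration under stated assumptions, not the project's actual pipeline: a fixed list of hand-written candidate functions stands in for the solvers an LLM would propose, the toy task is invented for the example, and the helper names `first_failure` and `select_candidate` are hypothetical.

```python
def first_failure(solve, pairs):
    """Return the index of the first pair the candidate gets wrong, or None."""
    for i, pair in enumerate(pairs):
        try:
            if solve(pair["input"]) != pair["output"]:
                return i
        except Exception:
            return i  # a crash counts as a failure on that pair
    return None


def select_candidate(candidates, train_pairs):
    """Return the first candidate that reproduces every training output."""
    for solve in candidates:
        if first_failure(solve, train_pairs) is None:
            return solve
    return None


# Toy task (hypothetical): the hidden rule mirrors each row left to right.
toy_task = {
    "train": [
        {"input": [[1, 0], [2, 3]], "output": [[0, 1], [3, 2]]},
        {"input": [[4, 5, 6]], "output": [[6, 5, 4]]},
    ],
    "test": [{"input": [[7, 8]], "output": [[8, 7]]}],
}


def identity(grid):
    return [row[:] for row in grid]


def flip_vertical(grid):
    return grid[::-1]


def mirror_rows(grid):
    return [row[::-1] for row in grid]


# Candidates are tried in order; only training pairs are used for selection.
chosen = select_candidate([identity, flip_vertical, mirror_rows], toy_task["train"])

# The held-out test pair is checked once, after selection.
held_out_ok = first_failure(chosen, toy_task["test"]) is None
print(chosen.__name__, held_out_ok)  # prints: mirror_rows True
```

Keeping selection on the training pairs and checking the test pairs only afterwards is what makes the final verification a held-out check rather than part of the search.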

The result: readable, deterministic Python code that encodes the discovered rule. No black boxes.

Full Catalog

These 13 are the hardest of the hard. For all 540 solved tasks (400 AGI-1 + 120 AGI-2 + 20 AGI-3), see:

👉 GitMonsters/SOLVED-540-of-540

License

MIT

Contact

Evan Pieser — epieser@protonmail.com

Built with TranscendPlexity
