惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

酷 壳 – CoolShell
酷 壳 – CoolShell
H
Hacker News: Front Page
P
Palo Alto Networks Blog
T
ThreatConnect
Apple Machine Learning Research
Apple Machine Learning Research
博客园_首页
T
True Tiger Recordings
P
Privacy & Cybersecurity Law Blog
B
Blog
IT之家
IT之家
Last Week in AI
Last Week in AI
F
Full Disclosure
Hacker News: Ask HN
Hacker News: Ask HN
C
Comments on: Blog
Microsoft Azure Blog
Microsoft Azure Blog
C
Cybersecurity and Infrastructure Security Agency CISA
Microsoft Security Blog
Microsoft Security Blog
博客园 - 【当耐特】
N
News and Events Feed by Topic
NISL@THU
NISL@THU
腾讯CDC
雷峰网
雷峰网
Security Latest
Security Latest
李成银的技术随笔
M
Microsoft Research Blog - Microsoft Research
L
LangChain Blog
L
Lohrmann on Cybersecurity
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
C
Check Point Blog
Y
Y Combinator Blog
Recent Announcements
Recent Announcements
博客园 - Franky
N
News | PayPal Newsroom
V
V2EX
A
About on SuperTechFans
The Register - Security
The Register - Security
月光博客
月光博客
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
Google Online Security Blog
Google Online Security Blog
MyScale Blog
MyScale Blog
Cisco Talos Blog
Cisco Talos Blog
Vercel News
Vercel News
WordPress大学
WordPress大学
C
Cyber Attacks, Cyber Crime and Cyber Security
The Hacker News
The Hacker News
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
爱范儿
爱范儿
A
Arctic Wolf
L
LINUX DO - 最新话题
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More

DEV Community

93 Agents. 2.6 Billion Tokens. One Working OS. And a Bill Under $1,000. Stop Getting 'It Depends' Answers About RAG Architecture Wrapping Hermes Agent with agent-stack: six tiny libs for the boring parts Templating got me to 33,620 pages. Indexing them was the hard part. EML Attachments Not Opening? Here’s How to Fix the Issue Easily in 2026 WordPress client onboarding: the exact process I use to start every maintenance contract right Models shouldn't have execution authority. Why we built a deterministic FSM runtime for AI agents. AI makes building faster, but semantic distribution is now the hard part How I Created My First Solana Token from Scratch (SPL Token Basics Explained) How I use WP-CLI to cut WordPress maintenance time from 6 hours to 20 minutes What Is Vibe Coding? And Does It Actually Work for Production Code? (I Tested 10 Tools) WordPress staging environments: the 15-minute setup that prevents client emergencies Reading Log #2 — Sapiens Imagined Communities: An Organization Is Made of Shared Fiction I Built a Branded Token on Solana in 5 Minutes (No Smart Contract Needed) The Confidence Gap: How AI Introduces Silent Errors on Production Sites Day 7 - Dense Embedding - RAG Why teachers need explainable AI, not just accurate AI — building the KC dashboard Closing the feedback loop: how mistake classification drives adaptive problem selection in NumPath Amazon Quick: AWS's Agentic Workspace, Explained for Engineers "My Coding Agent Remembered Sessions, Not Work. That Was the Bug" Reading Log #0 — Manga Was a Democratization Device for Cultural Capital SecOps Salary Guide 2025: GRC CTC Ranges in Indian GCCs vs Product Companies 🐍 python global vs nonlocal keyword — when to use each? 6 Free Online Video Cutters That Don’t Completely Ruin Your Footage (2026) Engineering Around Bitcoin's Traditional Platform Lockdowns AI 2026 ⚖️ Case File 4.1: The Efficiency Extortion Client-Side AI: The Next Era of Consumer E-Commerce? Why I'm building an AI math tutor for dyscalculia — and grounding it in 30 years of ITS research "My DingTalk Coding Bot Said It Started the Task. Then It Never Sent the Result" Your trycatch sucks - lets fix it I Built a Globally Distributed Blog Platform for ~$1/Month Awesome-Claude-Skills I built 135 Claude Skills with real formulas. Here's what "production-grade" actually means. How I engineered a Non-Euclidean AI framework for massive data reduction Automated 25 Minutes of My Morning With a Prompt (Not a Script) أدوات API ذاتية الاستضافة: هل يجب أن تترك السحابة؟ Never Use Service Classes in Rails How Markus Builds AI Teams That Actually Ship — Not Just Chat Pricing logic feels boring until it's wrong. Software Engineer Skills Companies Want in 2026: 48K-Posting Analysis Data Races Reproduced: Harnesses That Catch Heisenbugs Demystifying AI Agents: Building an Agentic Pipeline From Scratch in Pure Python Coding Agents Are Becoming Remote Workers. Enterprises Need an Agent Harness. How I Let an AI Refactor My Whole Codebase (Using Gemini 3.5) Flutter 3.44 Highlights From Google I/O 2026: What's New and What Matters The Hidden Cleanup Cost Behind AI Coding Velocity Promises A beginner's guide to the Image-Background-Remove model by Zf-Kbot on Replicate A beginner's guide to the Invsr model by Zf-Kbot on Replicate How to Automate Canadian T4 Slip Parsing with an API (No OCR Setup Required) حماية مفاتيح API من إضافة VS Code ضارة Agetor Review: An Open-Source Kanban Board for Orchestrating Claude Code Why most Marketo audits start at the wrong layer RevOps alignment is an operating-model problem, not a tooling problem Why Some Developers Are Moving Away From Tailwind CSS in 2026 API 키 보호: 악성 VS 코드 확장으로부터 안전하게 VS Code拡張機能によるAPIキー漏洩を防ぐ方法 Temporal Cloud Serverless: Durable Execution Without the Ops Overhead Why Freshers Must Build Real AI Products Instead of Endless App Clones I Built a Dynamic llms.txt for Next.js. Then Google Said Don't Bother. AWS Summit Seoul 2026: Korean Enterprises And Agentic AI Does AI Know How Many Tokens It Is Burning Selling Software in Countries PayPal Can't Reach - A Cautionary Tale of Crypto and Custom Solutions My Old MacBook Air Couldn't Handle It — So I Used Google Colab to Train an AI#1 The Discord.js gotchas that cost me a week each (so they don't have to cost you one) Leetcode QOTD:- 3043. Find the Length of the Longest Common Prefix MPT DEX Performance Test Report I shipped a working landing page in 14 KB. Here is every byte. Zero-Secret CI/CD: GitHub Actions + OIDC on AWS (Part 6) Building the React Frontend: Document Library and Chat UI (Part 5) Runtime Governance Evidence Anchors in 2026: A Public Ledger for Budget and Accountability Decisions RAG and Vector Search with pgvector and Amazon Bedrock (Part 4) Serverless Document Pipelines with AWS Step Functions (Part 3) Multi-Tenant Auth with Cognito and PostgreSQL Row-Level Security (Part 2) Building a Multi-Tenant AI Document Platform on AWS (Part 1: Architecture) Building a Nutrition Calculator in JavaScript: filter, map, and reduce on Objects Shipping an MCP server: parallel search, JSON output, and what broke along the way Runtime Governance Evidence Anchors in 2026: A Public Ledger for Budget and Accountability Decisions A 3-step agent cost me $4.20. agenttrace showed me the O(n ) tool call hiding in plain sight. Beyond WebView: The Next Evolution of Hybrid App Architecture Our retry loop made an outage worse. The circuit breaker stopped the cascade. Claude returned ```json blocks 14% of the time. Here is the Rust crate I wish I had earlier. I burned my Anthropic org cap and waited 3 days. Then I built llmfleet. One Open Source Project a Day (No. 71): CodeGraph — Pre-Index Your Codebase for AI Agents, Save 35% Cost and 70% Tool Calls The prompt your SDK sends is not the prompt you wrote The Context Tax: Why Every Cursor Session Costs You 15 Minutes Prompt Physics: Building a Cognitive Steering Layer for Gemma 4 Pain Points Will Always Outlive Platforms 92. BERT: The Model That Reads in Both Directions QAOA vs. 75,000 Nodes: Building a Hybrid Architecture to Solve NP-Hard Problems When Quantum Simulators Hit a Wall E2B? E4B? 26B A4B? The Gemma 4 Model Names Finally Explained One Tool That Cuts Token Costs 40-80% for Claude Code, Codex, opencode, and openclaw Building a 32-URL economy microsite on top of a 754,000-row SQLite dataset Coordinating 100+ AI Agents in the Field: Practical Patterns for Robotic Swarms Static site search for Astro in 2026: why I picked Pagefind over Algolia and Lunr How I built pairwise AI model compare pages with Claude Haiku and a budget cap Three post-deploy checks I run after every Cloudflare Pages build Why I'm betting on AI-curated directories when Google AI Overviews answer the same queries When boto3 doesn't have it (yet), you write it: a realtime speech-to-speech story in Python Zero-Trust RAG: Defeating the Shared Private Link Deadlock in Azure Terraform You Can't Co-Design What You Don't Operate
I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move
manja316 · 2026-04-18 · via DEV Community

manja316

Everyone says prediction markets are efficient. I spent 30 days collecting data to test that claim.

The result: 8.9 million data points across 9,550 markets — and the data tells a story most traders miss completely.

The Setup

I built an automated collector that snapshots every active Polymarket market every 15 minutes. Not just BTC or the US election — every market. Politics, sports, crypto, geopolitics, economics, entertainment, weather, science. All of it.

After 30 days of continuous collection:

Markets tracked 9,550 Price snapshots 8,158,672 Orderbook depth snapshots 792,527 Categories 10 Total data points 8,951,199
Metric Value

Most Polymarket datasets you'll find cover a single market or a single event. This covers the entire platform simultaneously — which lets you see patterns that single-market analysis can't.

Finding #1: Markets Are Not Efficient After Crashes

Everyone assumes prediction markets instantly price in new information. The data says otherwise.

I measured what happens after a price drops more than 20% between consecutive snapshots. Here's what the 5,629 crash events show:

+15 min +6.6% 5,629 +30 min +8.8% 5,629 +45 min +10.3% 5,629 +1 hour +11.0% 5,629
Time After Crash Average Return Events Measured

After a >20% crash, prices bounce back an average of 6.6% within 15 minutes.

This is classic mean reversion — and it's massive. For comparison, the S&P 500's average annual return is about 10%. These markets deliver that in an hour after a crash.

The reverse is also true. After a >10% pump:

+15 min -2.9% +30 min -3.7%
Time After Pump Average Return

Prices that spike tend to give it back. Markets overreact in both directions.

Finding #2: Hold Time Matters More Than Entry Price

I simulated the obvious strategy — buy the crash, sell the recovery — across all 30 days of data. 6,225 trades. Here's how hold time affects the result:

2 hours 10,204 54% $87 6 hours 8,324 64% $108 12 hours7,29570%$121 24 hours 6,225 75% $135 48 hours 5,352 81% $142
Max Hold Trades Win Rate Total P&L

The sweet spot is 12 hours. Going from 12h to 48h only adds $21 to total P&L but locks your capital 4x longer. Most of the money is made in the first few hours.

This surprised me. I expected entry price to be the key variable. It's not:

Under $0.10 74% $0.014 $0.10 — $0.30 74% $0.018 $0.30 — $0.50 79% $0.032 Above $0.50 86% $0.022
Entry Price Range Win Rate Avg P&L Per Trade

Higher-priced markets actually have better win rates. The cheap ones look tempting but they include more dust trades that go nowhere.

Finding #3: Category Is Your Edge Selector

Not all Polymarket categories behave the same:

Crypto 646 78%$0.030 Best per-trade Sports 1,050 79%$0.027 Most consistent Other 1,872 75% $0.024 Most volume Politics 1,362 76% $0.018 Decent Geopolitics 890 71% $0.016 Below average Economics 101 69% $0.008 Avoid Weather 21 57% Negative Avoid
Category Trades Win Rate P&L Per Trade Verdict

Crypto and sports markets have the strongest mean reversion. Economics and weather markets are traps — they crash and stay crashed.

Why? Sports and crypto have event-driven resolution (the game happens, the price discovers). Economics markets depend on slow-moving indicators — when they crash, it's often because the fundamentals actually changed.

Finding #4: The "Always Bet No" Strategy Is Overhyped

You may have seen the "Nothing Ever Happens" bot that bets NO on everything. The claim: 73% of Polymarket resolves NO.

I checked with 4,763 resolved binary markets from the API:

  • All markets: 52.3% resolve NO (not 73%)
  • Non-sports: 57%
  • "Will X happen?" framing: 59.3%

The 73% figure comes from a heavily filtered subset. Across all markets, the NO edge is barely there — and at typical NO prices ($0.65-0.85), the math doesn't work.

The Dataset Is Free to Explore

I'm releasing the full dataset across multiple platforms:

Free (sample):

  • Kaggle — markets.csv + 100K price preview + full SQLite DB
  • HuggingFace — same files, HF ecosystem integration
  • GitHub — browse the data, star if useful

Full dataset ($9):

  • Gumroad — 8.9M data points, orderbook depth, SQLite DB

The data updates weekly from an automated pipeline. If you want to reproduce any of these findings, everything is there.

What I'd Build Next

If I were starting a Polymarket quant project today, I'd focus on:

  1. Real-time crash detection — the 6.6% bounce after crashes is the clearest edge
  2. Category rotation — crypto and sports, skip economics and weather
  3. 12-hour max hold — the data is unambiguous on this
  4. Cross-market signals — does a crash in one political market predict crashes in related ones?

The prediction market space is where crypto was in 2017 — growing fast, most participants losing money, and the edge goes to people with data infrastructure.

The data was collected from Polymarket's Gamma API and CLOB API using an automated pipeline. All code and methodology are open source. I also maintain protodex.io, an index of 2,013 MCP servers with security scores.

Questions or want custom data cuts? LuciferForge@proton.me