How China’s Shadow AI API Market Works

Hacker News - Newest: "AI"

Ask HN: We need a standard way to say how much AI was used in a PR Anthropic, Microsoft in talks for AI chip deal after $5 billion investment Idea: Subreddits as curator blogs for the AI era The elephant in the room • Josh W. Comeau What Happens When AI Edits a Classical Chinese Academic Paper: What Happens When AI Edits a Classical Chinese Academic Paper / 当AI修改古汉语学术论文时发生了什么 China's AI optimism isn't what it seems Ask HN: How much AI is in your writing? wwwatch · AI intel for builders Diia - Ukraine gov app launched AI agent based on Google Gemini The IPO wave will enshrine the AI gods' control over the future We shipped 30 tools to our agent. The most-used one just reads our documentation. - kapa.ai - Instant AI answers to technical questions How we work: AI skills - Easy Cyber Protection Governor Newsom signs first-of-its-kind executive order to prepare workers and businesses for potential AI disruption | Governor of California Another California tech company lays off thousands - Los Angeles Times How the AI backlash could cost investors AI Has a Memory. It Just Doesn't Know What to Remember The Companies Cutting Headcount for AI Will Lose to the Ones Who Didn't Ask HN: Is there a better and more affordable AI coding tool than Claude? Food for Agile Thought #545: R/L Agentic Chaos, AI Killed the Agile Industry The current AI pricing was always going to go away A top K-drama star faces explosive backlash over AI-manipulated voice evidence Clickup mocks employees over AI 8 days before layoff Automated Expert Extraction: Behavioural Telemetry of Nyx Wave Ban on Authors Who Submit AI Content “Welcome but Unenforceable” Hollywood in the 60s and the Good AI Future — Joel Dueck Proton Pass for AI Agents Baby Magic-AI Baby Image & Video Generator Online Interactive AI Chat - Chrome 应用商店 Google I/O showed how the path for AI-driven science is shifting Google makes Gemini 3.5 Flash the default AI model for billions of users - Tech Three Dots AI didn't kill your junior pipeline. You did | Andrew Murphy Adobe, Canva, CapCut Are Coming to Gemini to Help You Edit AI Creations "Erase," an AI tool that can remove unwanted objects from images Steve Wozniak cheered after telling students they have AI – actual intelligence AI-Assisted Engineering Habits Worth Stealing (Week 2 Roundup) The best engineers in 2026 aren't the best coders. They're the best at not trusting AI code. GitHub - Woodman97/lucy-agent: AI agent for writing, research, code, DeFi & blockchain. Pay per task in USDC on Base or Solana. A2A + MCP + x402 protocols. $200/month per developer on AI tools. Most companies can't explain what they're getting. Spotify and UMG Announce Licensing Deal to Allow for AI Covers and Remixes CodeAlta After Automation Acrisure layoffs to number 2,250, attributed to AI advancements Report Alleges Chinese Influence Behind AI Data Center Pushback in the U.S. Pressure from Silicon Valley helped block Trump’s expected order on AI AI may be inflationary before it becomes productive Cisco used AI to write security incident reports, with mixed results PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications GitHub - ai-mf/media-engine Ask HN: What the Best AI for Coding? Meet Hell Grind, The First Feature Film "Created Entirely On The Higgsfield AI Platform" Navigating AI with paper maps The Unsustainable Subsidy An Uncharitable Taxonomy of the AI Discourse ReCardEx — AI Product Photography for Marketplaces White House yanked AI order after David Sacks raised industry concerns Best Practices to Produce Maintainable Code with AI [video] AI Slop & the Vulnerability Treadmill Crypto and AI-Funded Super PACs Are Metastasizing The AI Bubble — No One's Happy Lam Research focused on adding AI to chipmaking tools as it eyes US expansion Donald Trump abruptly postpones AI order after White House infighting Tell HN: I'm tired of AI-generated answers Design prompting: describe the world, not the widget AI Local Recorder App - App Store erlang_python — erlang_python v3.0.0 Outlier AI is paying cardiologists to review ECGs and train AI models (referral) Agentic Engineering Memory — A Memco Field Guide Igor Babuschkin Seeks Up To $1 Billion For River AI AI is killing the cheap smartphone web-ai-sdk · Building blocks for the Web's built-in AI China unveils 'world's first' underwater data center — 2,000 server facility is powered by offshore… AI for Solo Founders: Virtual Coffee Chat & Networking - #BosTechWeek | Partiful The Structural Barriers to AI Lawyers Roundtables: Can AI Learn to Understand the World? Spotify and Universal Music agree deal to let subscribers create AI remixes AI Tokenomics: How to Profitably Turn Tokens into Business Value [video] AI-assisted engineers are burning out, is this fine?—Martian Chronicles, Evil Martians’ team blog Trump pulls back AI order over fears it could slow US technology | AP News GitHub - simd-ai/agent Spotify and Universal Music strike deal allowing fan-made AI covers and remixes Best AI Audiobook Maker | Warblize dhrive: Squarespace for mobile apps GitHub - fireharp/coherence: Git-native drift detector for agent-assisted repos: catch stale docs, ADRs, tests, metrics, and generated artifacts. The AI has come for my code - The Boston Diaries Show HN: Synrix: hardware-verified memory routing for edge AI agents Starbucks scraps AI inventory tool across North America GitHub - bjcoombs/ai-native-toolkit: Claude Code configuration and customizations GitHub - VenturFlow/Assay Tanya Janca on AI Slop, Vibe Coding, & the Future of AppSec Ask HN: What is an optimal game theoretic response to AI adoption? Ask HN: What AI prompts have you found most reliable for actual work? White House postpones AI executive order signing ceremony Trump Postpones AI Executive Order Due to Concerns About Overregulation Show HN: Canonry tracks how AI cites you – agent-first, open source AMD Ryzen™ AI Halo for AI Developers I had to do therapy on my AI — Tin's Posts — Tin Marković Ask HN: Anyone else struggling with AI and work? Google quietly nerfed its AI Pro plan, and here’s what you get now Grok falls flat in Washington, undercutting SpaceX's AI growth story Why the Amish Are Falling in Love With AI

vincent_s · 2026-05-18 · via Hacker News - Newest: "AI"

China's shadow market offers access to Claude, Gemini, GPT, and other frontier models. Pay a local seller, get an API endpoint, connect it to a coding tool, and use models that are hard or impossible to reach directly from mainland China.

These API relay platforms are being advertised on Taobao and Xianyu. Sellers promise no-VPN access, low latency, large context windows, coding-tool compatibility, and official-looking Claude, Gemini, and ChatGPT access. Some listings claim "1:1 official models."

Claude and Gemini are not normally available in mainland China. Those shadow API sellers offer a workaround: instead of calling Anthropic, Google, or OpenAI directly, the developer calls the seller's server.

That server is the middleman. It receives the prompt, sends it somewhere else, gets an answer back, and returns that answer to the developer's tool.

The product is a working API path to models the buyer cannot easily call directly.

What you buy

The seller usually gives the buyer three things:

an API base URL
an API key
a list of model names the endpoint claims to support

To the buyer, it looks similar to using an official API. Put the base URL and key into your tool (e.g. a coding harness), choose a model name, and send requests.

The important difference is that the endpoint is not run by the model company. It is run by the seller. The buyer is not sending prompts directly to Anthropic, Google, or OpenAI. The buyer is sending prompts to a third party that decides what to do next.

The possible upstream routes can be an official API account, a cloud account, a consumer subscription, a pool of accounts, or a cheaper substitute model returned under a more expensive model name.

As a user you cannot verify which route the seller actually used for a request.

Where the request goes

The flow is straightforward:

Developer tool
        ->
Seller's API endpoint
        ->
Upstream account or model chosen by the seller
        ->
Answer returned to the user

The seller's endpoint is doing the routing. That means the seller controls the upstream account, the model choice, the logs, rate limits, and any fallback behavior.

A March 2026 audit of shadow APIs found weak transparency around provider identity, upstream models, and infrastructure. The authors identified 17 shadow API providers that had already appeared in research and open-source workflows.

So the model name you are seeing as a user might very well be fake. A dropdown that says Claude or Gemini does not prove that the request actually went to the official Claude or Gemini API.

Pricing below official API prices

Some sellers advertise access below official API prices.

Small discounts can have ordinary explanations: unused quota, promotional credits, volume pricing, or a cheaper payment path.

Very large discounts need a different supply source. ChinaTalk's investigation into cheap Claude tokens in China describes transfer stations, account merchants, SMS verification services, card merchants, proxy networks, subscription pooling, and downstream resellers. It says some Claude access is sold at roughly 10% of the official price.

Several mechanisms can reduce the seller's cost:

harvest trial credits
abuse educational or corporate discounts
turn subscription access into shared API-style access
use stolen credit cards
route traffic to cheaper models

Coding agents make subscription resale especially attractive. A normal subscription is priced for one user. If a seller turns that subscription into a shared endpoint for many users, the apparent per-user cost drops until the account is limited or banned.

Anthropic's February 2026 distillation report shows the same kind of account infrastructure at larger scale. Anthropic said DeepSeek, Moonshot, and MiniMax generated more than 16 million Claude exchanges through about 24,000 fraudulent accounts. It also described proxy networks where banned accounts are replaced and traffic is spread across many nodes.

The model might be fake

A shadow API seller can advertise one model and serve another.

If the buyer asks for Claude Opus or Gemini Pro, the seller can send the prompt to a cheaper model and still return the response under the expensive model name.

The shadow API audit found performance gaps, inconsistent safety behavior, and failed fingerprint checks when comparing shadow APIs with official APIs. In one reported case, an endpoint sold as Gemini-2.5 performed far below the official API on a medical benchmark.

This matters for normal users, but it also matters for research. If a paper or benchmark uses a shadow API endpoint while assuming it is testing an official model, the measured system may not be the model named in the paper.

The Logs May Be the Real Product

The obvious risk is privacy: you are sending proprietary code through an unknown server. A coding-agent session typically includes repository context, stack traces, package files, test outputs, tool calls, failed patches, successful patches, and human feedback. That is extremely valuable training data.

ChinaTalk argues that proxy logs may be one of the hidden monetization channels in this market. Every request passing through a proxy includes the full prompt, response, tool calls, and iteration history. For AI coding agents, those logs are unusually rich because they capture real engineering workflows and sometimes human-validated fixes.

Anthropic says that Chinese AI labs generated millions of Claude exchanges to train or improve their own models. This makes the economics easier to understand. A relay station might not need to make much profit on tokens if the traffic also produces valuable training data. The user thinks they are buying discounted inference while the operator is in the business of acquiring training data.

Why the Market Is Hard to Kill

Providers can ban accounts, block suspicious regions, require stronger verification, monitor traffic patterns, and shut down obvious abuse. But if one account dies, another replaces it. If one relay endpoint is blocked, traffic moves to another. If one seller disappears, another appears on a marketplace. If KYC gets stricter, someone will provide identity verification, overseas cards, phone numbers, or cloud accounts.

Anthropic described this kind of infrastructure as “hydra cluster” behavior: networks of fraudulent accounts with no single point of failure. In one case, it said proxy services mixed suspicious extraction traffic with unrelated customer traffic, making detection harder.

What I'm building

Delegate tasks. Get software.

Give Vroni a GitHub issue, bug report, spec, or rough idea. It reads the repo, plans the change, writes code, runs checks, and works toward a review-ready pull request.

Take a look at vroni.com

I respect your privacy. Unsubscribe at any time.

此内容由惯性聚合(RSS阅读器)自动聚合整理，仅供阅读参考。原文来自 — 版权归原作者所有。

推荐订阅源

Hacker News - Newest: "AI"

What you buy

Where the request goes

Pricing below official API prices

The model might be fake

The Logs May Be the Real Product

Why the Market Is Hard to Kill

Delegate tasks. Get software.