惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
WordPress大学
WordPress大学
小众软件
小众软件
Cloudbric
Cloudbric
AWS News Blog
AWS News Blog
腾讯CDC
量子位
人人都是产品经理
人人都是产品经理
大猫的无限游戏
大猫的无限游戏
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
V
Vulnerabilities – Threatpost
Scott Helme
Scott Helme
Hugging Face - Blog
Hugging Face - Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
The Hacker News
The Hacker News
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
IT之家
IT之家
Jina AI
Jina AI
Attack and Defense Labs
Attack and Defense Labs
S
SegmentFault 最新的问题
Simon Willison's Weblog
Simon Willison's Weblog
The Cloudflare Blog
阮一峰的网络日志
阮一峰的网络日志
T
Tailwind CSS Blog
Last Week in AI
Last Week in AI
博客园 - 【当耐特】
Google Online Security Blog
Google Online Security Blog
美团技术团队
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
罗磊的独立博客
L
LINUX DO - 最新话题
博客园 - Franky
博客园 - 叶小钗
Apple Machine Learning Research
Apple Machine Learning Research
The Last Watchdog
The Last Watchdog
J
Java Code Geeks
AI
AI
C
Cisco Blogs
酷 壳 – CoolShell
酷 壳 – CoolShell
C
Cyber Attacks, Cyber Crime and Cyber Security
Cisco Talos Blog
Cisco Talos Blog
博客园 - 三生石上(FineUI控件)
雷峰网
雷峰网
Help Net Security
Help Net Security
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
云风的 BLOG
云风的 BLOG
I
Intezer
S
Securelist

Thoughts while building

OpenFreeMap survived 100,000 requests per second Investigating AI Manipulation in Viral Chinese Paraglider Video The curious case of the packageManager field in package.json TIL: Get file list from eslint, biome and ruff TIL: Vercel AI SDK - the bloat king TIL: CSS-in-JS modules for SolidJS TIL: Auth provider updates Understanding Round Robin DNS First month of OpenFreeMap Is there now a generation of users who never worked with files? Comparing Auth from Supabase, Firebase, Auth.js, Ory, Clerk and others
2nd stage of the AI age: AI Labs Need Your Data
Zsolt Ero · 2024-12-19 · via Thoughts while building

First stage of the "AI age" was getting inference costs down to "too cheap to meter".

Cost of AI models:

  • Training time: a one-time, 3-month-long process costing up to a billion dollars

  • Inference time: when you ask questions in your favorite chat app. The cost of this is converging to a fraction of a cent.

Ilya Sutskever announced in a NeurIPS talk, that AI labs are running out of data, "The fossil fuel of AI".

"We have but one internet" - the data on the internet is limited (also meaning that they've already scraped whatever they could).

In light of this, some of the recent announcements by the companies behind the AI labs make immediate sense.

Microsoft is offering Copilot for free, previously $20 per month (still a loss leader even at that price!). The catch: in all plans under $19 per month, they are training on your data by default.

Google releases Gemini 2.0 Flash, one of the most advanced models ever, with real-time audio and video-streaming input! And if that's not enough, you get it entirely free on Google AI Studio and through the API!

The catch? When you use the Free version, Google uses your data for model training. At least they clearly point out that you should not submit "sensitive, confidential, or personal information to the Unpaid Services".

(Also, can anyone point out how to access the non-Free version? I really tried but couldn't find how. Do you need a Google Cloud account for that?)

OpenAI is offering previous premium features, like Search, to all users, even those on the Free plan. They really want to see Google dance. (Meanwhile, Google makes them dance with the Veo 2 video model - it's really fun to watch.)

They also now give Free users a monthly allowance of "Advanced voice mode" and have even started 1-800-ChatGPT, a free phone line where you can talk with ChatGPT!

The catch? You guessed it - for all consumer plans (aka not Teams or Enterprise), your data is used for model training. Ironically, this seems to include even the new $200/month Pro plan!

One company surprisingly missing from the last 2 weeks' series of announcements was Anthropic, which happens to be the only company that is not training on your data.

They seem to be taking a different direction compared to other players, and they also happen to have the best model for coding! Claude Sonnet is actually so good that Microsoft was forced to start offering it in GitHub Copilot; otherwise, developers would have left the platform.

It's wonderful to watch a dance of titans, who are outbidding each other while offering loss-making products for free. Just remember to stay aware of how they handle your data and which providers/plans actually respect your privacy.

Also, to be entirely honest, there might be other reasons for offering these products for free. For example, by offering CoPilot for free, Microsoft overnight decided the fate of dozens of "coding assistance" startups. The chances of a new "Cursor" being started in 2025 just got quite a bit slimmer.

I’ve also written 3 TIL posts since the last newsletter.

TIL: Vercel AI SDK - the bloat king

TIL: Vercel AI SDK - the bloat king

I was looking for a simple universal JS lib that could handle OpenAI/Anthropic/Google responses without having to install each client lib. Considering all these do is JSON requests, those client libs are actually quite big - for example, OpenAI is 87 kB on its own (bundled with Vite), though Google is only 25 kB.

Discussion about this post

Ready for more?