Gemma 4: A new, budget-focused model in Posit AI
ionychal · 2026-05-25 · via Hacker News - Newest: "AI"

Gemma 4 is now available in Posit Assistant via the Posit AI provider. It costs a tenth as much as Claude Sonnet 4.6 and less than a third as much as our current cheapest offering, Claude Haiku 4.5. While less capable than Haiku, it's a good fit for basic data analysis and quick agentic coding tasks in R and Python.

To use it, open Posit Assistant in RStudio or Positron and update to the most recent version of Posit Assistant when prompted. Then, select Gemma 4 in the model selector.

Meet Gemma 4

Gemma 4 26B A4B is a recent open-weights model release from Google. Until now, models of this size (small enough to run comfortably on high-end consumer laptops) were on our radar but not yet capable enough to drive an agent harness like Posit Assistant. This has changed in the last few months with releases like Gemma 4; this model is one of a couple of "small" LLM releases that have really caught our attention recently.

While capable and very cost-efficient, Gemma 4 is more "jagged" than the Claude models we currently serve as part of Posit AI. The model will sometimes complete a substantive agentic coding session with remarkable coherence, stringing together reasonable tool calls and never losing the larger thread of the conversation. Just as often, it will misinterpret your intent or lose the thread after a few turns. As with frontier models of a year ago, you will want to steer Gemma 4 actively and audit its code and output closely.

We recommend Gemma 4 for basic data analysis tasks. We do not recommend Gemma 4 for long-running agentic coding tasks.

Choosing a model

You can switch between available models at any time in Posit Assistant, including mid-conversation. So how should you choose? Generally, you’ll need to weigh cost against task complexity.

Model costs

At the time of writing, Posit AI provides access to four models, each with a "cost multiplier."

| Model | Description | Cost |
| --- | --- | --- |
| Gemma 4 | Fastest | 0.1x |
| Claude Haiku 4.5 | Fast | 0.33x |
| Claude Sonnet 4.6 | Balanced | 1x |
| Claude Opus 4.6 | Smartest | 1.67x |

Posit AI is $20/mo, $15 of which goes directly to model API costs. The cost multiplier refers to how quickly conversations will consume that $15.[1]

Users can choose between these four models at any time, as well as control the Thinking level. The default model with Posit AI is Claude Sonnet 4.6 with Medium Thinking. Among users who have opted in to help us improve the service, we've seen that a large majority stick with Sonnet.

Claude Sonnet or Opus will consume those $15 in credits much more quickly than Haiku or Gemma 4. Per token, Gemma 4 consumes credits at a tenth of the rate of Claude Sonnet and less than a third of the rate of Haiku. We’re currently working on features that will help users better understand which models to choose in which situations; budget-minded model choice will make those credits go further, but certain tasks require the smartest models available.
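To make the multipliers concrete, here is a minimal Python sketch of the arithmetic. The multipliers and the $15 credit figure come from this post; the per-conversation cost on Sonnet (`sonnet_cost_usd`) is a purely hypothetical input. The sketch also assumes a conversation uses the same number of tokens on every model, which will not hold exactly in practice.

```python
# Cost multipliers as listed above, relative to Claude Sonnet 4.6 (1x).
COST_MULTIPLIER = {
    "Gemma 4": 0.1,
    "Claude Haiku 4.5": 0.33,
    "Claude Sonnet 4.6": 1.0,
    "Claude Opus 4.6": 1.67,
}

# Portion of the $20/mo subscription that goes to model API costs.
MONTHLY_CREDITS_USD = 15.0


def conversations_per_month(model: str, sonnet_cost_usd: float) -> float:
    """Estimate how many identical conversations the monthly credits cover.

    sonnet_cost_usd is a HYPOTHETICAL figure: what one such conversation
    would cost on Claude Sonnet 4.6. It assumes equal token usage across
    models, which is a simplification.
    """
    cost = sonnet_cost_usd * COST_MULTIPLIER[model]
    return MONTHLY_CREDITS_USD / cost


if __name__ == "__main__":
    # 0.50 USD per Sonnet conversation is an illustrative value only.
    for model in COST_MULTIPLIER:
        n = conversations_per_month(model, sonnet_cost_usd=0.50)
        print(f"{model:<18} ~{n:.0f} conversations")
```

Under these assumptions, the same credits stretch roughly ten times further on Gemma 4 than on Sonnet, whatever the actual per-conversation cost turns out to be.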

Model capability

From our perspective, a few factors should influence model choice most heavily. For one, longer conversations (often a proxy for greater task complexity) require more intelligence. If you’ll be asking Posit Assistant to import and tidy some data, or to make a simple refactor, a smaller model will do. In contrast, implementing entire package features or autonomously integrating many data sources calls for larger, more expensive models.

In a similar vein, exploratory tasks like exploratory data analysis (EDA) may not require the most capable models. This has more to do with Posit Assistant’s design than with the intelligence the tasks themselves require. When carrying out exploratory tasks, Posit Assistant is prompted to work more closely with the user, launching only a few tool calls at a time so the user can keep up. This is in contrast to “deliverable” tasks, like implementing a feature in a package or writing a report, where the agent will work to completion. Broadly, smaller models should be supervised more closely, and exploratory tasks loop in the user more often.

As a rule of thumb:

| Task | Model |
| --- | --- |
| Importing, tidying, visualizing, and summarizing data. Information retrieval. | Gemma 4 or Haiku 4.5. |
| Integrating multiple data sources, building Shiny apps. | Haiku 4.5 or Sonnet 4.6. |
| Complex package development and software engineering over long contexts. | Sonnet 4.6 or Opus 4.6. |
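The rule of thumb above can be written down as a small lookup. This is only an illustrative sketch: the `TaskKind` names and the `suggest_model` helper are hypothetical and are not part of Posit Assistant, where you pick the model yourself in the model selector.

```python
from enum import Enum


class TaskKind(Enum):
    # Hypothetical category names summarizing the three rule-of-thumb rows.
    BASIC_ANALYSIS = "importing, tidying, visualizing, summarizing, retrieval"
    INTEGRATION = "integrating multiple data sources, building Shiny apps"
    COMPLEX_ENGINEERING = "complex package development over long contexts"


# Suggested models per task kind, cheaper option first.
RULE_OF_THUMB = {
    TaskKind.BASIC_ANALYSIS: ["Gemma 4", "Claude Haiku 4.5"],
    TaskKind.INTEGRATION: ["Claude Haiku 4.5", "Claude Sonnet 4.6"],
    TaskKind.COMPLEX_ENGINEERING: ["Claude Sonnet 4.6", "Claude Opus 4.6"],
}


def suggest_model(task: TaskKind, prefer_cheaper: bool = True) -> str:
    """Return the cheaper or the more capable of the two suggested models."""
    options = RULE_OF_THUMB[task]
    return options[0] if prefer_cheaper else options[-1]


if __name__ == "__main__":
    print(suggest_model(TaskKind.BASIC_ANALYSIS))
    print(suggest_model(TaskKind.COMPLEX_ENGINEERING, prefer_cheaper=False))
```

Starting with the cheaper option and switching up mid-conversation if the model loses the thread fits the advice above, since models can be changed at any time.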

Looking forward

We’re very excited to offer Gemma 4 as part of Posit AI. As we wrote last week, we’re working hard to help users get as much out of those $15 in credits as possible. Offering models that are on the frontier of cost vs. intelligence is part of that push.

That said, we expect small models to continue to improve. The near future will likely bring even more capable models at a price point similar to Gemma 4's. We plan to evaluate these models as they come out, with the goal of eventually offering a similarly inexpensive model that can handle an even larger share of data science tasks.

[1] While evaluating model providers, we could not find one that would sell us Gemma by the token while meeting our latency, quality, and cost expectations. We therefore serve Gemma ourselves, paying by the GPU-hour. This allows us to provide Gemma 4 tokens faster than any provider we evaluated, at a price point on par with other model providers.