DiffusionGemma is Google’s fastest AI yet, but it comes with a big trade-off
Shimul Sood · 2026-06-11 · via Android Authority

TL;DR

  • DiffusionGemma writes a whole chunk of text in one go and then keeps polishing it rather than building it word by word.
  • Google says it can be up to 4x faster, hitting 1,000+ tokens per second on an NVIDIA H100 and around 700 on an RTX 5090, thanks to parallel processing.
  • Output quality is still inferior to Gemma 4, so it’s more of an experimental tool than a finished product.

Google has released DiffusionGemma, an experimental AI model that takes a very different approach to how most chatbots generate text today. Instead of writing one word after another in a strict sequence, it generates a whole block of text at once and then keeps refining it until it becomes readable. The idea is to push for speed and hardware efficiency, even if it means giving up some polish in the final output.

[Image: DiffusionGemma compared with other Gemma models]

This new AI model is open-sourced under the Apache 2.0 license and is aimed at developers and researchers rather than everyday users. To understand why this matters, it helps to look at how most large language models work. Systems like Google’s Gemma 4 generate text step by step, one token at a time. Each new token depends on the ones that came before it, which makes the process inherently sequential and harder to speed up.
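To make the sequential bottleneck concrete, here is a minimal sketch of a token-by-token loop. The `toy_next_token` function is a hypothetical stand-in for a model's forward pass, not anything from Gemma itself.

```python
from typing import Callable, List


def generate_autoregressive(
    next_token: Callable[[List[str]], str],
    prompt: List[str],
    max_new_tokens: int,
) -> List[str]:
    """Append one token per step; each step sees only the tokens before it."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        # Step t cannot begin until step t-1 has produced its token.
        tokens.append(next_token(tokens))
    return tokens


def toy_next_token(context: List[str]) -> str:
    # Hypothetical stand-in for a model call: cycles through a fixed phrase.
    phrase = ["the", "cat", "sat", "down"]
    return phrase[len(context) % len(phrase)]


out = generate_autoregressive(toy_next_token, ["the"], 3)
print(out)
```

The loop body is the part that cannot be parallelized: every call needs the full output of the previous one.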

DiffusionGemma, on the other hand, starts with a full canvas of random tokens, essentially noisy, unreadable text, and then repeatedly cleans it up in multiple passes. With each pass, the output becomes more structured and coherent until it settles into a final response. A simple way to picture it is that traditional models write, while DiffusionGemma drafts and edits everything at once.
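The draft-and-edit idea can be sketched with a toy loop. This is only an illustration of the pattern described above, not Google's algorithm: `toy_denoise` is a hypothetical placeholder that always proposes a fixed sentence, whereas a real model would predict tokens from the noisy canvas with a neural network.

```python
import random
from typing import List

VOCAB = ["the", "cat", "sat", "on", "mat", "a", "dog"]
TARGET = ["the", "cat", "sat", "on", "the", "mat"]  # what the toy "model" converges to


def toy_denoise(canvas: List[str]) -> List[str]:
    """Hypothetical stand-in for one model pass: proposes every position at once."""
    return list(TARGET)


def refine(length: int, passes: int, seed: int = 0) -> List[List[str]]:
    rng = random.Random(seed)
    # Start from a full canvas of random tokens (the "noise").
    canvas = [rng.choice(VOCAB) for _ in range(length)]
    history = [list(canvas)]
    for step in range(1, passes + 1):
        proposal = toy_denoise(canvas)
        # Accept the proposal at a growing number of randomly chosen positions;
        # the final pass accepts all of them.
        keep = round(length * step / passes)
        for i in rng.sample(range(length), keep):
            canvas[i] = proposal[i]
        history.append(list(canvas))
    return history


history = refine(len(TARGET), passes=4)
for step, canvas in enumerate(history):
    print(step, " ".join(canvas))
```

Each printed row is the whole block after one pass, so the text gets cleaner everywhere at once instead of growing from left to right.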

That shift has a direct impact on performance. Google claims DiffusionGemma can be up to four times faster than standard autoregressive models in low-concurrency scenarios, where a single user or process has the GPU to itself. On high-end hardware, the company cites more than 1,000 tokens per second on an NVIDIA H100 and over 700 tokens per second on an RTX 5090.
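For a rough sense of what those rates mean in practice, the arithmetic below converts tokens per second into wait time for a 500-token reply. The 500-token length is an arbitrary example, and the baseline rate is an assumption made by dividing the H100 figure by four; Google's "up to 4x" claim may not refer to that exact comparison.

```python
def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a steady throughput."""
    return num_tokens / tokens_per_second


REPLY_TOKENS = 500  # arbitrary example length

h100_s = seconds_to_generate(REPLY_TOKENS, 1000)      # rate cited for an NVIDIA H100
rtx5090_s = seconds_to_generate(REPLY_TOKENS, 700)    # rate cited for an RTX 5090
# Assumed baseline: an autoregressive model running at a quarter of the H100 rate.
baseline_s = seconds_to_generate(REPLY_TOKENS, 1000 / 4)

print(f"H100: {h100_s:.2f}s, RTX 5090: {rtx5090_s:.2f}s, assumed baseline: {baseline_s:.2f}s")
```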

Under the hood, DiffusionGemma is a 26-billion-parameter Mixture-of-Experts model, but it does not activate all of that at once. Only about 3.8 billion parameters are used during inference, helping keep compute requirements manageable. Google says this makes it possible to run the model on high-end consumer GPUs when quantized, with a memory footprint of around 18GB VRAM.
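A quick back-of-the-envelope calculation shows how those figures relate. The article does not say which quantization format Google used, so the bit widths below are assumptions, and the estimate counts weights only. In a Mixture-of-Experts model, all expert weights typically still have to sit in memory even though only a fraction is active for a given computation.

```python
TOTAL_PARAMS = 26e9    # total parameters, per the article
ACTIVE_PARAMS = 3.8e9  # parameters used during inference, per the article
REPORTED_VRAM_GB = 18  # quantized footprint, per the article


def weight_gb(params: float, bits_per_param: float) -> float:
    """Storage for the weights alone, in decimal gigabytes."""
    return params * bits_per_param / 8 / 1e9


def implied_bits(vram_gb: float, params: float) -> float:
    """Bits per parameter if the whole footprint were weights (an upper bound)."""
    return vram_gb * 1e9 * 8 / params


active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

for bits in (16, 8, 4):  # assumed precisions, not confirmed by Google
    print(f"{bits}-bit weights: {weight_gb(TOTAL_PARAMS, bits):.0f} GB")
print(f"active share of parameters: {active_fraction:.1%}")
print(f"implied bits per parameter at 18 GB: {implied_bits(REPORTED_VRAM_GB, TOTAL_PARAMS):.2f}")
```

At 16 bits the weights alone would need about 52 GB, which is why quantization is what brings the model within reach of a single consumer GPU.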

Where things get more interesting is how the model actually generates text. It can produce up to 256 tokens in parallel in a single step, and each token can attend to every other token in the block. That gives the model a global view of the output instead of a strictly linear one.
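The difference between a linear view and a global view can be shown with attention masks. This sketch is an illustration of the general concept, not DiffusionGemma's implementation: it compares a causal mask, where a token sees only itself and earlier positions, with a full mask over a 256-token block.

```python
from typing import List

BLOCK_SIZE = 256  # tokens generated in parallel, per the article


def causal_mask(n: int) -> List[List[bool]]:
    """Row i may attend to column j only if j is at or before i."""
    return [[j <= i for j in range(n)] for i in range(n)]


def full_mask(n: int) -> List[List[bool]]:
    """Every position may attend to every position in the block."""
    return [[True] * n for _ in range(n)]


def visible_pairs(mask: List[List[bool]]) -> int:
    """Count the (query, key) pairs the mask allows."""
    return sum(sum(row) for row in mask)


causal_pairs = visible_pairs(causal_mask(BLOCK_SIZE))
full_pairs = visible_pairs(full_mask(BLOCK_SIZE))
print(causal_pairs, full_pairs)
```

With the full mask, the first token in the block can take the last one into account, which a left-to-right model cannot do.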

This makes it better suited for structured or rule-based tasks. For example, it can help fill in missing sections of code, complete structured formats like JSON, work through logic-heavy problems such as Sudoku-style puzzles, or handle mathematical patterns where consistency across the whole output matters more than sentence-by-sentence flow. Because it sees the entire block at once, it can also correct contradictions within the same generation cycle, rather than waiting for a later token to fix them.
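As a toy illustration of the JSON case, the sketch below keeps the known tokens fixed and fills only the empty slots in one pass. The template, the `MASK` marker, and the `toy_fill` proposals are all hypothetical; a real model would predict the missing values itself.

```python
import json
from typing import List, Optional

MASK = None  # marks a slot the model is asked to fill

# Known structure is fixed; only the two values are missing.
template: List[Optional[str]] = ['{', '"name"', ':', MASK, ',', '"age"', ':', MASK, '}']


def toy_fill(canvas: List[Optional[str]]) -> List[str]:
    """Hypothetical stand-in for a model pass: fills every masked slot at once."""
    proposals = {3: '"Ada"', 7: "36"}  # made-up values for illustration
    return [proposals[i] if tok is MASK else tok for i, tok in enumerate(canvas)]


filled = toy_fill(template)
parsed = json.loads("".join(filled))  # the result must still be valid JSON
print(parsed)
```

Because the brackets, keys, and separators are never touched, the surrounding structure stays valid while the gaps are filled together.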

But there is a catch, and Google is upfront about it. DiffusionGemma does not match the output quality of its standard Gemma 4 models. The writing can be less stable, less refined, and not as reliable for complex or nuanced responses. So, you get speed but lose some polish.

[Image: DiffusionGemma comparison]

That is why Google is positioning it as an experimental tool. It is designed for scenarios where responsiveness matters more than perfection, such as real-time AI tools, inline writing or coding assistants, and fast iterative workflows where users care more about instant feedback than final-quality text.

DiffusionGemma, then, is not meant to replace existing Gemini or Gemma models. It is a speed-first experiment that trades output quality for efficiency and responsiveness. But it also hints at a different direction for AI text generation, where models do not just predict the next word, but generate and refine entire blocks of text simultaneously.
