惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

W
WeLiveSecurity
D
DataBreaches.Net
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
T
The Exploit Database - CXSecurity.com
D
Darknet – Hacking Tools, Hacker News & Cyber Security
腾讯CDC
PCI Perspectives
PCI Perspectives
阮一峰的网络日志
阮一峰的网络日志
S
Security Archives - TechRepublic
Hugging Face - Blog
Hugging Face - Blog
U
Unit 42
IT之家
IT之家
T
Troy Hunt's Blog
P
Proofpoint News Feed
www.infosecurity-magazine.com
www.infosecurity-magazine.com
F
Full Disclosure
V
V2EX
Stack Overflow Blog
Stack Overflow Blog
C
Comments on: Blog
V
Vulnerabilities – Threatpost
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
V
V2EX - 技术
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
N
News | PayPal Newsroom
MyScale Blog
MyScale Blog
Google DeepMind News
Google DeepMind News
Application and Cybersecurity Blog
Application and Cybersecurity Blog
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
李成银的技术随笔
P
Privacy & Cybersecurity Law Blog
大猫的无限游戏
大猫的无限游戏
V
Visual Studio Blog
T
ThreatConnect
WordPress大学
WordPress大学
Security Latest
Security Latest
C
Cybersecurity and Infrastructure Security Agency CISA
Recent Announcements
Recent Announcements
Google DeepMind News
Google DeepMind News
SecWiki News
SecWiki News
Recorded Future
Recorded Future
小众软件
小众软件
K
Kaspersky official blog
T
Tor Project blog
Last Week in AI
Last Week in AI
GbyAI
GbyAI
人人都是产品经理
人人都是产品经理
Jina AI
Jina AI
S
SegmentFault 最新的问题
MongoDB | Blog
MongoDB | Blog
Simon Willison's Weblog
Simon Willison's Weblog

Interesting Engineering

Pentagon’s second UFO files reveal aerial sightings over conflict zones, witness accounts China’s perovskite-silicon solar cell retains 90% efficiency after 1,000 hours Nine US Navy destroyers that carry laser weapons for drone defense China tests humanoid robots in tea farms before the 2026 World Robot Games Heavy-duty robot takes over hazardous inspections at UAE gas plant Havoc Spear: US unveils new air-launched cruise missile with 460-mile-range 100 hours of stability: New hydrogen catalyst uses ‘hidden oxygen’ to ditch precious metals Quantum sensors beat hyped computers to the real world by measuring invisible fields UK military deploys low-cost laser-guided rockets to destroy drones with precision defense Russian trucks loaded with ballistic missiles seen in action during nuclear drill US firm introduces portable 1MW hydrogen generator for off-grid power Rapidly reproducing mutant ‘super pigs’ found in Fukushima nuclear disaster zone Stratolaunch aces latest Mach 5+ hypersonic test flight for US Missile Defense Agency US nuclear firm advances 4th-gen 45 MWth Kronos modular reactor with key clearance UK’s Humanoid partners with Bosch to mass-produce HMND robots for industries AGIBOT’s humanoid robot steals the show with dance, calligraphy at cultural event Science Archives - Interesting Engineering Cambridge team makes ‘new atlas’ to help find critical rare earth metal deposits Eight tech giants deploy autonomous fleets across Singapore’s large-scale public test bed New Eurofighter jet with advanced radar, more upgrades showcased, flight testing soon US advances drone warfare, new interceptors, low-cost weapons to tackle aerial threat US to build first quantum wafer foundry with IBM to scale next-gen computing Starship V3: World’s tallest rocket misses launch after SpaceX scrub at T-40 seconds Skin-like autonomous computing patch maps fatal heart rhythms with 99.6% accuracy China’s new lithium battery reaches 451 Wh/kg with 3-minute charging and 700 cycles Meta settles major Kentucky school lawsuit over alleged teen social media addiction US launches Minuteman III missile; test verifies ICBM’s propulsion, guidance, re-entry systems Human gut chip reveals three hidden drivers behind IBD damage and cancer risk China builds automated coal-chemical hub in Xinjiang to reduce oil reliance New acoustic metasurface creates bass-heavy sound bubbles without headphones UK firm to extract rare earths from industrial waste using modular refining model China’s solid-state EV battery giant validates 400 Wh/kg cell with 1,100 charge cycles Hybrid-electric eVTOL could fly troops and military supplies across 1,000 miles US bolsters naval warfare, submarines loaded with Navy SEALs work with unmanned vehicles Depleted uranium being added to Russian missile warheads, Ukraine alleges US firm successfully completes second hypersonic capsule reentry for defense payloads Advanced recon drones to support US Army scouting and base security in Europe Physicists run one million trials to show bizarre ‘negative time’ quantum effect is real Japan begins nuclear waste site survey on remote Island 1,180 miles from Tokyo Video: US firm debuts 5.7-pound throwable recon robot with live thermal vision Video: Helios humanoid robot brings a four-armed design for in-orbit missions Bill Gates-backed TerraPower partners with Hyundai for next-gen 345-MW nuclear reactor Modular 20 MW Orion electrolyzer to accelerate industrial hydrogen scaling Video: New robot dog hunts tiny gas leaks at Norway’s massive carbon storage hub Spanish Navy tests drones controlled from helicopter in boat chase drill US: Contaminated nuclear site threatened by fast-moving California wildfire High-precision laser spectroscopy confirms proton is smaller than expected, at 0.84 fm Low-parasitic-inductance module cuts converter footprint for EVs and solar inverters NATO integrates radars, kinetic interceptors to hit target; boost counter-drone lethality Tesla inches one step closer to achieve 100 GW US solar manufacturing ambition: Report New erase-and-reprint resin survives 10 cycles of high-precision 3D printing New electronic warfare system can locate enemy drones, radars without sending signals Alibaba unveils 128-chip server system for autonomous agents as China moves past NVIDIA Perovskite solar cells hit 24.3% efficiency with new 10-minute vacuum process Perovskite solar cells hit 24.3% efficiency with new 10-minute vacuum process 80-year-old geometry mystery cracked by OpenAI using deep number theory Elon Musk could become first trillionaire as SpaceX targets historic $2 trillion IPO US spends $40M to rebuild Cold War-era Arctic radar for missile, low-flying threats ‘Like a flowing material’: Robot swarm uses physics, not commands to self-organize 3D-sensing technology could improve self-driving cars and robotic surgery China’s new robotic hand combines hybrid actuation for smarter robot manipulation DARPA`s orbital robotic servicing satellite set for 2026 launch US firm receives contract for manufacturing fully indigenous permanent magnets for defense use NextSilicon’s Spectra supercomputer achieves full acceptance at US’ Sandia National Laboratories Russia starts nuclear drill with submarines, 7,800 types of equipment, weapons and 65,000 troops Airbus triples compute power with supercomputer upgrade for next-gen aircraft design Hypersonic ramjet engine designing time cut from months to seconds by GE Aerospace NVIDIA hand-delivers first 1.2 TB/s Vera CPUs to OpenAI, Anthropic, and SpaceX World’s most powerful neutron source unlocks crystals 100x smaller with 2.8 MW boost Pentagon taps US firm for coordinated attack by intelligent, low-cost drone swarms Ford’s ‘breakthrough’ EV motor built using 100% recycled magnets aces durability test Engineered microbes turn biodiesel waste into plastic material in 79-gallon trial run NASA satellite to test space ‘gas station’ tech for Moon and Mars trips Chinese firm pushes humanoid robot intelligence forward with 300 FPS control speed US Air Force’s next-gen combat drones to get GE426 engines for enhanced performance New digital system deciphers 3,500-year-old Hittite script with 90% accuracy Solar thermochemical reactor splits CO2 into syngas feedstock for plastics at 2,552 °F First-in-US: Gatsby’s humanoid robot performs home cleaning service for client Sweden picks 4,390-ton, 400 ft French FDI frigate for blue-water fleet expansion 10x tougher bio-inspired ceramics survive 1112°F temps for aerospace and beyond Metallium awarded Phase II SBIR contract for recovering critical minerals from e-waste New heat-pressed silk material outperforms wood, rivals Kevlar and carbon fiber New US turret fires 54 barrels in 360 degrees at drone swarms using acoustic sensors End of keyword search? Google introduces AI agents that think, track, and act NASA tests 80-pound student-built robot designed to mine soil for Artemis moon bases Comet autonomous warship packs missile strikes, 45-knot speed and 10,000-pound payload Google claims new Gemini 3.5 Flash runs 4x faster than rival frontier models 3D-printed artificial egg hatches live chicks in lab test China activates world’s first offshore wind-powered underwater data center Mobile plant to extract lithium from 300-million-year-old brines for EV batteries Smart flight system lets drones avoid obstacles instantly and fly more efficiently Floating offshore solar farms produce 12% more power than land-based panels New marine-based foam offers sustainable solution for automotive manufacturing Air Force explores structural conversion of oil rigs for orbital-class booster recovery US firm’s NOS Security combines drones, robots, cyber defense for nuclear plant safety World’s first humanoid robot auction to debut at China’s biggest shopping event 75,000 miles up: China’s SMILE satellite launches to map Earth’s ‘invisible shield’ 75,000 miles up: China’s SMILE satellite launches to map Earth’s ‘invisible shield’ XPENG launches China’s first mass-produced robotaxi to challenge Tesla’s FSD Ramjet engine for Mach 5-speeding hypersonic aircraft tested at 1,832°F by Japan
Google rolls out Gemini Omni Flash for autonomous video creation across apps
Neetika Walt · 2026-05-23 · via Interesting Engineering

Google has started rolling out Gemini Omni Flash, its new multimodal AI model that can generate and edit videos using text, images, audio and video inputs. The rollout follows the model’s announcement during Google I/O 2026 and marks the point where users can now actively use the system inside the Gemini app, Google Flow and YouTube Shorts.

The company says the model is designed to combine reasoning and creative generation in a single system, allowing users to build and modify video content through natural conversation.

With Gemini Omni Flash, users can prompt the model to create videos from scratch or modify existing clips step by step. Each instruction builds on the previous one, allowing continuous refinement of scenes without breaking continuity. Google says this helps maintain consistency in characters, objects and environments across edits, even as the video changes through multiple iterations.

The model also supports multi-input workflows, where users can combine different types of inputs such as text prompts, images, video clips and audio references. This allows a single output video to be shaped using multiple reference points instead of relying on a single prompt. Google says the system is built to understand how these inputs relate to each other and produce a coherent final scene.

The rollout is part of Google’s broader push to integrate generative AI into its consumer ecosystem, especially platforms focused on short-form video creation. YouTube Shorts and the YouTube Create app are among the first platforms where Omni Flash capabilities are being introduced, signalling a tighter connection between AI generation tools and content creation pipelines.

The company also says all outputs generated through the system will include SynthID watermarking for identification of AI-generated content.

Conversational video editing

Gemini Omni Flash allows users to edit videos using natural language commands instead of traditional editing tools. Users can describe changes such as altering environments, adding objects or changing actions within a scene, and the model updates the video accordingly while preserving overall structure.

The system is designed to maintain visual continuity across edits, ensuring that characters and objects remain consistent as changes are made over multiple steps. Google says this makes the editing process more iterative and flexible compared to conventional video production tools.

The model also draws on Gemini’s broader world knowledge to improve realism in generated content. It uses this understanding to simulate physical interactions such as motion, lighting and environmental effects more accurately, according to Google.

From prompts to production

Google has positioned Gemini Omni Flash as part of a wider shift toward multimodal AI systems that can handle creation and reasoning together. The model is designed to process multiple input formats and generate output video that reflects combined instructions rather than isolated prompts.

The company says the goal is to reduce the gap between idea and execution, allowing users to move from concept to finished video using a single conversational interface. Over time, Google plans to expand output formats beyond video, with support for images and audio also planned for future updates.

The rollout of Gemini Omni Flash is currently limited to select subscription tiers in the Gemini app, with broader access expected as the deployment expands.

The Blueprint

Get the latest in engineering, tech, space & science - delivered daily to your inbox.

With over a decade-long career in journalism, Neetika Walter has worked with The Economic Times, ANI, and Hindustan Times, covering politics, business, technology, and the clean energy sector. Passionate about contemporary culture, books, poetry, and storytelling, she brings depth and insight to her writing. When she isn’t chasing stories, she’s likely lost in a book or enjoying the company of her dogs.