惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
V
Vulnerabilities – Threatpost
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
V
Visual Studio Blog
月光博客
月光博客
IT之家
IT之家
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Tailwind CSS Blog
罗磊的独立博客
S
SegmentFault 最新的问题
博客园 - 三生石上(FineUI控件)
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
量子位
V
V2EX
Jina AI
Jina AI
The GitHub Blog
The GitHub Blog
小众软件
小众软件
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
阮一峰的网络日志
阮一峰的网络日志
Recent Announcements
Recent Announcements
MongoDB | Blog
MongoDB | Blog
Y
Y Combinator Blog
H
Help Net Security
博客园_首页
Cyberwarzone
Cyberwarzone
T
Tenable Blog
A
Arctic Wolf
C
CERT Recently Published Vulnerability Notes
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
T
Threat Research - Cisco Blogs
aimingoo的专栏
aimingoo的专栏
Google DeepMind News
Google DeepMind News
博客园 - 叶小钗
C
Cyber Attacks, Cyber Crime and Cyber Security
美团技术团队
Attack and Defense Labs
Attack and Defense Labs
GbyAI
GbyAI
博客园 - 【当耐特】
Cloudbric
Cloudbric
NISL@THU
NISL@THU
B
Blog RSS Feed
K
Kaspersky official blog
Hugging Face - Blog
Hugging Face - Blog
P
Privacy International News Feed
博客园 - Franky
博客园 - 司徒正美
Microsoft Azure Blog
Microsoft Azure Blog
Apple Machine Learning Research
Apple Machine Learning Research
Webroot Blog
Webroot Blog
Microsoft Security Blog
Microsoft Security Blog

Google DeepMind News

Investing in multi-agent AI safety research DiffusionGemma: 4x faster text generation Fluid, natural voice translation with Gemini 3.5 Live Translate Measuring the impact of learning with AI in Sierra Leone and beyond Powering the future of robotics in Europe Introducing Gemma 4 12B: a unified, encoder-free multimodal model Strengthening Singapore’s AI Future: A New National Partnership Simulate real-world places with Project Genie and Street View Introducing Gemini Omni Gemini for Science: AI experiments and tools for a new era of discovery Making it easier to understand how content was created and edited Gemini 3.5: frontier intelligence with action Co-Scientist: A multi-agent AI partner to accelerate research How WeatherNext helped the National Hurricane Center better predict Hurricane Melissa’s historic landfall in Jamaica Fast-tracking genetic leads to reverse cellular aging Finding the molecular switches behind new infectious diseases Opening new paths in aging research Accelerating discovery of liver disease mechanisms Uniting biological toolkits for a new approach to ALS Uncovering repurposed medicines to fight liver fibrosis Google Antigravity We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks. Reimagining the mouse pointer for the AI era AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields Enabling a new model for healthcare with AI co-clinician Announcing our partnership with the Republic of Korea Decoupled DiLoCo: A new frontier for resilient, distributed AI training Partnering with industry leaders to accelerate AI transformation Gemini 3.1 Flash TTS: the next generation of expressive AI speech Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning Gemma 4: Byte for byte, the most capable open models Gemini 3.1 Flash Live: Making audio AI more natural and reliable Protecting people from harmful manipulation Lyria 3 Pro: Create longer tracks in more Google products Measuring progress toward AGI: A cognitive framework From games to biology and beyond: 10 years of AlphaGo’s impact Gemini 3.1 Flash-Lite: Built for intelligence at scale Nano Banana 2: Combining Pro capabilities with lightning-fast speed Gemini 3.1 Pro: A smarter model for your most complex tasks A new way to express yourself: Gemini can now create music Accelerating discovery in India through AI-powered science and education Gemini 3 Deep Think: Advancing science, research and engineering Accelerating Mathematical and Scientific Discovery with Gemini Deep Think Project Genie: Experimenting with infinite, interactive worlds D4RT: Teaching AI to see the world in four dimensions Veo 3.1 Ingredients to Video: More consistency, creativity and control Google's year in review: 8 areas with research breakthroughs in 2025 Gemma Scope 2: helping the AI safety community deepen understanding of complex language model behavior Google DeepMind supports U.S. Department of Energy on Genesis: a national mission to accelerate innovation and scientific discovery Gemini 3 Flash: frontier intelligence built for speed Improved Gemini audio models for powerful voice interactions Deepening our partnership with the UK AI Security Institute Strengthening our partnership with the UK government to support prosperity and security in the AI era FACTS Benchmark Suite: Systematically evaluating the factuality of large language models Engineering more resilient crops for a warming climate AlphaFold: Five years of impact Revealing a key protein behind heart disease How we’re bringing AI image verification to the Gemini app Build with Nano Banana Pro, our Gemini 3 Pro Image model Introducing Nano Banana Pro We’re expanding our presence in Singapore to advance AI in the Asia-Pacific region Start building with Gemini 3 A new era of intelligence with Gemini 3 Google Antigravity WeatherNext 2: Our most advanced weather forecasting model SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds Teaching AI to see the world more like we do How AI is giving Northern Ireland teachers time back Mapping, modeling, and understanding nature with AI Accelerating discovery with the AI for Math Initiative MedGemma: Our most capable open models for health AI development VaultGemma: The world's most capable differentially private LLM Bringing AI to the next generation of fusion energy Introducing Veo 3.1 and advanced capabilities in Flow How a Gemma model helped discover a new potential cancer therapy pathway Introducing the Gemini 2.5 Computer Use model Introducing CodeMender: an AI agent for code security Gemini Robotics 1.5 brings AI agents into the physical world Strengthening our Frontier Safety Framework Discovering new solutions to century-old problems in fluid dynamics Gemini achieves gold-medal level at the International Collegiate Programming Contest World Finals Using AI to perceive the universe in greater depth Image editing in Gemini just got a major upgrade Introducing Gemma 3 270M: The compact model for hyper-efficient AI How AI is helping advance the science of bioacoustics to save endangered species Genie 3: A new frontier for world models Rethinking how we measure AI intelligence Try Deep Think in the Gemini app AlphaEarth Foundations helps map our planet in unprecedented detail Aeneas transforms how historians connect the past Gemini 2.5 Flash-Lite is now stable and generally available Exploring the context of online images with Backstory Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad T5Gemma: A new collection of encoder-decoder Gemma models Introducing Gemma 3n: The developer guide AlphaGenome: AI for better understanding the genome Gemini Robotics On-Device brings AI to local robotic devices We’re expanding our Gemini 2.5 family of models Gemini 2.5: Updates to our family of thinking models How we're supporting better tropical cyclone prediction with AI
Behind “ANCESTRA”: combining Veo with live-action filmmaking
Kory Mathewson · 2025-06-13 · via Google DeepMind News

We partnered with Darren Aronofsky, Eliza McNitt and a team of more than 200 people to make a film using Veo and live-action filmmaking.

General summary

Google DeepMind partnered with Primordial Soup to produce "ANCESTRA" a short film premiering at the Tribeca Festival. The film combines live-action with video generated by Veo, Google's video generation model. Google DeepMind developed new Veo capabilities to enable personalization, precise motion matching, and blending of live-action and generative footage.

Summaries were generated by Google AI. Generative AI is experimental.

Bullet points

  • "ANCESTRA" is a short film combining live-action with Veo, Google DeepMind's video generation model, premiering at the Tribeca Festival.
  • Google DeepMind partnered with Primordial Soup to put generative AI in filmmakers' hands, pushing storytelling and tech boundaries.
  • Gemini, Imagen, and Veo were used to generate shots based on mood, color, and emotion, using photos as inspiration.
  • New Veo capabilities were developed for personalized video, precise motion matching, and blending live-action with generated footage.
  • Generative AI complements filmmaking, empowering artists to overcome limitations and create difficult or expensive scenes.

Summaries were generated by Google AI. Generative AI is experimental.

Explore other styles:

Today, Eliza McNitt’s short film, “ANCESTRA,” premieres at the Tribeca Festival. It’s the story of a mother, and what happens when her child is born with a hole in its heart. Inspired by the dramatic events of McNitt's own birth, the film portrays a mother's love as a cosmic, life-saving force.

This is the first of three short films produced in partnership between our team at Google DeepMind and Primordial Soup, a new venture dedicated to storytelling innovation founded by director Darren Aronofsky. Together, we founded this partnership to put the world’s best generative AI into the hands of top filmmakers, to advance the frontiers of storytelling and technology.

“ANCESTRA” combined live-action scenes with sequences generated by Veo, our state-of-the-art video generation model. McNitt described her experience working with our technology: "Veo is another lens through which I get to imagine the universe around me.”

To create “ANCESTRA”, Google DeepMind assembled a multidisciplinary creative team of animators, art directors, designers, writers, technologists and researchers who worked closely with more than 200 experts in traditional filmmaking and production, a live-action crew and cast, plus an editorial team, visual effects (VFX) artists, sound designers and music composers.

Bringing our most advanced generative models to the screen

While McNitt wrote the script for “ANCESTRA,” she worked with a storyboard artist to visualize the live-action scenes and collaborated with our team to generate imagery for sequences that could benefit from AI generation.

We used Gemini to develop our prompts, and used Veo and our image generation model, Imagen, to create a series of potential shots, organized by mood, color and emotion. Here’s a breakdown of how we planned and created the AI elements of the film:

  • Gemini: Our team uploaded photos taken by McNitt’s father of the day she was born, and asked Gemini to describe these photos in precise aesthetic detail. These descriptions became the prompts for creating new images and videos.
  • Imagen: We generated the film's key concept art, defining the overall look, style and mood. These images became the starting point for our videos.
  • Veo: We animated the generated images and wrote additional text prompts for guiding the action and movement to create the final shots.

Developing new Veo capabilities together

While Veo made it possible to generate scenes that combined live-action acting and generative footage of a realistic newborn baby, it also posed new challenges. For example, McNitt wanted the generated video to match the quality and color of her live-action scenes. She also needed to control the camera motion and subject matter of the generated video. To meet these challenges, we developed several new Veo capabilities to enable greater personalization, precise motion matching, and the ability to blend live-action and generative footage.

Personalized video generation

We aimed to generate videos that felt as intimate and personal as the story itself. For example, McNitt wanted to generate footage of a realistic-looking baby in utero, while controlling the art direction, composition and motion. So we fine-tuned an Imagen model to match the style of reference images. Then, we worked with Gemini to craft and refine prompts to generate realistic images of a baby in the womb. Finally, we turned those images into animated scenes using Veo’s image-to-video capability.

By fine-tuning an Imagen model, we maintained specific and consistent art direction between different scenes of the AI-generated baby.

A grid of four distinct generated images of a baby drifting in a dimly-lit, murky environment — her face with closed eyes, detail of her foot, back of the head, and chest.

Motion matched video generation

In one scene, McNitt wanted to take the viewer on a journey through the human body, eventually landing in the womb to show a baby being born via C-section. To follow this precise camera motion, we created a virtual, 3D model of a human body and recorded a draft shot of the scene by moving a virtual camera through this model. Then we used Veo to track the draft shot’s motion and generate new videos using that same movement. We guided the generated video with text prompts, until we achieved the shot McNitt had in mind.

McNitt mapped out her desired camera motion using a virtual model of the human body. Then we used Veo’s motion matching to generate a video with that same movement.

In another scene, McNitt wanted to show an array of organic holes closing up, alluding to the hole in the baby’s heart. So, we gave Veo reference videos of this motion and prompted it to motion match across different shots. Producing these sequences with just computer generated imagery (CGI) would have been complex and time-intensive, and it would have been difficult to control motion using text prompts alone. With Veo’s help, we could produce high-quality scenes in just a few minutes.

We gave Veo an input video with the desired motion. Then, Veo combines the reference motion with a text prompt to generate a new motion-matched scene.

Blending traditional filmmaking and generative video

Imagery of babies produced using traditional VFX runs the risk of looking uncanny, and it's challenging and time-consuming for directors to get the exact performance they have in mind. So, for the birth, we composed the actor’s performance and generated a realistic looking newborn to fit the scene. First, we gave Veo the live-action footage, a text prompt describing the scene, and a defined area for adding the baby. Then, using Veo’s “add object” capability, we generated the AI image of a baby into the live-action footage — keeping everything else consistent — and we refined the shot with traditional VFX and color grading.

We added a generated newborn baby to live-action footage and refined the final shot with VFX and color grading.

Adding generative video to traditional workflows

Many scenes in the film use multiple AI-generated images and videos that are seamlessly composed using traditional filmmaking workflows. For example, we created a scene showing complex textures on the inside of a recently hatched crocodile egg at sunset. To construct this shot, we combined multiple generated videos and images with traditional VFX compositing techniques.

This shot captures the point-of-view from inside a cracking crocodile egg, at sunset with the protective mother crocodile nearby. We used Veo and Imagen to generate the key visual elements, which were then seamlessly composited in a traditional VFX pipeline to bring this specific creative vision to life.

Partnering with the film industry to tell new stories

“ANCESTRA” is the first of three films we're making with Primordial Soup. Each film in this partnership is directed by an emerging filmmaker who is mentored by Darren Aronofsky and supported by our team.

Many amazing movies have been created with live-action filmmaking, CGI and VFX toolkits. Generative AI can complement existing creative and production workflows, empowering filmmakers to overcome practical limitations with difficult-to-capture or prohibitively expensive scenes.

By working with artists, we ensure that the tools we’re building are useful and rooted in the needs of professional filmmakers. Collaborating with visionaries like McNitt and Aronofsky helps us explore the creative potential of today's technologies and imagine what we could create next.