The pulse of generative image & video AI.

Twice a week, the most important stories in image and video generation - new models, notable research, and meaningful product releases - distilled into a 2-minute read. No hype, no filler.

Free. Unsubscribe any time. No spam, ever.

Archive

May 23, 2026Video

Google Genie 3 Turns Street View Pins Into Walkable AI-Generated Worlds

Google DeepMind has connected its Genie 3 world model to Street View data, letting users drop a map pin and receive an interactive, explorable environment generated from real-location imagery. The demo is immediately interesting for games and creative tools, but the longer-term intent appears to be training AI agents and robotics systems on grounded spatial data.

May 22, 2026Video

Beeble AI Launches Canvas, a Node-Based AI Compositor for VFX Pipelines

Beeble AI has released Canvas, a node-based visual compositing platform that integrates AI video models, traditional compositing tools, and workflow automation in a single environment. The system includes native access to Beeble's SwitchX and SwitchLight models and supports external generative models for batch iteration across multiple shots.

May 22, 2026Video

Google Launches Gemini Omni for Video Generation and Editing

Announced at Google I/O, Gemini Omni Flash is a multimodal model that takes text, images, audio, and video as inputs and generates or edits video through natural conversation. Google says the model draws on its knowledge base of science, history, and cultural context to improve physical plausibility in generated footage. A SynthID watermark is embedded in all output.

May 22, 2026Image

OpenAI Adds SynthID Watermarking Alongside C2PA to All Generated Images

OpenAI announced it will embed Google's SynthID watermark in images generated through ChatGPT, Codex, and its API, layering it on top of the C2PA content credentials it already applies. The two systems are designed to complement each other: C2PA carries detailed provenance metadata while SynthID provides a signal that can survive format changes and metadata stripping.

May 22, 2026Multimodal

Google Expands SynthID and C2PA Detection to Chrome and Search

Google is bringing AI content verification to Chrome and several Search surfaces, including Lens and AI Mode, letting users check whether an image carries SynthID markers or C2PA content credentials directly in the browser. The expansion, announced at Google I/O, coincides with OpenAI and Nvidia both adopting SynthID, pushing the watermarking standard toward broader industry use.

May 22, 2026Image

Google Pics Brings Comment-Style AI Image Editing to Workspace

Google launched Pics, a new AI image generation and editing app for Workspace, with an interface that lets users click on specific parts of an image and annotate what they want changed - similar to leaving a comment in a Google Doc. The app is powered by Gemini and Google's Nano Banana 2 image model.

May 22, 2026Video

Google's Genie 3 World Model Connects to Street View for Location-Grounded AI Environments

Google DeepMind has integrated its Genie 3 world model with Street View imagery, allowing users to drop a pin on a map and generate a walkable, interactive AI environment based on that real-world location. The project makes nearly two decades of Street View data available as grounding material for generative world-building.

May 22, 2026Video

YouTube Shorts Gets AI Remix Feature Powered by Gemini Omni

Google has added a remix option to YouTube Shorts that lets viewers restyle clips or insert themselves into other people's videos using Gemini Omni. Creators can disable the feature for their content, and all remixed output carries a SynthID watermark.

May 22, 2026Video

First AI-Produced Feature Film Screens at Cannes, Made With Higgsfield

Hell Grind, a 90-minute sci-fi heist film produced entirely with Higgsfield AI, is screening at the Cannes Film Festival. Made with a team of 15 filmmakers who generated over 16,000 clips to produce 253 final shots, the production cost under $500,000 - a figure the company contrasts with a claimed $50 million equivalent using traditional methods.

May 22, 2026Video

Take It Down Act Takes Effect, Requiring 48-Hour Removal of Sexual Deepfakes

A US federal law requiring social media platforms to remove nonconsensual intimate imagery - including AI-generated deepfakes - within 48 hours of a valid request has come into force. Experts have raised concerns that the takedown mechanism could be misused for censorship, and that practical enforcement for victims remains uncertain.

May 21, 2026Video

Google Launches Gemini Omni for Video Generation and Conversational Editing

Announced at Google I/O, Gemini Omni Flash is a new multimodal model that accepts text, image, audio, and video inputs to generate and edit video through natural conversation. The model embeds SynthID watermarks automatically and is positioned as a broader platform that will eventually handle more modalities beyond video.

May 21, 2026Image

OpenAI Adds C2PA Metadata and SynthID Watermarks to All Generated Images

OpenAI announced that images produced through ChatGPT, Codex, and its API will now carry both C2PA content credentials and Google's SynthID watermark. The dual approach is intended to maintain traceability even when metadata is stripped, since the two systems work differently and compensate for each other's gaps.

May 20, 2026Multimodal

Asset Studio is entering a new era of AI-powered creativity.

We’re introducing new multimodal capabilities into Google Ads’ Asset Studio — your creative destination to create, build and test your assets.

May 20, 2026Video

Google Introduces Gemini Omni AI Model That Can “Create Anything” With Video

Remember that lull from the past few months where we didn’t get the surprise announcement of another new “revolutionary” AI video model every single day? Well, that lull is over as the AI wars are heating up once again. However, with this AI revolution now several months and years old, it is admittedly getting harder to keep up with what every model is offering these days. We’re also several years into generative AI video, and it still hasn’t taken over in any meaningful way. Will this new Gemin

May 20, 2026Multimodal

It’s make or break time for AI labeling systems

If robust AI labeling was in place when these swagged out images of Pope Francis went viral, it may have been easier for people to tell they were fake. | Image: via Reddit We're about to find out if the systems designed to make deepfakes and AI-generated content easy to spot are actually up to snuff. SynthID and C2PA Content Credentials, two distinct technologies for invisibly tagging image, video, and audio files with information about their origins, are getting their biggest expansion to date,

May 20, 2026Image

OpenAI Gets Serious About Detecting Fake Images

OpenAI has announced that images generated with ChatGPT, Codex, and its API will include C2PA metadata and a SynthID watermark -- the two leading protocols in identifying AI images. [Read More]

May 19, 2026Video

Google rolls out Gemini Omni AI for video generation and editing

What's new? Gemini Omni Flash is a multimodal AI model that takes text, image, video, and audio inputs. It supports video editing and embeds a SynthID watermark.

May 19, 2026Image

Google Pics Makes AI Image Generation Way Less Annoying

Today at Google I/O 2026, the tech giant announced that it is bringing a new AI image creation and editing tool, Google Pics, to Workspace. [Read More]

May 19, 2026Video

Netflix Staffing AI Animation Unit Called Inkubator

Netflix is building a small internal team called Inkubator to explore generative AI workflows in animation, starting with short-form content. Job listings include CG artists, compositors, and a Head of Technology, suggesting a production-ready ambition rather than a pure research exercise.

May 19, 2026Video

Runway Positions Video Generation as a Path to World Models

In a TechCrunch profile, Runway laid out its thesis that video generation is the foundation for building general world models, framing its position outside the large-lab ecosystem as a strategic advantage. The piece offers a candid look at how the company sees its long-term competitive footing against well-resourced rivals like Google.

May 19, 2026Image

Soderbergh Used AI-Generated Images Throughout Lennon Documentary

Director Steven Soderbergh has confirmed that his documentary 'John Lennon: The Last Interview' incorporates AI-generated imagery, offering one of the more prominent examples of a name-brand filmmaker deliberately weaving generative visuals into a non-fiction work. Soderbergh discussed his reasoning publicly, adding a notable data point to the ongoing conversation about AI's role in documentary production.

May 19, 2026Image

YouTube Extends AI Deepfake Detection to All Adult Users

YouTube is rolling out its likeness-detection feature-previously limited to creators and public figures-to all users over 18. The tool uses a face scan to flag potential AI-generated lookalikes on the platform, after which users can request removal of matched content.

May 19, 2026Video

NVIDIA Releases Fine-Tuning Guide for Cosmos Predict 2.5 Video Model

NVIDIA has published a detailed walkthrough on fine-tuning its Cosmos Predict 2.5 world model using LoRA and DoRA techniques, specifically targeting robot video generation. The guide, hosted on Hugging Face, offers a practical path for teams wanting to adapt the model to domain-specific physical AI applications without full retraining.

May 19, 2026Image

Hasselblad Masters Disqualifies Entry Over AI Generation

Hasselblad has removed a shortlisted entry from its Masters 2026 competition after determining the image involved generative AI, following public scrutiny of the work. The case adds to a string of similar disqualifications at major photography competitions and highlights the continued difficulty of detection at the judging stage.

May 19, 2026Video

Streamer Realtime Deepfakes Himself into Mr. Beast, Says He Loves 'Touching Little Boys'

The software, called Delulu, is marketed specifically to streamers and lets them easily transform into other people including George Floyd, Jeffrey Epstein, and other streamers.

May 19, 2026Image

Google's Circle to Search feature can tell you if an image was AI-generated

Google is bringing its AI-detecting feature to more of its products to make it easier for people to verify if something was generated or edited with AI.

May 19, 2026Video

Google's Gemini Omni can generate 'anything from any input,' starting with video

Google didn't forget AI creators in its latest round of Gemini announcements.

No image

May 19, 2026Image

OpenAI is making it easier to check if an image was made by their models

OpenAI announced two new measures to help detect AI generated imagery: joining the open C2PA standard and adding Google's SynthID to its products.

May 19, 2026Video

Project Genie adds Google Street View integration and goes live for global AI Ultra users

Ground your snow-globe worlds in real-world locations from Google Maps.

May 19, 2026Image

Google's newest app is an AI-powered image editor

Google Pics isn't Photoshop, but it could be better than what's in Google Photos.

May 19, 2026Video

Agora-1 turns the N64 classic GoldenEye into a playable AI simulation for four players

Odyssey has released Agora-1, a world model that lets up to four players act simultaneously in an AI-generated world—tested on the N64 classic GoldenEye. Two separate models handle game state simulation and rendering in real time. The team sees potential in collaborative robotics and AI agent training. The article Agora-1 turns the N64 classic GoldenEye into a playable AI simulation for four players appeared first on The Decoder.

May 18, 2026Multimodal

Exclusive: Early look at the next Gemini desktop upgrade

Google’s Gemini desktop client for Mac is set to gain Voice Mode, Stream to Cursor, Omni video generation, and Spark-powered features.

No image

May 17, 2026Video

Simulate real-world places with Project Genie and Street View

We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.

May 13, 2026Video

Podcast: The Chinese Deepfake Software Powering Scams

We got Haotian AI, the Chinese-language deepfake software powering scams. We also talk about a man finding $1 million of Yu-Gi-Oh cards, and how the AI hard drive shortage is impacting internet archiving.