gen‑ai.news

The pulse of generative image & video AI.

Twice a week, the most important stories in image and video generation - new models, notable research, and meaningful product releases - distilled into a 2-minute read. No hype, no filler.

Free. Unsubscribe any time. No spam, ever.

Archive

Video

Google Genie 3 Turns Street View Pins Into Walkable AI-Generated Worlds

Google DeepMind has connected its Genie 3 world model to Street View data, letting users drop a map pin and receive an interactive, explorable environment generated from real-location imagery. The demo is immediately interesting for games and creative tools, but the longer-term intent appears to be training AI agents and robotics systems on grounded spatial data.

Video

Beeble AI Launches Canvas, a Node-Based AI Compositor for VFX Pipelines

Beeble AI has released Canvas, a node-based visual compositing platform that integrates AI video models, traditional compositing tools, and workflow automation in a single environment. The system includes native access to Beeble's SwitchX and SwitchLight models and supports external generative models for batch iteration across multiple shots.

Video

Google Launches Gemini Omni for Video Generation and Editing

Announced at Google I/O, Gemini Omni Flash is a multimodal model that takes text, images, audio, and video as inputs and generates or edits video through natural conversation. Google says the model draws on its knowledge base of science, history, and cultural context to improve physical plausibility in generated footage. A SynthID watermark is embedded in all output.

Image

OpenAI Adds SynthID Watermarking Alongside C2PA to All Generated Images

OpenAI announced it will embed Google's SynthID watermark in images generated through ChatGPT, Codex, and its API, layering it on top of the C2PA content credentials it already applies. The two systems are designed to complement each other: C2PA carries detailed provenance metadata while SynthID provides a signal that can survive format changes and metadata stripping.

Multimodal

Google Expands SynthID and C2PA Detection to Chrome and Search

Google is bringing AI content verification to Chrome and several Search surfaces, including Lens and AI Mode, letting users check whether an image carries SynthID markers or C2PA content credentials directly in the browser. The expansion, announced at Google I/O, coincides with OpenAI and Nvidia both adopting SynthID, pushing the watermarking standard toward broader industry use.

Image

Google Pics Brings Comment-Style AI Image Editing to Workspace

Google launched Pics, a new AI image generation and editing app for Workspace, with an interface that lets users click on specific parts of an image and annotate what they want changed - similar to leaving a comment in a Google Doc. The app is powered by Gemini and Google's Nano Banana 2 image model.

Video

YouTube Shorts Gets AI Remix Feature Powered by Gemini Omni

Google has added a remix option to YouTube Shorts that lets viewers restyle clips or insert themselves into other people's videos using Gemini Omni. Creators can disable the feature for their content, and all remixed output carries a SynthID watermark.

Video

First AI-Produced Feature Film Screens at Cannes, Made With Higgsfield

Hell Grind, a 90-minute sci-fi heist film produced entirely with Higgsfield AI, is screening at the Cannes Film Festival. Made with a team of 15 filmmakers who generated over 16,000 clips to produce 253 final shots, the production cost under $500,000 - a figure the company contrasts with a claimed $50 million equivalent using traditional methods.

Video

Take It Down Act Takes Effect, Requiring 48-Hour Removal of Sexual Deepfakes

A US federal law requiring social media platforms to remove nonconsensual intimate imagery - including AI-generated deepfakes - within 48 hours of a valid request has come into force. Experts have raised concerns that the takedown mechanism could be misused for censorship, and that practical enforcement for victims remains uncertain.

Video

Google Launches Gemini Omni for Video Generation and Conversational Editing

Announced at Google I/O, Gemini Omni Flash is a new multimodal model that accepts text, image, audio, and video inputs to generate and edit video through natural conversation. The model embeds SynthID watermarks automatically and is positioned as a broader platform that will eventually handle more modalities beyond video.

Image

OpenAI Adds C2PA Metadata and SynthID Watermarks to All Generated Images

OpenAI announced that images produced through ChatGPT, Codex, and its API will now carry both C2PA content credentials and Google's SynthID watermark. The dual approach is intended to maintain traceability even when metadata is stripped, since the two systems work differently and compensate for each other's gaps.

Video

Google Introduces Gemini Omni AI Model That Can “Create Anything” With Video

Remember the lull of the past few months, when we didn’t get a surprise announcement of another “revolutionary” AI video model every single day? That lull is over, and the AI wars are heating up once again. With this AI revolution now several years old, it is admittedly getting harder to keep up with what each model offers. We’re also several years into generative AI video, and it still hasn’t taken over in any meaningful way. Will this new Gemin

Multimodal

It’s make or break time for AI labeling systems

We're about to find out if the systems designed to make deepfakes and AI-generated content easy to spot are actually up to snuff. SynthID and C2PA Content Credentials, two distinct technologies for invisibly tagging image, video, and audio files with information about their origins, are getting their biggest expansion to date,

Image

OpenAI Gets Serious About Detecting Fake Images

OpenAI has announced that images generated with ChatGPT, Codex, and its API will include C2PA metadata and a SynthID watermark -- the two leading protocols in identifying AI images.

Video

Netflix Staffing AI Animation Unit Called Inkubator

Netflix is building a small internal team called Inkubator to explore generative AI workflows in animation, starting with short-form content. Job listings include CG artists, compositors, and a Head of Technology, suggesting a production-ready ambition rather than a pure research exercise.

Video

Runway Positions Video Generation as a Path to World Models

In a TechCrunch profile, Runway laid out its thesis that video generation is the foundation for building general world models, framing its position outside the large-lab ecosystem as a strategic advantage. The piece offers a candid look at how the company sees its long-term competitive footing against well-resourced rivals like Google.

Image

Soderbergh Used AI-Generated Images Throughout Lennon Documentary

Director Steven Soderbergh has confirmed that his documentary 'John Lennon: The Last Interview' incorporates AI-generated imagery, offering one of the more prominent examples of a name-brand filmmaker deliberately weaving generative visuals into a non-fiction work. Soderbergh discussed his reasoning publicly, adding a notable data point to the ongoing conversation about AI's role in documentary production.

Image

YouTube Extends AI Deepfake Detection to All Adult Users

YouTube is rolling out its likeness-detection feature, previously limited to creators and public figures, to all users over 18. The tool uses a face scan to flag potential AI-generated lookalikes on the platform, after which users can request removal of matched content.

Video

NVIDIA Releases Fine-Tuning Guide for Cosmos Predict 2.5 Video Model

NVIDIA has published a detailed walkthrough on fine-tuning its Cosmos Predict 2.5 world model using LoRA and DoRA techniques, specifically targeting robot video generation. The guide, hosted on Hugging Face, offers a practical path for teams wanting to adapt the model to domain-specific physical AI applications without full retraining.

Image

Hasselblad Masters Disqualifies Entry Over AI Generation

Hasselblad has removed a shortlisted entry from its Masters 2026 competition after determining the image involved generative AI, following public scrutiny of the work. The case adds to a string of similar disqualifications at major photography competitions and highlights the continued difficulty of detection at the judging stage.

Video

Agora-1 turns the N64 classic GoldenEye into a playable AI simulation for four players

Odyssey has released Agora-1, a world model that lets up to four players act simultaneously in an AI-generated world, tested on the N64 classic GoldenEye. Two separate models handle game state simulation and rendering in real time. The team sees potential in collaborative robotics and AI agent training.

Video

Podcast: The Chinese Deepfake Software Powering Scams

We got Haotian AI, the Chinese-language deepfake software powering scams. We also talk about a man finding $1 million of Yu-Gi-Oh cards, and how the AI hard drive shortage is impacting internet archiving.