gen‑ai.news

The pulse of generative image & video AI.

Twice a week, the most important stories in image and video generation - new models, notable research, and meaningful product releases - distilled into a 2-minute read. No hype, no filler.

Free. Unsubscribe any time. No spam, ever.

Archive

Multimodal

Industry leaders share new perspectives on generative media for startups

Google for Startups has published a new report examining how early-stage companies are approaching generative media tools and workflows. The findings draw on perspectives from founders and industry figures navigating this space. The report aims to offer practical context for startups integrating AI-generated image and video into their products.

Multimodal

Let us filter AI slop, you cowards

Content labels on AI-generated images and videos have become more common across major platforms, but critics argue that labeling alone is not enough. The Verge makes the case that YouTube, Instagram, TikTok, and others should go a step further and give users the ability to actively filter AI-generated content from their feeds. Without that option, labels function more as a disclosure footnote than a meaningful tool for audience control.

No image
Image

Google’s Dreambeans, its weirdest-named AI tool to date, will turn your life into a cartoon

Google has introduced Dreambeans, a tool that pulls personal data from your Google account to generate AI-illustrated stories in a cartoon style. The feature represents a notable step toward using ambient personal data - photos, calendar events, and similar account content - as direct source material for generative image output. It is, by most measures, one of the more unusually named products Google has shipped.

Image

A British MP is suing to see if xAI is legally responsible for the images Grok produces

A British MP has filed a lawsuit against xAI to establish whether the company bears legal responsibility for images generated by its Grok AI system. The case is part of a broader wave of scrutiny that includes investigations in the EU, the UK, and California. At issue is how far platform liability extends when an AI image generator produces harmful or problematic content.

Multimodal

DaVinci Resolve 21 Officially Released With New Photo Editing, AI Tools, and Much More

Blackmagic Design has shipped the final release of DaVinci Resolve 21, marking one of the most substantial updates the software has seen. The version adds a dedicated Photo page for still-image editing alongside a set of AI-powered tools spread across the editing, color, audio, and visual effects areas of the application.

Image

Amazon’s search bar will invent AI-generated products you can’t buy

Amazon has added AI-generated product imagery to its search bar, showing visual suggestions as shoppers type descriptions of clothing and home goods. The idea is to help users find items when they lack the precise terminology for a style or texture they have in mind.

No image
Image

Amazon will show AI product images when you search for some reason

Amazon is introducing AI-generated product images into its search results, displaying visuals that correspond to what shoppers type in rather than pulling exclusively from existing seller photography. The company says the feature is intended to help users navigate toward relevant products more easily.

Image

Director Martin Scorsese Joins AI Image Startup Black Forest Labs

Martin Scorsese has taken on an adviser role at Black Forest Labs, the AI image generation startup behind the Flux family of models. The move has drawn surprise and some criticism from filmmakers and artists who had expected the director to maintain a more cautious stance toward generative AI tools.

Image

Build 2026: Microsoft tops Google in image generation while playing catch-up on reasoning

At Build 2026, Microsoft unveiled seven proprietary AI models, including its first in-house reasoning model, signaling a broader push toward self-reliance in foundation model development. The announcements also covered a new tuning approach and an autonomous agent capable of operating in the background. On image generation, Microsoft's internal models reportedly outperform Google's current offerings.

Video

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA has released Cosmos 3, an open omnimodal foundation model that combines a vision-language reasoning component with a diffusion-based video generator in a two-tower architecture. The system is designed to support physical AI applications by linking language-grounded reasoning with the generation of plausible world states and robot actions.

No image
Multimodal

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

MiniMax has released M3, a multimodal model built around a new sparse attention architecture that supports context windows up to one million tokens. The model handles text, images, and video natively, and includes capabilities for agentic tasks such as computer use and coding. The release positions M3 as a general-purpose foundation model aimed at both long-context reasoning and real-world tool use.

Video

Nvidia bets big on physical AI at GTC Taipei with a new world model, driving brain, and open humanoid robot

Nvidia used GTC Taipei to unveil several new tools aimed at physical AI applications, including a new world model, a larger autonomous driving model, and an open reference platform for humanoid robots. The announcements signal a continued push to make simulation and synthetic data central to how robots and vehicles are trained. Here is a closer look at what was shown and why it matters.

Image

Model Sues Fashion Brand After it AI-Generated Pictures of Her

A model has filed a lawsuit against a fashion brand after the company used AI to generate images of her without permission. The case highlights growing legal and ethical tensions as brands increasingly turn to AI-generated visuals to cut production costs, often at the expense of the people whose likenesses or work previously fed those systems.

Image

The Myth of Intent in Photography

As AI-generated images grow indistinguishable from photographs, photography faces a deeper question about what authenticity actually means. Judges and seasoned experts are increasingly unable to tell the two apart, raising doubts about whether intent behind an image can serve as a meaningful dividing line. The crisis touches competitions, editorial standards, and how audiences relate to images.

Image

Microsoft readies new MAI voice and image models for Build 2026

Microsoft is preparing a set of new in-house AI models under the MAI brand for its Build 2026 developer conference. The lineup includes an image model, a transcription model, and a multilingual voice model. The announcements would mark a continued push by Microsoft to develop proprietary AI capabilities alongside its existing partnerships.

Video

The AI Film ‘Dreams of Violets’ Is How You Get Me to Hate Movies

A critic at PetaPixel responds to news that the fully AI-generated film 'Dreams of Violets' has been accepted into the Tribeca Film Festival. The piece argues that the film represents a troubling direction for cinema rather than a promising one. It raises questions about what it means when AI-generated work enters spaces traditionally reserved for human creative labor.

Image

AI grifters are creating fake Black people to sell Shein junk

Sellers on TikTok and other platforms are using AI-generated personas depicting Black women in emotional distress to market mass-produced dropshipped goods. The fake influencers are designed to exploit sympathy and identity, presenting factory items as handmade products from real small-business owners. The trend raises questions about platform enforcement and the harm done to actual Black creators.

Video

A Feature-Length AI-Generated ‘Live Action’ Movie Is Premiering at Tribeca for Some Reason

The Tribeca Film Festival will premiere what is being described as the first feature-length film produced entirely through AI-generated imagery, presented in a live-action format. The selection marks a notable moment for a festival long associated with independent and boundary-pushing cinema. Whether the work signals a new creative direction or raises questions about the festival's curatorial standards remains a point of debate.

Video

Tech companies desperately want to film you doing chores

A startup called Shift is offering free home cleaning to New Yorkers - with a condition attached. The company films its cleaners working through ordinary household tasks, and that footage becomes training data for robotic systems. It is one of several efforts underway to capture the kind of mundane domestic labor that has proved difficult to teach machines.

Image

Nano Banana 2 and Nano Banana Pro are now in General Availability

Google Cloud has moved Nano Banana 2 and Nano Banana Pro into general availability, bringing scalable image generation with 1K and 2K output resolution to production workloads. The models also include a preview feature for video input, expanding their range of supported media types.

Image

Adobe’s conversational AI agent is a mediocre design intern

Adobe's Firefly AI Assistant takes a different approach than most generative image tools - acting as a conversational agent that operates Adobe's design apps on your behalf rather than generating images from scratch. A hands-on beta test finds the concept genuinely interesting but the actual output underwhelming, somewhere between useful shortcut and frustrating limitation.

Video

A $2,000 AI-generated film will make its debut at Tribeca

A 75-minute AI-generated film about the Iranian government's killing of protesters will premiere at the Tribeca Festival next month. Called Dreams of Violets, it was made for roughly $2,000 by two Iranian-born brothers using AI to recreate people and events drawn from journalistic records and eyewitness accounts.

No image
Video

Amazon MGM’s Dream of a ‘New Golden Age’ of Animation Hinges on Three Iffy-Looking AI-Created Kids Shows

Amazon MGM Studios is betting on generative AI to power a new wave of animated children's programming, with three shows greenlit following presentations at the AI on the Lot conference. COO Albert Cheng and the creators behind each project made their case for using AI tools in production, though the results have drawn skeptical early reactions. The move signals a significant shift in how a major studio is approaching animation pipelines and content development.

Video

Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video

Amazon MGM Studios and AWS are launching a "GenAI Creators' Fund" that gives filmmakers money and access to the in-house AI platform "Project Nara." Three animated series are already in production - the teams had five weeks for their pilots. Amazon says it now has the "only end-to-end AI content ecosystem in the industry." The article Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video appeared first on The Decoder.

Image

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals. The article Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks appeared first on The Decoder.

Video

YouTube will try to automatically flag AI videos starting this month

YouTube is tightening its AI labeling rules. Labels for photorealistic or heavily AI-altered content will now show up in more visible spots, below the player for long videos and as an overlay on Shorts. Starting May 2026, an automatic detection system will flag AI-generated content even if creators don't disclose it. Recommendations and monetization won't be affected. The article YouTube will try to automatically flag AI videos starting this month appeared first on The Decoder.

No image
Video

YouTube will now automatically label AI videos

YouTube will now automatically label videos that use significant photorealistic AI, instead of relying solely on creators to disclose AI-generated content themselves. It's also making AI labels more prominent.

Image

Imagen Is Offering Full AI Editing Access for $10, Just In Time for Peak Season

Post-processing has long been the most time-consuming part of a photographer's workflow, and the numbers back that up. According to the 2026 Zenfolio State of the Photography Industry report, about 70% of photographers spend between 26% and 75% of their working time on editing. Only 5% of photographers surveyed feel they are managing the stress of running their business well. [Read More]

Image

NYC Gallery Sold an AI-Generated Ansel Adams Photo Without Permission

The New York Danziger Gallery displayed for sale an AI-generated version of Ansel Adams' photo "Moonrise Over Hernandez" without consulting the photographer's trust, effectively stealing the legendary artist's work and dramatically altering it with AI for the sake of profit. [Read More]

Multimodal

SynthID Watermarking Gains OpenAI, Nvidia, and Industry Backing

Google's SynthID invisible watermarking standard is being adopted by OpenAI, Nvidia, and others, marking its most significant expansion since launch. OpenAI has confirmed that images generated through ChatGPT, Codex, and its API will carry both C2PA metadata and SynthID markers. The parallel adoption of two complementary standards could finally give AI content labeling real coverage.

Video

Luma and Wonder Project Form Production Company to Move AI Video Into Narrative Film

Luma AI and Wonder Project have jointly launched Innovative Dreams, a production company aimed at using AI video tools for longer-form narrative content rather than short clips. The partnership reflects a broader shift among AI video companies toward working with professional production pipelines rather than pitching consumer tools directly to audiences.

Multimodal

Adobe Brings Creative Cloud to Gemini Agentic Workflows

Adobe is extending its Creative Cloud connector to Google Gemini, following a similar deal with Anthropic's Claude announced weeks earlier. The integration lets Gemini orchestrate Photoshop, Illustrator, Premiere, and Express directly, with users describing what they want and the Adobe tools handling execution. Rollout is expected within a few weeks of Google I/O.

Video

Google Genie 3 Turns Street View Pins Into Walkable AI-Generated Worlds

Google DeepMind has connected its Genie 3 world model to Street View data, letting users drop a map pin and receive an interactive, explorable environment generated from real-location imagery. The demo is immediately interesting for games and creative tools, but the longer-term intent appears to be training AI agents and robotics systems on grounded spatial data.