The pulse of generative image & video AI.

Twice a week, the most important stories in image and video generation - new models, notable research, and meaningful product releases - distilled into a 2-minute read. No hype, no filler.

Free. Unsubscribe any time. No spam, ever.

Archive

June 4, 2026Multimodal

Industry leaders share new perspectives on generative media for startups

Google for Startups has published a new report examining how early-stage companies are approaching generative media tools and workflows. The findings draw on perspectives from founders and industry figures navigating this space. The report aims to offer practical context for startups integrating AI-generated image and video into their products.

Groovely turns your photos into AI dance videos

Upload a photo, pick a trending dance template, and Groovely animates it into a vertical clip synced to the beat. No camera, choreography, or editing required.

June 4, 2026Multimodal

Let us filter AI slop, you cowards

Content labels on AI-generated images and videos have become more common across major platforms, but critics argue that labeling alone is not enough. The Verge makes the case that YouTube, Instagram, TikTok, and others should go a step further and give users the ability to actively filter AI-generated content from their feeds. Without that option, labels function more as a disclosure footnote than a meaningful tool for audience control.

June 4, 2026Video

xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution

xAI has updated its Grok Imagine system to version 1.5, adding an image-to-video model that converts still images into short video clips at up to 720p resolution. The new model accepts text prompts to guide motion and style, and multiple generated clips can be joined into longer sequences.

No image

June 3, 2026Image

Google’s Dreambeans, its weirdest-named AI tool to date, will turn your life into a cartoon

Google has introduced Dreambeans, a tool that pulls personal data from your Google account to generate AI-illustrated stories in a cartoon style. The feature represents a notable step toward using ambient personal data - photos, calendar events, and similar account content - as direct source material for generative image output. It is, by most measures, one of the more unusually named products Google has shipped.

June 3, 2026Image

A British MP is suing to see if xAI is legally responsible for the images Grok produces

A British MP has filed a lawsuit against xAI to establish whether the company bears legal responsibility for images generated by its Grok AI system. The case is part of a broader wave of scrutiny that includes investigations in the EU, the UK, and California. At issue is how far platform liability extends when an AI image generator produces harmful or problematic content.

June 3, 2026Image

Ideogram 4.0 drops as an open-weight model with native 2K resolution and improved text rendering

Ideogram has released version 4.0 of its text-to-image model as an open-weight release, featuring native 2K resolution output, bounding box layout control, and refined text rendering. On the DesignArena leaderboard, it leads all open models, sitting just below closed systems from OpenAI and Google. Commercial use of the weights requires a paid license.

June 3, 2026Multimodal

DaVinci Resolve 21 Officially Released With New Photo Editing, AI Tools, and Much More

Blackmagic Design has shipped the final release of DaVinci Resolve 21, marking one of the most substantial updates the software has seen. The version adds a dedicated Photo page for still-image editing alongside a set of AI-powered tools spread across the editing, color, audio, and visual effects areas of the application.

June 3, 2026Image

Amazon’s search bar will invent AI-generated products you can’t buy

Amazon has added AI-generated product imagery to its search bar, showing visual suggestions as shoppers type descriptions of clothing and home goods. The idea is to help users find items when they lack the precise terminology for a style or texture they have in mind.

No image

June 3, 2026Image

Amazon will show AI product images when you search for some reason

Amazon is introducing AI-generated product images into its search results, displaying visuals that correspond to what shoppers type in rather than pulling exclusively from existing seller photography. The company says the feature is intended to help users navigate toward relevant products more easily.

June 3, 2026Image

Director Martin Scorsese Joins AI Image Startup Black Forest Labs

Martin Scorsese has taken on an adviser role at Black Forest Labs, the AI image generation startup behind the Flux family of models. The move has drawn surprise and some criticism from filmmakers and artists who had expected the director to maintain a more cautious stance toward generative AI tools.

June 3, 2026Image

Build 2026: Microsoft tops Google in image generation while playing catch-up on reasoning

At Build 2026, Microsoft unveiled seven proprietary AI models, including its first in-house reasoning model, signaling a broader push toward self-reliance in foundation model development. The announcements also covered a new tuning approach and an autonomous agent capable of operating in the background. On image generation, Microsoft's internal models reportedly outperform Google's current offerings.

June 3, 2026Video

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA has released Cosmos 3, an open omnimodal foundation model that combines a vision-language reasoning component with a diffusion-based video generator in a two-tower architecture. The system is designed to support physical AI applications by linking language-grounded reasoning with the generation of plausible world states and robot actions.

No image

June 1, 2026Multimodal

MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multimodality, and Agentic Coding

MiniMax has released M3, a multimodal model built around a new sparse attention architecture that supports context windows up to one million tokens. The model handles text, images, and video natively, and includes capabilities for agentic tasks such as computer use and coding. The release positions M3 as a general-purpose foundation model aimed at both long-context reasoning and real-world tool use.

June 1, 2026Image

Years in the Making, Glass Imaging Is Delivering on its Promise to Transform Smartphone Photography

Glass Imaging's GlassAI neural image signal processing technology has found a prominent home in the new Honor 600 smartphone, marking a significant step for the startup after years of development. The integration focuses on improving zoom photography, an area where smartphone cameras have historically struggled with noise and detail loss.

June 1, 2026Video

Nvidia bets big on physical AI at GTC Taipei with a new world model, driving brain, and open humanoid robot

Nvidia used GTC Taipei to unveil several new tools aimed at physical AI applications, including a new world model, a larger autonomous driving model, and an open reference platform for humanoid robots. The announcements signal a continued push to make simulation and synthetic data central to how robots and vehicles are trained. Here is a closer look at what was shown and why it matters.

June 1, 2026Image

Model Sues Fashion Brand After it AI-Generated Pictures of Her

A model has filed a lawsuit against a fashion brand after the company used AI to generate images of her without permission. The case highlights growing legal and ethical tensions as brands increasingly turn to AI-generated visuals to cut production costs, often at the expense of the people whose likenesses or work previously fed those systems.

May 31, 2026Image

The Myth of Intent in Photography

As AI-generated images grow indistinguishable from photographs, photography faces a deeper question about what authenticity actually means. Judges and seasoned experts are increasingly unable to tell the two apart, raising doubts about whether intent behind an image can serve as a meaningful dividing line. The crisis touches competitions, editorial standards, and how audiences relate to images.

May 30, 2026Image

Microsoft readies new MAI voice and image models for Build 2026

Microsoft is preparing a set of new in-house AI models under the MAI brand for its Build 2026 developer conference. The lineup includes an image model, a transcription model, and a multilingual voice model. The announcements would mark a continued push by Microsoft to develop proprietary AI capabilities alongside its existing partnerships.

No image

May 30, 2026Video

A Darren Aronofsky-Produced Short Used Google Veo to Bring Dustin Yellin’s Sculptures to Life

A short film produced by Darren Aronofsky and starring Chris Rock and Paul Rudd used Google's Veo video generation model to animate the layered glass sculptures of artist Dustin Yellin. The project, titled 'Goodnight Lamby,' premiered at Cannes in the Classics section. It represents one of the more high-profile uses of generative video in a festival-circuit film context.

May 30, 2026Video

The AI Film ‘Dreams of Violets’ Is How You Get Me to Hate Movies

A critic at PetaPixel responds to news that the fully AI-generated film 'Dreams of Violets' has been accepted into the Tribeca Film Festival. The piece argues that the film represents a troubling direction for cinema rather than a promising one. It raises questions about what it means when AI-generated work enters spaces traditionally reserved for human creative labor.

May 30, 2026Image

AI grifters are creating fake Black people to sell Shein junk

Sellers on TikTok and other platforms are using AI-generated personas depicting Black women in emotional distress to market mass-produced dropshipped goods. The fake influencers are designed to exploit sympathy and identity, presenting factory items as handmade products from real small-business owners. The trend raises questions about platform enforcement and the harm done to actual Black creators.

May 29, 2026Video

A Feature-Length AI-Generated ‘Live Action’ Movie Is Premiering at Tribeca for Some Reason

The Tribeca Film Festival will premiere what is being described as the first feature-length film produced entirely through AI-generated imagery, presented in a live-action format. The selection marks a notable moment for a festival long associated with independent and boundary-pushing cinema. Whether the work signals a new creative direction or raises questions about the festival's curatorial standards remains a point of debate.

May 29, 2026Video

Google fixes several bugs in Gemini usage limits that burned through quotas too fast

Google has patched a billing bug in its Gemini app that caused a small number of Omni video generations to consume an entire usage quota. The fix also doubles the video generation allowance for Ultra subscribers and stops charging users for failed requests.

May 29, 2026Video

Tech companies desperately want to film you doing chores

A startup called Shift is offering free home cleaning to New Yorkers - with a condition attached. The company films its cleaners working through ordinary household tasks, and that footage becomes training data for robotic systems. It is one of several efforts underway to capture the kind of mundane domestic labor that has proved difficult to teach machines.

No image

May 29, 2026Multimodal

Jorge R. Gutierrez Won’t Make AI-Generated ‘Punky Duck’ Series at Amazon MGM After Backlash

Director Jorge R. Gutierrez has walked back plans to produce an AI-generated animated series called 'Punky Duck' for Amazon MGM following significant backlash from the animation community. Gutierrez said his original intent was to highlight artists using the technology, but the response made clear the project would not move forward in its planned form.

May 29, 2026Image

Nano Banana 2 and Nano Banana Pro are now in General Availability

Google Cloud has moved Nano Banana 2 and Nano Banana Pro into general availability, bringing scalable image generation with 1K and 2K output resolution to production workloads. The models also include a preview feature for video input, expanding their range of supported media types.

May 29, 2026Image

Adobe’s conversational AI agent is a mediocre design intern

Adobe's Firefly AI Assistant takes a different approach than most generative image tools - acting as a conversational agent that operates Adobe's design apps on your behalf rather than generating images from scratch. A hands-on beta test finds the concept genuinely interesting but the actual output underwhelming, somewhere between useful shortcut and frustrating limitation.

May 28, 2026Video

A $2,000 AI-generated film will make its debut at Tribeca

A 75-minute AI-generated film about the Iranian government's killing of protesters will premiere at the Tribeca Festival next month. Called Dreams of Violets, it was made for roughly $2,000 by two Iranian-born brothers using AI to recreate people and events drawn from journalistic records and eyewitness accounts.

No image

May 28, 2026Video

Amazon MGM’s Dream of a ‘New Golden Age’ of Animation Hinges on Three Iffy-Looking AI-Created Kids Shows

Amazon MGM Studios is betting on generative AI to power a new wave of animated children's programming, with three shows greenlit following presentations at the AI on the Lot conference. COO Albert Cheng and the creators behind each project made their case for using AI tools in production, though the results have drawn skeptical early reactions. The move signals a significant shift in how a major studio is approaching animation pipelines and content development.

May 28, 2026Video

Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video

Amazon MGM Studios and AWS are launching a "GenAI Creators' Fund" that gives filmmakers money and access to the in-house AI platform "Project Nara." Three animated series are already in production - the teams had five weeks for their pilots. Amazon says it now has the "only end-to-end AI content ecosystem in the industry." The article Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video appeared first on The Decoder.

May 27, 2026Image

Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules

DiffusionBlocks converts residual networks into independently trainable blocks by interpreting layer updates as reverse diffusion denoising steps. The post Sakana AI Proposes DiffusionBlocks: a Block-wise Training Framework That Converts Residual Networks into Independently Trainable Denoising Modules appeared first on MarkTechPost.

May 27, 2026Video

Google’s New Gemini Omni AI Video Model Can Do Crazy Things

Google's new Gemini Omni artificial intelligence (AI) model can do some wild things. The model's key promise is to create anything from, well, anything. [Read More]

May 27, 2026Image

Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks

Microsoft's MAI-Image-2.5 ranks third on Arena's text-to-image leaderboard, on par with Google's Nano Banana 2 but still behind OpenAI's Image-2. The model shows clear gains over its predecessor, especially in rendering text inside images and commercial visuals. The article Microsoft's MAI-Image-2.5 pulls even with Google's Nano Banana 2 on benchmarks appeared first on The Decoder.

May 27, 2026Video

YouTube will automatically detect and label AI-generated videos

It should be easier to tell at a glance if a YouTube video will contain AI-generated gunk.

May 27, 2026Video

YouTube will try to automatically flag AI videos starting this month

YouTube is tightening its AI labeling rules. Labels for photorealistic or heavily AI-altered content will now show up in more visible spots, below the player for long videos and as an overlay on Shorts. Starting May 2026, an automatic detection system will flag AI-generated content even if creators don't disclose it. Recommendations and monetization won't be affected. The article YouTube will try to automatically flag AI videos starting this month appeared first on The Decoder.

No image

May 27, 2026Video

YouTube will now automatically label AI videos

YouTube will now automatically label videos that use significant photorealistic AI, instead of relying solely on creators to disclose AI-generated content themselves. It's also making AI labels more prominent.

May 27, 2026Video

A Streaming Service Made Up Entirely of AI-Generated Shows is About to Launch

Digital asset platform Artlist is launching Artlist TV, a streaming platform that appears to be exclusively populated by AI-generated shows. [Read More]

May 26, 2026Image

Imagen Is Offering Full AI Editing Access for $10, Just In Time for Peak Season

Post-processing has long been the most time-consuming part of a photographer's workflow, and the numbers back that up. According to the 2026 Zenfolio State of the Photography Industry report, about 70% of photographers spend between 26% and 75% of their working time on editing. Only 5% of photographers surveyed feel they are managing the stress of running their business well. [Read More]

May 26, 2026Image

Pope Leo Warns AI Images Are a ‘Powerful Amplifier’ of Disinformation

Pope Leo has released an encyclical about artificial intelligence, urging authorities to regulate the technology and warning that AI image tools have become a "powerful amplifier" for those spreading disinformation. [Read More]

May 26, 2026Image

NYC Gallery Says it Has ‘Every Right’ to Create AI Version of Iconic Ansel Adams Photo

The owner of the Danziger Gallery has released a statement defending his actions after putting an AI-generated version of Ansel Adams' Moonrise on sale at The Photography Show in New York. [Read More]

May 25, 2026Image

NYC Gallery Sold an AI-Generated Ansel Adams Photo Without Permission

The New York Danziger Gallery displayed for sale an AI-generated version of Ansel Adams' photo "Moonrise Over Hernandez" without consulting the photographer's trust, effectively stealing the legendary artist's work and dramatically altering it with AI for the sake of profit. [Read More]

May 23, 2026Video

Google's Gemini Omni Brings Any-Input Video Generation to Shorts, Workspace, and Beyond

Google launched Gemini Omni Flash at I/O, a multimodal model that accepts text, images, audio, and video as inputs and generates or edits video through conversational prompts. The model is already powering a YouTube Shorts remix feature and will feed into Google Workspace tools. SynthID watermarking is embedded in all output.

May 23, 2026Multimodal

SynthID Watermarking Gains OpenAI, Nvidia, and Industry Backing

Google's SynthID invisible watermarking standard is being adopted by OpenAI, Nvidia, and others, marking its most significant expansion since launch. OpenAI has confirmed that images generated through ChatGPT, Codex, and its API will carry both C2PA metadata and SynthID markers. The parallel adoption of two complementary standards could finally give AI content labeling real coverage.

May 23, 2026Multimodal

ByteDance Releases Lance, a 3B-Parameter Unified Model for Image and Video Generation and Editing

ByteDance's Intelligent Creation Lab has open-sourced Lance, a multimodal model that handles image and video understanding, generation, and editing within a single 3B-parameter framework. The unified architecture is designed to avoid the fragmentation of maintaining separate models for each task. Code and weights are publicly available.

May 23, 2026Video

Luma and Wonder Project Form Production Company to Move AI Video Into Narrative Film

Luma AI and Wonder Project have jointly launched Innovative Dreams, a production company aimed at using AI video tools for longer-form narrative content rather than short clips. The partnership reflects a broader shift among AI video companies toward working with professional production pipelines rather than pitching consumer tools directly to audiences.

May 23, 2026Multimodal

Adobe Brings Creative Cloud to Gemini Agentic Workflows

Adobe is extending its Creative Cloud connector to Google Gemini, following a similar deal with Anthropic's Claude announced weeks earlier. The integration lets Gemini orchestrate Photoshop, Illustrator, Premiere, and Express directly, with users describing what they want and the Adobe tools handling execution. Rollout is expected within a few weeks of Google I/O.

May 23, 2026Video

Google Genie 3 Turns Street View Pins Into Walkable AI-Generated Worlds

Google DeepMind has connected its Genie 3 world model to Street View data, letting users drop a map pin and receive an interactive, explorable environment generated from real-location imagery. The demo is immediately interesting for games and creative tools, but the longer-term intent appears to be training AI agents and robotics systems on grounded spatial data.