gen‑ai.news
Video

Google Introduces Gemini Omni AI Model That Can “Create Anything” With Video

Remember the lull of the past few months, when we didn’t get a surprise announcement of another “revolutionary” AI video model every single day? That lull is over, as the AI wars are heating up once again. With the AI revolution now several years old, though, it is admittedly getting harder to keep up with what each model offers. We’re also several years into generative AI video, and it still hasn’t taken over in any meaningful way. Will this new Gemin
