xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution

xAI has released grok-imagine-video-1.5-preview, a new image-to-video model built into its Grok Imagine platform. The model takes a still image as input and, guided by a text prompt, generates video footage at resolutions up to 720p. The preview label suggests the release is still in an early testing phase, with a more polished version likely to follow.
One notable feature is the ability to stitch multiple generated clips together into longer scenes. This kind of multi-clip composition is increasingly common in AI video tools, allowing users to build more complex sequences rather than being limited to a single short output. It also gives creators more control over pacing and narrative without needing to export and edit clips in a separate application.
The image-to-video format - where a reference image anchors the visual style and content of the resulting footage - has become a standard approach among AI video generators. It gives users a concrete starting point and tends to produce more predictable results than generating video purely from text. Competitors including Runway, Kling, and Pika have offered similar capabilities for some time, so xAI is entering a fairly established segment of the market.
The 720p output resolution is functional for web use but falls short of the 1080p or higher outputs now available from several competing platforms. Whether xAI plans to raise the resolution ceiling in future updates remains to be seen. For now, the 1.5 preview positions Grok Imagine as a more complete creative tool than its earlier text-to-image-only iteration, though it will need continued development to stand alongside the more mature offerings already on the market.

