The News: Elon Musk confirmed that xAI's Grok Imagine image and video generation tool is getting better "almost every day."
Why It Matters: Grok Imagine is already the top-ranked image-to-video AI on the Artificial Analysis leaderboard ā and it's still accelerating. For Tesla owners who use X Premium or SuperGrok, the tool you already have access to is quietly becoming one of the most capable AI generators on the market.
Source: @elonmusk on X
Grok Imagine Is Improving Almost Every Day ā Here's Where It Stands Right Now
Elon Musk doesn't often drop product status updates in a single sentence, but when he does, it's worth paying attention. On March 7, 2026, Musk posted that Grok Imagine gets better almost every day ā a brief but telling signal about the pace of development at xAI.
That's not marketing language. xAI has been shipping updates to Grok Imagine at a pace that few AI labs match, and the numbers back it up.
š Key Figures
| Metric | Value | Context |
|---|---|---|
| Videos generated (30-day window) | 1.245 billion | As of Feb 2, 2026 |
| Leaderboard ranking | #1 | Artificial Analysis Image-to-Video Arena (ELO 1,336) |
| Max video length (v1.0) | 15 seconds | 720p with synchronized audio |
| Max image resolution | 2048 Ć 2048 px | Default output, multiple aspect ratios |
| Training compute | 110,000 NVIDIA GB200 GPUs | Aurora autoregressive engine |
| API video pricing | $0.05 / second | API launched Jan 28, 2026 |
| Consumer access (X Premium) | From $8/month | Premium+ at $40/mo or $395/yr |
From Launch to Leaderboard: A Fast Timeline
Grok Imagine Video launched in August 2025 ā less than seven months ago. By February 2026, xAI shipped version 1.0 with 15-second clip support, 720p resolution, and synchronized audio. The engine powering it ā Aurora ā is an autoregressive model trained on a cluster of 110,000 NVIDIA GB200 GPUs, one of the largest training runs disclosed by any AI lab.
The most recent feature addition, as of March 2, 2026, is "Extend from Frame" ā a capability that lets users take the final frame of an existing video clip and use it as the starting point for a new scene, generating continuation clips in 6ā10 second increments. It's a practical workflow upgrade that moves Grok Imagine closer to a full short-form video production tool.
The result: Grok Imagine Video now sits at the top of the Artificial Analysis Image-to-Video Arena leaderboard with an ELO score of 1,336 ā ahead of every other publicly available image-to-video model at the time of writing.
Who Has Access ā and What You Get
š Access Tiers
| Tier | Price | What You Get |
|---|---|---|
| Free (X user) | $0 | Up to 10 images/day, watermarked, no video |
| X Premium | From $8/mo | More daily generations, higher quality output |
| X Premium+ | $40/mo or $395/yr | Enhanced output quality, higher generation limits |
| Developer API | $0.05/second of video | Full programmatic access, no watermarks |
š The BASENOR Take
The speed of Grok Imagine's improvement is the real story here. Going from launch to leaderboard-leader in under seven months ā while generating over a billion videos in a single 30-day window ā is a signal that xAI is treating this as a flagship product, not a side feature.
For Tesla owners specifically, the connection is straightforward: xAI is the AI arm of Elon Musk's broader technology ecosystem, and the same engineering culture and compute infrastructure that powers Grok Imagine is adjacent to what's being built for Tesla's AI stack. A team that ships this fast on consumer AI is one worth watching closely.
The "Extend from Frame" feature in particular hints at where this is heading ā not just static image generation, but a full narrative video tool. At $0.05 per second via API and free basic access for any X user, the barrier to entry is low enough that this is already in the hands of millions of people. Musk's comment that it improves "almost every day" suggests the gap between Grok Imagine and more established tools will continue to narrow ā and possibly widen in xAI's favor.
š° Deep Dive
What makes Musk's comment notable isn't the enthusiasm ā it's the specificity of "almost every day." That's a development cadence claim, not a marketing tagline. For a product that only launched in August 2025, shipping meaningful improvements at daily or near-daily frequency implies a large, focused engineering team and a feedback loop that's working.
The Aurora engine underpinning Grok Imagine is built on an autoregressive architecture ā the same class of model that powers large language models, applied to video generation. Training it on 110,000 NVIDIA GB200 GPUs puts xAI in a very small group of organizations with the compute to run experiments at that scale. That infrastructure advantage compounds over time: more compute means faster iteration, which means more improvements per week.
The "Extend from Frame" feature released on March 2 is a good example of how xAI is thinking about practical usability rather than just benchmark performance. Generating a 15-second clip is useful; being able to chain clips together from a shared visual starting point is how you build longer-form content. It's a workflow feature, and workflow features are what turn a demo into a daily-use tool.
With 1.245 billion videos generated in a single 30-day window as of early February, Grok Imagine already has the usage data to train on real-world prompts at massive scale. That feedback loop ā billions of real generations informing the next model update ā is likely a significant part of why the improvement curve is so steep. The question isn't whether Grok Imagine will keep improving. It's how long before the gap between it and the rest of the field becomes difficult to close.





![BASENOR Phone Mount for 2025 2026 Tesla Model Y Juniper/Model 3 Highland, Dashboard Phone Holder Does Not Block View [No Adhesive][Dual Arms][360° Adjustable] Tesla Accessories Fit All Smartphone](http://www.basenor.com/cdn/shop/files/basenor-phone-mount-for-2025-2026-tesla-model-y-juniper-model-3-highland.jpg?v=1768393169&width=400)


