
On February 25, 2025, Alibaba Cloud announced that Wan 2.1, their advanced AI video generation model, is now open-source. This model, part of the Tongyi series, transforms text prompts into high-quality videos, excelling in complex movements and spatial relationships. Its open-sourcing marks a significant step towards democratizing AI-driven video creation.
Wan 2.1 enters a competitive landscape dominated by models such as OpenAI’s Sora, Minimax, Kuaishou’s Kling, and Google’s Veo 2.
With a leading VBench score of 84.7%, multilingual support for Chinese and English, and the promise of free access, Wan 2.1 is poised to make a substantial impact. This article delves into Wan 2.1’s performance, features, pricing, user experience, and how it compares to its rivals, ultimately exploring what its open-source status means for the future of AI video generation.
Sign up for Anakin AI today at anakin.ai and start for free. Join creators worldwide and get ready for Wan 2.1 — stay ahead with Anakin!

Wan 2.1: What Makes It Stand Out?
Wan 2.1, also known as Wanx 2.1 or Tongyi Wanxiang, is part of Alibaba’s Tongyi series of multimodal AI models. It turns text prompts into stunning videos, mastering complex movements — like figure skating or multi-object interactions — and spatial relationships. Per Alibaba and the VBench leaderboard, it leads with an 84.7% score, outshining rivals in dynamic motion, smoothness, and aesthetics, delivering lifelike visuals of twirling dancers or racing cars that match your descriptions.
A standout feature is its multilingual support for Chinese and English, perfect for creators targeting diverse markets. It produces crisp 1080p videos at 30 FPS, rivaling professional-grade visuals. Announced as open-source on February 25, 2025, Wan 2.1 could inspire innovation akin to Stable Diffusion, raising questions about its real-world impact and edge over competitors.
Comparing Wan 2.1 to Sora, Minimax, Kling, and Google Veo 2
To understand Wan 2.1’s place in the AI video generation landscape, let’s stack it up against some of the biggest names: OpenAI’s Sora, Minimax, Kling (from Kuaishou), and Google’s Veo 2. Each of these models has its strengths, but Wan 2.1 brings unique advantages that could make it a top contender.
OpenAI’s Sora

Sora, launched in late 2024, generates 20-second, high-quality videos for social media and marketing, bundled with ChatGPT Plus/Pro ($20/$200 monthly). Pro users get higher resolutions, and a holiday promotion offered unlimited generations; it excels at short, stunning clips. Limited to English prompts and short durations, it scores 82% on VBench, while Wan 2.1’s 84.7% and multilingual support suggest greater versatility for longer, complex sequences.
Minimax

Minimax, a Chinese multimodal AI, pairs video generation with efficient text-to-image capabilities. It trails Wan 2.1 in dynamic scenes per VBench, and its global access is restricted, often requiring enterprise subscriptions. Wan 2.1’s open-source edge broadens its reach, though Minimax’s speed may give it an advantage in niche real-time editing.
Kling (Kuaishou)

Kling, from Kuaishou, shines in short-form content like Douyin with high-res outputs and smooth motion. Limited to China and not open-source, it’s less global than Wan 2.1, which offers multilingual support and broader potential, though Kling benefits from Kuaishou’s regional ecosystem.
Google Veo 2

Veo 2, released post-Sora in December 2024, targets U.S. early access users via Google Cloud’s Vertex AI, with usage-based pricing (e.g., $0.00003/1k characters). Strong in realism and aesthetics, its 83% VBench score lags Wan 2.1’s 84.7%. Wan 2.1’s open-source flexibility appeals to developers, while Veo 2 suits enterprise needs with robust infrastructure.
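The VBench figures quoted in these comparisons can be lined up in a quick sketch. Note the scores are the ones this article cites (the Sora and Veo 2 numbers in particular are the article’s claims, not independently verified benchmark results):

```python
# VBench overall scores (percent) as cited in this article.
scores = {
    "Wan 2.1": 84.7,
    "Google Veo 2": 83.0,
    "OpenAI Sora": 82.0,
}

# Rank the models from highest to lowest reported score.
ranking = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
for name, score in ranking:
    print(f"{name}: {score:.1f}%")
```

On these reported numbers, Wan 2.1 tops the list by 1.7 points over Veo 2 and 2.7 over Sora, which is the margin behind the article’s “leading VBench score” framing.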
How Good Will Wan 2.1 Be?
Based on its VBench leadership and technical specs, Wan 2.1 is poised to be one of the best open-source AI video models available. Its 84.7% VBench score indicates it handles complex motions, multi-object interactions, and spatial relationships better than most competitors. For creators, this means videos with fewer artifacts, smoother transitions, and more accurate adherence to prompts — whether you’re animating a sci-fi battle or a serene nature scene.
However, no model is perfect. Wan 2.1 might struggle with extremely long sequences or highly abstract prompts, areas where proprietary models like Sora or Veo 2 could have an edge due to their closed-system optimizations. Its open-source nature also means its performance could vary depending on how developers fine-tune it, potentially leading to inconsistent results if not managed well. Still, for most users — especially those in creative industries or small businesses — Wan 2.1’s combination of quality, accessibility, and cost-effectiveness looks incredibly promising.
Pricing, Performance, Features, and User Experience
Pricing
How much will Wan 2.1 cost? As an open-source model, its core is likely free to use, download, and modify, like Stable Diffusion or LLaMA. Alibaba Cloud may charge for premium features, cloud hosting, or API access — possibly $100–$500/month for enterprise use, similar to Sora or Veo 2. It’s currently free on Alibaba’s Model Studio, and community hosting could further cut costs for hobbyists.
Performance
Wan 2.1 excels with an 84.7% VBench score, handling dynamic scenes, spatial consistency, and aesthetics well. It delivers 1080p at 30 FPS with realistic motion via its space-time attention mechanism, leading open-source video generation, though proprietary models like Sora might edge it in specific areas.
Features
Wan 2.1 supports multilingual text prompts (Chinese and English) plus customizable resolution and frame rates, and it integrates with Alibaba’s ecosystem. It’s user-friendly, requiring no technical skills, and community development of the open-source release could add features over time, though it may lack proprietary perks like real-time editing.
User Experience
Its interface is simple — input text, tweak settings, and download videos — great for non-technical users. Open-source setup might need some know-how, unlike Sora’s ChatGPT ease or Veo 2’s Google Cloud polish, but community growth could enhance it over time.
The Open-Source Announcement: A Turning Point
The February 25, 2025, X announcement by Alibaba_Wan (“This is not a drill — Wan 2.1 OPEN SOURCE is finally here!”) has sparked global excitement. The 11:00 PM UTC+8 live broadcast likely covered open-source terms, availability, and restrictions. This could make AI video generation accessible to hobbyists, startups, and researchers, similar to Stable Diffusion’s impact, though open-source release carries risks such as inconsistent quality or security issues if not managed well.
For users, it offers a powerful, potentially free tool, but raises concerns about long-term support, updates, and enterprise reliability. Alibaba’s AI and cloud reputation suggests solid backing, though the community’s role will be crucial for Wan 2.1’s success.
Conclusion: Wan 2.1’s Place in the AI Video Revolution
Wan 2.1 isn’t just another AI model — it’s a potential game-changer, especially with its open-source status. Its VBench-leading performance, multilingual support, and accessibility make it a strong rival to Sora, Minimax, Kling, and Veo 2, particularly for creators and developers seeking flexibility and cost savings. While it might not match proprietary models in every niche (e.g., real-time editing or enterprise integrations), its open-source nature could foster a vibrant community, driving innovation and adoption.
For everyday users, Wan 2.1 offers a chance to create professional-quality videos with minimal effort and cost, whether you’re a marketer crafting ads, an educator making lessons, or a filmmaker experimenting with new ideas. Its pricing (potentially free or low-cost for premium features), top-tier performance, and user-friendly experience position it as a must-watch in the AI video space. As the live broadcast unfolds and more details emerge, Wan 2.1 could redefine how we think about AI-driven creativity — open, accessible, and endlessly adaptable.
from Anakin Blog http://anakin.ai/blog/wan-2-1-open-source-alibabas-game-changing-ai-video-model-takes-on-sora-minimax-kling-and-google-veo-2/