Alibaba has released Wan2.1-VACE, a new open-source AI model for video creation and editing. An extension of Alibaba’s existing Wan2.1 video AI family, VACE is described by the company as the “first open-source model in the industry to provide a unified solution for various video generation and editing tasks.” If it succeeds in consolidating multiple tools into a single model, it could significantly streamline workflows for creators and businesses.
What Can Wan2.1-VACE Do?
VACE can generate videos from diverse inputs such as text descriptions, still images, and even existing video clips. Beyond creation, its editing toolkit supports referencing images or specific frames for guidance, advanced video “repainting,” precise adjustments to selected video segments, and video expansion.
Key features highlighted include:
- Generating videos with specific characters based on reference photos.
- Adding natural movement to still images.
- Advanced “video repainting” for tasks like pose transfer, motion control, depth adjustment, and color changes.
- Selective editing of specific areas without affecting the surroundings.
- Expanding the video canvas and intelligently filling in new areas.
- Turning flat photographs into dynamic videos with controlled object movement.
- Swapping or animating referenced characters and objects with precise pose control.
- Intelligently expanding vertical images into widescreen videos by adding relevant content.
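The last item, widening a vertical image, can be pictured as a canvas-and-mask layout: the original pixels sit in the middle of a wider frame, and a mask marks the side regions the model has to fill. The sketch below is only an illustration of that idea under assumed conventions (a centered source, a 16:9 target, and 1 meaning "generate"). The function name is hypothetical and this is not VACE's actual code.

```python
def widescreen_canvas(src_w, src_h, ratio=(16, 9)):
    """Center a src_w x src_h image on a canvas of the given aspect ratio.

    Returns (canvas_w, canvas_h, x_offset, mask), where mask[y][x] is
    1 for pixels that would need to be generated and 0 for original pixels.
    The 1/0 convention is an assumption made for this illustration.
    """
    rw, rh = ratio
    canvas_h = src_h
    # Ceiling division so the canvas is never narrower than the target ratio;
    # max() leaves images that are already wide enough untouched.
    canvas_w = max(src_w, (src_h * rw + rh - 1) // rh)
    x_offset = (canvas_w - src_w) // 2
    right_pad = canvas_w - src_w - x_offset
    row = [1] * x_offset + [0] * src_w + [1] * right_pad
    mask = [row[:] for _ in range(canvas_h)]
    return canvas_w, canvas_h, x_offset, mask


if __name__ == "__main__":
    # A tiny 9x16 "portrait" image keeps the printed mask row readable.
    w, h, off, mask = widescreen_canvas(9, 16)
    print(w, h, off)
    print("".join(str(v) for v in mask[0]))
```

In a real pipeline the mask would be an image or tensor at full resolution; the layout arithmetic stays the same.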
The Tech Behind VACE
To handle complex video tasks, VACE relies on a few purpose-built components. A core one is the Video Condition Unit (VCU), designed for unified processing of multimodal inputs like text, images, video, and masks. Additionally, a “Context Adapter structure” helps the AI understand and process temporal and spatial dimensions within the video, enabling more nuanced task execution.
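One way to picture a unified input like the VCU is a single container that holds a text prompt, a sequence of frames, and a matching sequence of masks, with each task simply filling those fields differently. The following is a simplified sketch under that assumption. The class, field, and function names are hypothetical, the mask convention (1 = generate, 0 = keep) is assumed, and nothing here reflects Alibaba's actual implementation.

```python
from dataclasses import dataclass
from typing import List

Frame = List[List[int]]  # toy single-channel frame: rows of pixel values


def blank(height: int, width: int, value: int) -> Frame:
    """Build a frame (or mask) filled with a single value."""
    return [[value] * width for _ in range(height)]


@dataclass
class ConditionUnit:
    """Hypothetical stand-in for a unified condition: text + frames + masks."""
    prompt: str
    frames: List[Frame]
    masks: List[Frame]  # assumed convention: 1 = generate, 0 = keep

    def __post_init__(self) -> None:
        if len(self.frames) != len(self.masks):
            raise ValueError("frames and masks must have the same length")


def from_text(prompt: str, n_frames: int, height: int, width: int) -> ConditionUnit:
    """Text-only input: empty frames, everything marked for generation."""
    frames = [blank(height, width, 0) for _ in range(n_frames)]
    masks = [blank(height, width, 1) for _ in range(n_frames)]
    return ConditionUnit(prompt, frames, masks)


def from_image(prompt: str, image: Frame, n_frames: int) -> ConditionUnit:
    """Still-image input: keep the first frame, generate the rest."""
    height, width = len(image), len(image[0])
    rest = n_frames - 1
    frames = [image] + [blank(height, width, 0) for _ in range(rest)]
    masks = [blank(height, width, 0)] + [blank(height, width, 1) for _ in range(rest)]
    return ConditionUnit(prompt, frames, masks)


def from_masked_video(prompt: str, video: List[Frame], masks: List[Frame]) -> ConditionUnit:
    """Selective edit: the caller supplies both the clip and per-frame masks."""
    return ConditionUnit(prompt, list(video), list(masks))
```

The appeal of this kind of design is that text-to-video, image-to-video, and masked editing all reach the model through one interface, so a single model can serve tasks that would otherwise need separate tools.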
Potential Applications
Alibaba anticipates VACE being valuable across numerous sectors, including:
- Creating quick social media clips.
- Generating eye-catching advertisements and marketing content.
- Handling heavy-duty post-production special effects for film and TV.
- Developing custom educational and training videos.
Open Source: Lowering the Barrier to Entry
Making such a powerful AI model open source is a significant move, especially considering the typical cost and resources required for development. Alibaba states that “Open access helps lower the barrier for more businesses to leverage AI, enabling them to create high-quality visual content tailored to their needs, quickly and cost-effectively.” This initiative aims to democratize access to high-tier AI tools, benefiting smaller businesses and individual creators who may lack extensive resources.
Wan2.1-VACE is available in two versions: a powerful 14-billion parameter model and a lighter 1.3-billion parameter model. Both are freely accessible on platforms like Hugging Face, GitHub, and Alibaba Cloud’s ModelScope community.

