Friday, 28 November 2025
27.9 C
Singapore
14.5 C
Thailand
20.6 C
Indonesia
28.1 C
Philippines

Alibaba introduces open-source model for digital human video generation

Alibaba launches open-source Wan2.2-S2V model, enabling lifelike digital human video generation from portraits and audio.

Alibaba has unveiled Wan2.2-S2V, an open-source speech-to-video model designed to generate digital human videos. The technology enables users to convert portrait photos into film-quality avatars capable of speaking, singing, and performing, broadening the possibilities for professional content creation.

Expanding video creation capabilities

Part of the Wan2.2 video generation series, Wan2.2-S2V allows creators to animate videos using a single image and an audio clip. It supports multiple framing options including portrait, bust, and full-body perspectives, and can dynamically generate character actions and environmental details based on prompts.

The model is powered by advanced audio-driven animation technology that delivers natural and expressive performances, from dialogue to musical pieces. It also supports scenes featuring multiple characters and a wide range of avatars, including cartoon, animal, and stylised designs.

To meet varied production needs, the tool provides flexible output resolutions of 480P and 720P. This makes it suitable for both professional presentations and social media content while ensuring quality visuals for different creative contexts.

Combining innovation and efficiency

Wan2.2-S2V improves upon traditional talking-head animation by merging text-guided global motion control with audio-driven fine-grained local movements. This combination allows for expressive and lifelike performances across complex scenarios.

A notable advancement lies in its frame processing approach. By compressing historical frames of any length into a single latent representation, the model reduces computational demands and ensures stability in long-video generation, addressing a common challenge for extended animated productions.

Alibaba’s research team also built a large-scale audio-visual dataset tailored to film and television scenarios to train the model. Using multi-resolution training, it supports video creation in diverse formats, from short-form vertical content to conventional horizontal film and television outputs.

Commitment to open-source community

The Wan2.2-S2V model is available for download on Hugging Face, GitHub, and Alibaba Cloud’s ModelScope. Alibaba has been steadily contributing to the open-source ecosystem, previously releasing Wan2.1 models in February 2025 and Wan2.2 models in July. Together, the Wan series has recorded over 6.9 million downloads across Hugging Face and ModelScope.

Alibaba said the release reflects its ongoing efforts to support professional creators with advanced AI tools while contributing to the wider developer community.

Hot this week

Singapore consumers show growing interest in AI shopping companions

Research shows rising consumer interest in AI shopping agents in Singapore, with strong demand for cost savings and secure automation.

Warner Music ends lawsuit against Suno after reaching new licensing agreement

Warner Music ends its lawsuit against Suno after securing a licensing deal that gives artists opt-in control over AI-generated music.

Google warns staff of rapid scaling demands to keep pace with AI growth

Google tells staff it must double AI capacity every six months as leaders warn of rapid growth, rising demand, and tough years ahead.

Apple to prioritise performance and AI upgrades in iOS 27

Apple is expected to focus on performance improvements and stronger AI features in iOS 27, shifting from major redesigns to software refinement.

Global mobile gaming ads surge in 2025 as AI and interactivity reshape engagement

Mobile gaming ads grew strongly in 2025 as AI-driven optimisation and interactive formats reshaped global user acquisition strategies.

ShadowV2 botnet spotted during AWS outage, researchers warn of possible return

ShadowV2 botnet briefly emerged during the AWS outage, targeting IoT devices, raising concerns about future cyberattacks.

Battlefield 6 launches week-long free-to-play trial for new players

Battlefield 6 launches a week-long free trial with multiple playlists, map access, and progress carryover ahead of its Winter Offensive update.

Sony announces December PS Plus Monthly Games lineup featuring five titles

Sony unveils a five-game PS Plus lineup for December, including Lego Horizon Adventures, Neon White, and several horror titles.

Global mobile gaming ads surge in 2025 as AI and interactivity reshape engagement

Mobile gaming ads grew strongly in 2025 as AI-driven optimisation and interactive formats reshaped global user acquisition strategies.

Related Articles

Popular Categories