Wednesday, 17 December 2025
27.3 C
Singapore
30.9 C
Thailand
27.6 C
Indonesia
27.4 C
Philippines

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing audio file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

Google extends repair and warranty programme for Pixel 9 Pro and Fold devices

Google extends repair and warranty programmes for Pixel 9 Pro, Pixel 9 Pro XL, and Pixel 9 Pro Fold devices.

NVIDIA debuts Nemotron 3 family of open models for agentic AI

NVIDIA launches the open Nemotron 3 AI model family, targeting efficient, transparent multi-agent systems across enterprise and startup use cases.

Apple’s next AirTag could introduce major upgrades to tracking and battery features

Apple’s next AirTag may bring improved pairing, longer tracking range and better battery reporting, based on features found in iOS 26.

Samsung Galaxy Z TriFold sells out first batch, second waitlist opens in Singapore

Samsung’s Galaxy Z TriFold sells out its first batch in Singapore, with a second waitlist now open for the premium tri-fold phone.

DJI launches Neo 2, its lightest and most compact drone yet

DJI launches the Neo 2, a lightweight, compact drone with advanced shooting modes and obstacle avoidance.

Dishonored and Deus Ex lead reflects on Arkane Austin’s closure

Harvey Smith reflects on Arkane Austin’s closure, Redfall’s challenges, and the human cost of layoffs in today’s games industry.

LG introduces Micro RGB evo TV ahead of CES 2026

LG unveils its first Micro RGB evo TV for CES 2026, promising wider colour gamut, higher brightness, and LCD performance closer to OLED.

Apple’s next AirTag could introduce major upgrades to tracking and battery features

Apple’s next AirTag may bring improved pairing, longer tracking range and better battery reporting, based on features found in iOS 26.

Apple Studio Display 2 tipped to add 120Hz refresh rate and HDR support

Apple Studio Display 2 is tipped to feature 120Hz refresh rates, HDR support, and possibly mini-LED technology, with a launch expected in 2026.

Related Articles

Popular Categories