Saturday, 29 November 2025
25.2 C
Singapore
14.7 C
Thailand
21 C
Indonesia
27.1 C
Philippines

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing audio file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

Sony teases A7 V as next addition to its Alpha camera lineup

Sony hints at a possible A7 V launch with a 2 December teaser featuring a bold “V”, sparking rumours of major upgrades to the Alpha series.

POCO enters premium smartphone segment with new F8 series

POCO launches the F8 Ultra, F8 Pro, and two new tablets as it enters the premium flagship market with new performance and audio features.

The forgotten battle royale that ended a studio still deserved more than a one-month run

A look back at Radical Heights, the short-lived battle royale that showed promise but shut down after just one month.

Google limits free Nano Banana Pro image generation due to high demand

Google is reducing free Nano Banana Pro and Gemini 3 Pro usage due to high demand, limiting daily access while paid plans remain unchanged.

Honor launches Magic8 Pro in Singapore with new MagicBook Art 14 and Watch Fit

Honor launches the Magic8 Pro in Singapore with upgraded imaging, AI features and companion devices including the MagicBook Art 14 and Watch Fit.

Apple is expected to overtake Samsung as the world’s leading smartphone maker

Apple is projected to overtake Samsung as the world’s top smartphone maker, driven by strong iPhone 17 demand and upcoming device launches.

Singapore orders Apple and Google to stop spoofed government identities on messaging apps

Singapore orders Apple and Google to block spoofed government identities on messaging apps to curb rising impersonation scams.

Nintendo acquires Bandai Namco Studios Singapore

Nintendo acquires Bandai Namco Studios Singapore to boost game development and expand its subsidiary network.

Google DeepMind opens new AI research lab in Singapore to strengthen regional language capabilities

Google DeepMind opens a new AI lab in Singapore to boost regional language understanding, research partnerships, and real-world innovation.

Related Articles

Popular Categories