Saturday, 6 December 2025

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that can transform a still image or drawing of a person into a realistic video in which they appear to talk or sing. Given an existing audio file, the tool animates the photo with facial expressions, head movements, and lip movements synced to the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There is significant potential for misuse, particularly in the creation of deepfake videos, and Microsoft’s researchers are well aware of it. Consequently, they have decided against releasing any public demos, APIs, or additional implementation details until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They have not described specific safeguards against malicious uses such as deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million utterances from 6,112 celebrities, extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral “Paparazzi” rap, performed in the style of Lil Wayne, which is well worth a watch.
