Sunday, 16 November 2025
28 C
Singapore
32.8 C
Thailand
25.1 C
Indonesia
28.9 C
Philippines

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing audio file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

Singapore FinTech Festival 2025 marks 10 years with focus on the next decade of finance

Singapore FinTech Festival 2025 celebrates its 10th year, spotlighting AI, tokenisation, and quantum technologies shaping global finance.

GovWare 2025 closes with focus on AI security, quantum risks and regional cyber resilience

GovWare 2025 closes with global leaders discussing AI security, quantum risks and the need for stronger regional cyber resilience.

ASUS opens pre-orders for ROG x Hatsune Miku gaming PC in Singapore

ASUS opens pre-orders in Singapore for its themed ROG x Hatsune Miku gaming PC and peripherals bundle.

Businesses report rising revenue loss from inefficient tech as AI adoption grows

New research shows two in five global businesses face revenue loss due to tech inefficiencies, with many turning to AI to improve productivity.

Singapore businesses expand globally as one in four sell internationally with PayPal

One in four Singapore businesses now sell internationally via PayPal, led by gaming, beauty, and fashion exports worth over US$1.6B.

vivo X300 Pro review: A flagship built for serious photography

A detailed look at the vivo X300 Pro’s camera system, design, battery life and everyday performance in real-world use.

Businesses report rising revenue loss from inefficient tech as AI adoption grows

New research shows two in five global businesses face revenue loss due to tech inefficiencies, with many turning to AI to improve productivity.

Meta announces Southeast Asia’s most impactful Reels campaigns and creators

Meta highlights brands and creators shaping Southeast Asia’s short-form video landscape at the 2025 Reels Impact Awards.

Toyota Gazoo Racing Asia brings 2025 Esports GT Championship Finals to Thailand

Toyota Gazoo Racing Asia brings the 2025 Esports GT Championship Finals to Thailand, featuring top sim drivers and an expanded racing programme.

Related Articles

Popular Categories