Wednesday, 26 November 2025

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos in which they appear to talk or sing. Using an existing audio file, the tool animates a photo with facial expressions, head movements, and lip movements synced to the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There is significant potential for misuse, particularly in the creation of deepfake videos, and Microsoft’s researchers are well aware of it. Consequently, they have decided against releasing any public demos, APIs, or additional implementation details until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They have not mentioned specific safeguards against malicious uses such as deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million utterances from 6,112 celebrities, extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is well worth a watch.
