Sunday, 2 November 2025

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that can turn a still image or drawing of a person into a realistic video in which they appear to talk or sing. Given an existing audio file, the tool animates the photo with facial expressions, head movements, and lip movements closely synced to the speech or song in the audio.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There is significant potential for misuse, particularly in the creation of deepfake videos, and Microsoft’s researchers are aware of it. Consequently, they have decided against releasing any public demos, APIs, or additional implementation details until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They have not, however, described specific safeguards that would prevent malicious actors from using it for harmful purposes such as deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million utterances from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral “Paparazzi” rap, performed in the style of Lil Wayne, which is well worth a watch.
