Friday, 29 August 2025
29.7 C
Singapore
29.5 C
Thailand
23.4 C
Indonesia
27.5 C
Philippines

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing audio file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

xAI makes Grok 2.5 open source as Grok 3 release nears

xAI makes its Grok 2.5 AI model open source on Hugging Face, with Elon Musk confirming Grok 3 will follow in six months.

Oyster malware campaign targets IT professionals with fake software tools

Oyster malware campaign targets IT professionals with fake tools like WinSCP and PuTTY, raising ransomware concerns.

PlayStation announces Ghost of Yotei Gold Limited Edition PS5 bundle

PlayStation unveils the Ghost of Yotei Gold Limited Edition PS5 bundle and accessories, with pre-orders set to open in Singapore on 4 September.

Casio introduces the MR-G MRG-B5000HT as a limited-edition art piece

Casio launches the MR-G MRG-B5000HT, a limited-edition G-Shock featuring hand-hammered titanium and Japanese craftsmanship.

ChatGPT referral traffic to websites drops by 52% in one month

ChatGPT referral traffic to websites has dropped 52% in a month, as Reddit and Wikipedia rise under OpenAI’s new citation weighting.

Kobo introduces Instapaper integration to replace Pocket on e-readers

Kobo replaces Pocket with Instapaper on its e-readers through a free firmware update, ensuring users maintain a seamless read-it-later experience.

Plaud.ai introduces Note Pro with smarter recording and AI-powered features

Plaud.ai unveils the Note Pro, a smarter AI note-taking device with improved recording, automation, and app integration, launching in October 2025.

Casio introduces the MR-G MRG-B5000HT as a limited-edition art piece

Casio launches the MR-G MRG-B5000HT, a limited-edition G-Shock featuring hand-hammered titanium and Japanese craftsmanship.

PlayStation announces Ghost of Yotei Gold Limited Edition PS5 bundle

PlayStation unveils the Ghost of Yotei Gold Limited Edition PS5 bundle and accessories, with pre-orders set to open in Singapore on 4 September.

Related Articles

Popular Categories