Thursday, 18 September 2025
31 C
Singapore
32.7 C
Thailand
25 C
Indonesia
28.6 C
Philippines

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has just unveiled VASA-1, an experimental AI tool that could transform still images or drawings of people into realistic videos where they appear to talk or sing. Using an existing audio file, this tool can animate your photos with facial expressions, head movements, and perfectly synced lip movements that match the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There’s a significant potential for misuse, particularly in the creation of deepfake videos, which is something Microsoft’s researchers are quite aware of. Consequently, they have decided against releasing any public demos, APIs, or additional details about the implementation until they can ensure the tool will be used responsibly and in accordance with stringent regulations. They haven’t mentioned specific safeguards to prevent misuse by malicious actors for harmful purposes like creating deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million spoken expressions from 6,112 celebrities extracted from YouTube videos. Interestingly, it works not just on real faces but also on artistic ones. An amusing example is the animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is quite delightful and worth a watch.

Hot this week

GitLab survey shows AI software innovation could unlock over S$6 billion in Singapore

GitLab survey finds AI software innovation could generate over S$6 billion annually in Singapore, with skills and governance key to success.

StarHub introduces dynamic ad pods for live TV advertising in Singapore

StarHub launches Dynamic Ad Pods in Singapore, bringing personalised, real-time ad replacement to live broadcast TV.

Epson Southeast Asia highlights circular economy progress in sustainability report

Epson’s FY2024 Southeast Asia sustainability report highlights emissions cuts, circular economy gains, and community programmes.

Keeper Security publishes back-to-school cybersecurity guide for schools and families

Keeper Security has launched a back-to-school cybersecurity guide to help schools and families strengthen digital safety against rising threats.

Beijing AIForce Technology wins PepsiCo’s 2025 Greenhouse Accelerator in Asia Pacific

Beijing AIForce Technology wins PepsiCo’s 2025 Greenhouse Accelerator in Asia Pacific with its autonomous low-carbon tractors.

Half of Singapore workers face financial strain as demand for pay flexibility rises

Half of Singapore’s workforce is financially vulnerable, with rising demand for flexible pay and payroll teams struggling under mounting pressure.

IBS Software and Emirates Skywards launch new loyalty platform partnership

IBS Software and Emirates Skywards launch iLoyal, a next-gen loyalty platform serving 35 million members with enhanced digital experiences.

GitLab survey shows AI software innovation could unlock over S$6 billion in Singapore

GitLab survey finds AI software innovation could generate over S$6 billion annually in Singapore, with skills and governance key to success.

New Relic study shows IT outages cost Southeast Asian firms up to US$165.5 million a year

A New Relic report finds IT outages cost Southeast Asian firms up to US$165.5m yearly, with AI driving demand for observability.

Related Articles

Popular Categories