Tuesday, 2 December 2025

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research Asia has unveiled VASA-1, an experimental AI tool that can turn still images or drawings of people into realistic videos in which they appear to talk or sing. Given an existing audio file, the tool animates a photo with facial expressions, head movements, and lip movements synced to the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There is significant potential for misuse, particularly in the creation of deepfake videos, and Microsoft’s researchers are aware of it. Consequently, they have decided against releasing any public demos, APIs, or additional implementation details until they can ensure the tool will be used responsibly and in accordance with appropriate regulations. They have not described specific safeguards against malicious uses such as deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained using the VoxCeleb2 dataset, which includes over 1 million utterances from 6,112 celebrities, extracted from YouTube videos. It works not just on real faces but also on artistic ones. One amusing example is an animation of the Mona Lisa synced with an audio clip of Anne Hathaway’s viral rendition of Lil Wayne’s “Paparazzi,” which is worth a watch.
