Friday, 13 December 2024

Microsoft’s AI could soon make your photos talk and sing

Explore how Microsoft's new AI tool VASA-1 can bring your photos to life by creating realistic videos of them talking and singing.

Microsoft Research has unveiled VASA-1, an experimental tool that can turn a still image or drawing of a person into a realistic video in which they appear to talk or sing. Given an existing audio file, the tool animates the image with facial expressions, head movements, and lip movements synced to the audio’s speech or song.

On the project’s webpage, you can find numerous examples that showcase how lifelike these animations can be. Although some lip and head movements might still look a bit mechanical and not perfectly in sync, the overall effect is convincing enough that it could easily be mistaken for real footage.

There is significant potential for misuse, particularly in the creation of deepfake videos, and Microsoft’s researchers are well aware of it. Consequently, they have decided against releasing any public demo, API, or additional implementation details until they are certain the tool will be used responsibly and in accordance with proper regulations. They have not described specific safeguards against malicious uses such as deepfake pornography or misinformation campaigns.

Despite these concerns, the technology promises several beneficial applications. It could enhance educational equity and improve accessibility for individuals with communication challenges by giving them access to an avatar that can communicate on their behalf. Additionally, this tool could provide companionship and therapeutic support, especially in programmes that offer interactions with AI-powered characters.

VASA-1 was trained on the VoxCeleb2 dataset, which contains over 1 million utterances from 6,112 celebrities, extracted from YouTube videos. It works not just on real faces but also on artistic ones. One amusing example animates the Mona Lisa in sync with an audio clip of Anne Hathaway’s viral “Paparazzi” rap, which she performed in the style of Lil Wayne.
