Wednesday, 29 October 2025
27.7 C
Singapore
24.5 C
Thailand
20.9 C
Indonesia
28.2 C
Philippines

OpenAI introduces a cutting-edge audio tool capable of replicating human voices

OpenAI unveils Voice Engine, an audio tool that accurately replicates human voices, showcasing the potential and ethical considerations of AI advancements.

OpenAI, a leading name in the artificial intelligence sector, has recently showcased its latest advancement: an audio tool with the ability to convert text into speech that sounds remarkably human. This development is at the forefront of AI technology, yet it also introduces potential concerns regarding the creation of deepfakes.

A cautious rollout amid ethical considerations

So far, the new tool, named Voice Engine, has been made available to a select group of around 10 developers. Despite plans for a broader release to potentially 100 developers, OpenAI decided to limit access after consulting with various stakeholders, including policymakers, industry specialists, and educational professionals. This cautious approach, detailed in a company blog post on March 29, reflects the potential ethical and safety implications, particularly in the context of an election year.

Voice Engine differs from previous audio content technologies by accurately replicating the voice of specific individuals, requiring only a 15-second audio sample. During a demonstration, Bloomberg experienced a clip in which OpenAI’s CEO, Sam Altman, discussed the technology in a voice generated by the AI that was virtually indistinguishable from his own.

However, OpenAI is proceeding with caution due to the precise nature of the voice replication, emphasizing the importance of safety in its use. The technology’s potential benefits were also highlighted, such as assisting patients at the Norman Prince Neurosciences Institute to regain their voices. In one instance, a young patient was able to speak clearly again for a school project after losing her voice to a brain tumour, thanks to the Voice Engine.

Expanding the potential of voice replication

Moreover, the tool’s capability extends to translating generated audio into various languages, proving useful for companies like Spotify in making content more accessible across different linguistic groups. OpenAI has outlined strict usage policies for its partners, including obtaining consent from the voice’s original owner and informing listeners that the speech they hear is AI-generated. Additionally, an inaudible audio watermark is being used to track the origin of audio clips.

As OpenAI considers wider release, it seeks feedback to gauge the global response to such technology, emphasizing the importance of public understanding and preparation for AI advancements. The firm is advocating for measures to increase societal resilience against the potential misuse of AI technologies, such as phasing out voice authentication in banks and educating the public on detecting AI-generated content.

Hot this week

Deel launches new tools to simplify year-end planning and payroll

Deel unveils year-end upgrades featuring AI-driven tools to simplify payroll, compliance, and workforce planning for global teams.

MacBook Pro M5 brings improved battery access but still faces limitations

Apple’s MacBook Pro M5 offers easier battery access and improved repairability, but limitations and performance concerns remain.

Rubrik introduces Agent Cloud to accelerate secure enterprise AI adoption

Rubrik launches Agent Cloud, a new platform enabling enterprises to monitor, govern, and undo AI agent actions across major platforms.

Keeper Security partners with Chillisoft to enhance privileged access protection in the South Pacific

Keeper Security and Chillisoft partner to enhance privileged access management and cybersecurity resilience across the South Pacific.

Veeam to acquire Securiti AI for US$1.725 billion to advance safe AI and data resilience

Veeam will acquire Securiti AI for US$1.725 billion to combine data resilience, AI trust, and security into one unified platform.

Adobe unveils new AI tools for Photoshop and Premiere Pro at Max 2025

Adobe unveils powerful new AI features for Photoshop, Premiere Pro, and Lightroom, enhancing creative control and streamlining editing workflows.

OXS launches Thunder Duo on Kickstarter as first studio-grade gaming speakers with true Dolby Atmos

OXS launches Thunder Duo on Kickstarter, a studio-grade gaming speaker series with true Dolby Atmos, modular design, and immersive 360° sound.

OpenAI outlines major improvements and new features for ChatGPT Atlas

OpenAI announces major updates to ChatGPT Atlas, including tab groups, user profiles, improved sidebar tools, and enhancements to Agent mode.

Clair Obscur fans speculate that the Expedition 33 update could introduce an evil Esquie boss fight

Fans speculate that Clair Obscur: Expedition 33's upcoming update may introduce a darker version of Esquie, following new artwork and social media hints.

Related Articles