Tuesday, 29 April 2025
27.8 C
Singapore
30.2 C
Thailand
20.4 C
Indonesia
29.2 C
Philippines

AI startup Sesame unveils base model for its voice assistant

AI startup Sesame has released CSM-1B, the base model behind its voice assistant Maya, raising concerns over voice cloning risks and safeguards.

Sesame, the AI startup behind the widely discussed virtual assistant Maya, has released the base model that powers its advanced voice technology. The company’s new AI model, CSM-1B, is now available under an Apache 2.0 licence, meaning it can be used commercially with minimal restrictions.

The model is built with 1 billion parameters, referring to the individual components that help process and generate responses. According to Sesame, CSM-1B produces “RVQ audio codes” from text and audio inputs. This process, known as Residual Vector Quantisation (RVQ), converts audio into digital tokens called codes. RVQ is commonly used in AI-powered audio tools, including Google’s SoundStream and Meta’s Encodec.

How Sesame’s AI model works

CSM-1B combines Meta’s Llama language model with an audio decoder, creating a system capable of generating realistic speech. While the base model can produce various voices, Sesame notes that it is not specifically fine-tuned for any particular voice. However, the company has developed a refined version that powers its virtual assistant, Maya.

Sesame acknowledges that the model has some capacity to understand and generate non-English languages, but this is limited due to the nature of its training data. The company has not disclosed details about the datasets used to train CSM-1B.

Despite this AI’s impressive capabilities, Sesame has implemented very few safeguards. Developers and users are urged to follow an honour system, refraining from using the model to replicate voices without consent, spread misinformation, or engage in harmful activities. However, there are no built-in restrictions to prevent misuse.

Concerns over voice cloning risks

A hands-on test of CSM-1B’s Hugging Face demo on the AI platform revealed how quickly it can replicate a person’s voice. The cloning process took less than a minute, and from there, generating speech on various topics, including politically sensitive issues like elections and Russian propaganda, was effortless.

Consumer Reports recently raised concerns about the growing number of AI-powered voice cloning tools, warning that many lack meaningful safeguards to prevent fraud or abuse. The rapid development of these technologies has sparked discussions about the potential risks of deepfake audio and misinformation.

Sesame was co-founded by Brendan Iribe, best known as the co-creator of Oculus. The company gained widespread attention in February when Maya and its other AI assistant, Miles, were unveiled. Unlike traditional virtual assistants, these AI voices take breaths, pause naturally, and can even be interrupted mid-sentence—features similar to OpenAI’s Voice Mode, which aims to make AI interactions more human-like.

Sesame has secured funding from major investors, including Andreessen Horowitz, Spark Capital, and Matrix Partners. In addition to developing AI voice assistants, the company is also working on AI-powered smart glasses. These wearable devices, designed for all-day use, will integrate Sesame’s custom AI models to enhance user interactions.

As AI voice technology evolves, concerns over ethical use and security risks remain. With CSM-1B now open to the public, it is yet to be seen how developers will use it—and whether safeguards will eventually be put in place to prevent misuse.

Hot this week

Bowers & Wilkins unveil updated headphones and McLaren-themed earbuds in Singapore

Bowers & Wilkins launches Px7 S3 headphones and Pi8 McLaren earbuds in Singapore. These headphones blend high-quality sound with comfort and stylish design.

Preorders for the Nintendo Switch 2 officially begin on April 24

Preorders for the Nintendo Switch 2 open on April 24. The device's launch date is June 5, and it offers new features, games, and accessories to explore.

Anthropic aims to uncover how AI models think by 2027

Anthropic CEO Dario Amodei aims to understand how AI models work by 2027 and urges industry-wide action for safety and transparency.

Bitdefender launches GravityZone PHASR to combat stealthy endpoint threats

Bitdefender unveils GravityZone PHASR, a dynamic endpoint security tool that reduces attack surfaces using behaviour-based automation.

Anbernic stops US shipments amid rising tariff concerns

Anbernic halts US shipments due to rising tariffs, urging customers to order from its US warehouse to avoid high import duties.

Nintendo Pop-Up Store and Mario Kart Fun Return to Jewel Changi Airport

Experience the magic of Nintendo at Jewel Changi Airport with the return of the Pop-Up Store and the exciting Mario Kart Jewel Circuit Challenge!

Lian Li’s new Lancool 207 Digital case brings a 6-inch LCD screen to your PC

Lian Li's Lancool 207 Digital PC case brings a bright 6-inch LCD screen to your setup, offering style, function, and full customisation.

Google to end support for early Nest thermostats on October 25

Google will stop supporting first—and second-generation Nest thermostats on October 25 and end new Nest launches in Europe.

DeepMind team in London seeks to unionise over AI concerns

DeepMind employees in London seek to unionise with the Communication Workers Union over concerns about Google’s AI policies and military contracts.

Related Articles

Popular Categories