Saturday, 5 July 2025

AI startup Sesame unveils base model for its voice assistant

AI startup Sesame has released CSM-1B, the base model behind its voice assistant Maya, raising concerns over voice cloning risks and safeguards.

Sesame, the AI startup behind the widely discussed virtual assistant Maya, has released the base model that powers its advanced voice technology. The company’s new AI model, CSM-1B, is now available under an Apache 2.0 licence, meaning it can be used commercially with minimal restrictions.

The model has 1 billion parameters, the learned numerical weights a model uses to process inputs and generate outputs. According to Sesame, CSM-1B produces “RVQ audio codes” from text and audio inputs. RVQ stands for Residual Vector Quantisation, a technique that encodes audio as sequences of discrete tokens called codes. It is commonly used in AI-powered audio tools, including Google’s SoundStream and Meta’s EnCodec.
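To make the idea of residual quantisation concrete, here is a minimal sketch in Python. It is not Sesame's code: the two tiny codebooks, the two-dimensional input vector, and the function names are all invented for illustration. Production codecs learn much larger codebooks from audio data. The first stage picks the closest entry to the input, and each later stage encodes whatever error the previous stage left behind.

```python
# Toy residual vector quantisation (RVQ). Codebooks are hand-written
# assumptions for illustration, not values from any real audio codec.
CODEBOOKS = [
    # Stage 1: coarse grid
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    # Stage 2: small corrections applied to the stage 1 residual
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0], [0.0, -0.25]],
]


def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def rvq_encode(vec, codebooks):
    """Quantise vec stage by stage; each stage encodes the remaining residual."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen entry so the next stage sees only the leftover error
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct the vector by summing the chosen entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


codes = rvq_encode([0.9, 0.2], CODEBOOKS)
print(codes)                          # [1, 2]
print(rvq_decode(codes, CODEBOOKS))   # [1.0, 0.25]
```

In this toy case the input [0.9, 0.2] is stored as just two small integers, one per stage, and decoding them gives the approximation [1.0, 0.25]. A real codec applies the same idea to each short frame of audio, which is how a waveform becomes a sequence of codes a language model can predict.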

How Sesame’s AI model works

CSM-1B combines Meta’s Llama language model with an audio decoder, creating a system capable of generating realistic speech. While the base model can produce various voices, Sesame notes that it is not specifically fine-tuned for any particular voice. However, the company has developed a refined version that powers its virtual assistant, Maya.

Sesame acknowledges that the model has some capacity to understand and generate non-English languages, but this is limited due to the nature of its training data. The company has not disclosed details about the datasets used to train CSM-1B.

Despite the model’s capabilities, Sesame has implemented very few safeguards. Developers and users are asked to follow an honour system and refrain from using the model to replicate voices without consent, spread misinformation, or engage in harmful activities. There are no built-in restrictions to prevent misuse.

Concerns over voice cloning risks

A hands-on test of the CSM-1B demo hosted on the AI platform Hugging Face showed how quickly the model can replicate a person’s voice. The cloning process took less than a minute, and from there, generating speech on various topics, including politically sensitive issues like elections and Russian propaganda, was effortless.

Consumer Reports recently raised concerns about the growing number of AI-powered voice cloning tools, warning that many lack meaningful safeguards to prevent fraud or abuse. The rapid development of these technologies has sparked discussions about the potential risks of deepfake audio and misinformation.

Sesame was co-founded by Brendan Iribe, best known as the co-creator of Oculus. The company gained widespread attention in February when Maya and its other AI assistant, Miles, were unveiled. Unlike traditional virtual assistants, these AI voices take breaths, pause naturally, and can even be interrupted mid-sentence. These features are similar to those of OpenAI’s Voice Mode, which also aims to make AI interactions more human-like.

Sesame has secured funding from major investors, including Andreessen Horowitz, Spark Capital, and Matrix Partners. In addition to developing AI voice assistants, the company is also working on AI-powered smart glasses. These wearable devices, designed for all-day use, will integrate Sesame’s custom AI models to enhance user interactions.

As AI voice technology evolves, concerns over ethical use and security risks remain. With CSM-1B now open to the public, it remains to be seen how developers will use it, and whether safeguards will eventually be put in place to prevent misuse.
