Tuesday, 26 August 2025

Embedded LLM and AMD launch TokenVisor to boost AI monetisation for GPU neoclouds

Embedded LLM and AMD launch TokenVisor, a platform enabling monetisation and management of AMD GPU clusters for LLM workloads.

Embedded LLM has unveiled TokenVisor, a new GPU monetisation and management platform co-launched with AMD, designed to help neocloud providers and enterprises manage and monetise AMD GPU clusters for Large Language Model (LLM) workloads. The announcement was made during the Advancing AI 2025 event, held on 12 June in Santa Clara, California.

Supporting monetisation and governance for GPU-powered neoclouds

TokenVisor is the first control plane built specifically for the AMD GPU-powered neocloud ecosystem. It enables users to manage LLM workloads more effectively while accelerating time-to-revenue. The platform helps providers streamline deployment, billing, and governance, offering a clear path to Return on Investment (ROI).

Key features include automated resource allocation, real-time usage monitoring, rate-limiting policies, and support for custom pricing. These capabilities allow GPU owners to efficiently commercialise their infrastructure while giving enterprise clients tools to enforce internal cost control and compliance.

Early adopters have reported significant benefits, including faster monetisation after hardware installation and better support for popular LLM and multi-modal models. The combination of technical responsiveness and comprehensive model compatibility has been cited as a major strength by users looking to quickly recover AI infrastructure investments.

Born out of community collaboration and open-source values

TokenVisor was developed in consultation with the AMD GPU neocloud community, embracing the collaborative ethos showcased at Advancing AI 2025. The platform reflects Embedded LLM’s commitment to empowering the decentralised AI ecosystem with enterprise-grade solutions.

“TokenVisor is the hypervisor for the AI Token era – unlocking decentralised GPU computing’s potential requires tools as powerful and flexible as the hardware,” said Ooi Ghee Leng, CEO of Embedded LLM. “Co-launched at Advancing AI 2025, an event that celebrates AI innovation and open-source collaboration, marks an important milestone for the AMD GPU neocloud community.”

Mahesh Balasubramanian, Senior Director of Product Marketing, Data Center GPU Business at AMD, added, “TokenVisor brings powerful new capabilities to the AMD GPU neocloud ecosystem, helping providers efficiently manage and monetise LLM workloads.”

Strengthening Singapore’s AI and cloud innovation goals

Based in Singapore, Embedded LLM is part of the country’s expanding deep tech sector and supports the national goal of positioning Singapore as a hub for AI and cloud infrastructure in Southeast Asia. The launch of TokenVisor contributes to Singapore’s push for AI sovereignty and regional leadership in digital innovation.

Embedded LLM continues to develop LLM platforms aimed at making generative AI more accessible. It is an active contributor to open-source tools, including vLLM enhancements for AMD ROCm and orchestration platforms such as JamAI Base. With TokenVisor, the company offers a practical way to monetise and manage GPU clusters in a decentralised AI environment.
