Sunday, 12 October 2025
28.1 C
Singapore
27.2 C
Thailand
19.8 C
Indonesia
27.4 C
Philippines

Embedded LLM and AMD launch TokenVisor to boost AI monetisation for GPU neoclouds

Embedded LLM and AMD launch TokenVisor, a platform enabling monetisation and management of AMD GPU clusters for LLM workloads.

Embedded LLM has unveiled TokenVisor, a new GPU monetisation and management platform co-launched with AMD, designed to help neocloud providers and enterprises manage and monetise AMD GPU clusters for Large Language Model (LLM) workloads. The announcement was made during the Advancing AI 2025 event, held on 12 June in Santa Clara, California.

Supporting monetisation and governance for GPU-powered neoclouds

TokenVisor is the first control plane built specifically for the AMD GPU-powered neocloud ecosystem. It enables users to manage LLM workloads more effectively while accelerating time-to-revenue. The platform helps providers streamline deployment, billing, and governance, offering a clear path to Return on Investment (ROI).

Key features include automated resource allocation, real-time usage monitoring, rate-limiting policies, and support for custom pricing. These capabilities allow GPU owners to efficiently commercialise their infrastructure while giving enterprise clients tools to enforce internal cost control and compliance.

Early adopters have reported significant benefits, including faster monetisation after hardware installation and better support for popular LLM and multi-modal models. The combination of technical responsiveness and comprehensive model compatibility has been cited as a major strength by users looking to quickly recover AI infrastructure investments.

Born out of community collaboration and open-source values

TokenVisor was developed in consultation with the AMD GPU neocloud community, embracing the collaborative ethos showcased at Advancing AI 2025. The platform reflects Embedded LLM’s commitment to empowering the decentralised AI ecosystem with enterprise-grade solutions.

“TokenVisor is the hypervisor for the AI Token era – unlocking decentralised GPU computing’s potential requires tools as powerful and flexible as the hardware,” said Ooi Ghee Leng, CEO of Embedded LLM. “Co-launched at Advancing AI 2025, an event that celebrates AI innovation and open-source collaboration, marks an important milestone for the AMD GPU neocloud community.”

Mahesh Balasubramanian, Senior Director of Product Marketing, Data Center GPU Business at AMD, added, “TokenVisor brings powerful new capabilities to the AMD GPU neocloud ecosystem, helping providers efficiently manage and monetise LLM workloads.”

Strengthening Singapore’s AI and cloud innovation goals

Based in Singapore, Embedded LLM is part of the country’s expanding deep tech sector and supports national goals to position itself as a hub for AI and cloud infrastructure in Southeast Asia. The launch of TokenVisor contributes to Singapore’s push for AI sovereignty and regional leadership in digital innovation.

Embedded LLM continues to develop LLM platforms aimed at making generative AI more accessible. It is an active contributor to open-source tools, including enhancements to vLLM for AMD ROCm and orchestration platforms like JamAI Base. With TokenVisor, the company offers a practical solution to monetise and manage GPU clusters in a decentralised AI environment.

Hot this week

TeamViewer data reveals urgent need to upgrade from Windows 10 as support ends

TeamViewer warns of cybersecurity risks as Windows 10 support ends, with over 40% of global devices still on the outdated system.

Geotab launches AI assistant Ace for fleets in Southeast Asia

Geotab launches Ace, a generative AI assistant, in Southeast Asia to help fleets improve safety, efficiency and data-driven decision-making.

OpenAI launches ChatGPT Go in Asia to make AI more accessible

OpenAI launches ChatGPT Go in 16 Asian countries, offering advanced GPT-5 features like higher message limits and image generation at a lower cost.

Delta Electronics showcases energy-efficient data centre solutions at Data Centre World Asia 2025

Delta Electronics unveiled cutting-edge power and cooling solutions at Data Centre World Asia 2025, supporting sustainable, AI-ready data centres.

OpenAI seeks to reduce political bias in ChatGPT responses

OpenAI says its latest GPT-5 models are less politically biased after internal stress tests of its responses.

Little Nightmares 3 disappoints despite striking visuals

Review finds Little Nightmares 3 visually strong but frustratingly dark, with unclear puzzles and weak horror atmosphere.

Microsoft expands Copilot on Windows with Office document creation and Gmail integration

Microsoft updates Copilot on Windows with Office document creation, Gmail integration, and new AI productivity features.

OpenAI seeks to reduce political bias in ChatGPT responses

OpenAI says its latest GPT-5 models are less politically biased after internal stress tests of its responses.

Armis and Fortinet expand partnership to boost cyber resilience for global businesses

Armis and Fortinet have expanded their partnership to enhance cyber resilience with deeper integration, unified visibility, and automated security enforcement.

Related Articles