Friday, 19 December 2025
27.4 C
Singapore
29.2 C
Thailand
27.3 C
Indonesia
27.5 C
Philippines

OVHcloud launches AI Endpoints to simplify access to open-source models

OVHcloud launches AI Endpoints to offer serverless access to over 40 open-source AI models across key global markets.

OVHcloud has launched AI Endpoints, a serverless solution designed to make open-source artificial intelligence models more accessible to developers and businesses. The platform offers over 40 models, including large language models (LLMs) and generative AI tools, supporting applications such as chatbots, speech transcription, and code generation.

With AI Endpoints, developers can integrate advanced AI capabilities into their applications without needing to manage infrastructure or possess deep machine learning expertise. The service is hosted on OVHcloud’s trusted cloud environment, allowing users to experiment with and deploy AI models securely and efficiently.

Support for diverse business use cases

AI Endpoints provides a sandbox environment for developers to test features before rolling them out across applications and business processes. The platform is suited for a wide range of AI applications, including real-time conversational agents, data extraction, speech recognition and synthesis, and coding assistance.

For example, LLMs can be embedded into applications to enhance customer service or user interaction. Text extraction capabilities help businesses process unstructured data, improving operational workflows. Through voice APIs, developers can incorporate both transcription and voice response features, supporting voice-based user interfaces. Additionally, coding tools such as Continue offer in-IDE support with code suggestions and error detection to streamline development.

Privacy, transparency, and environmental responsibility

The platform is built on OVHcloud’s energy-efficient infrastructure, which relies on water-cooled servers housed in environmentally friendly data centres. This approach helps minimise the environmental footprint of AI operations while maintaining performance.

A key differentiator of AI Endpoints is its focus on data sovereignty and transparency. By hosting the solution in Europe, OVHcloud ensures data is protected from non-European regulations. The use of open-weight AI models allows organisations to migrate or replicate these models across different infrastructures, giving them greater control over their data and applications.

“We are excited to launch AI Endpoints and are humbled by the incredible feedback we get from our amazing community. With support for the most diverse and sought after open source LLM models, AI Endpoints helps to democratise AI so developers can add to their apps the most cutting-edge models. Our solution enables them to do this easily in a trusted cloud environment with full confidence in OVHcloud’s sovereign infrastructure,” said Yaniv Fdida, Chief Product and Technology Officer at OVHcloud.

Flexible pricing and regional availability

Following an early preview phase, AI Endpoints is now live in Asia-Pacific, Canada, and Europe, with services deployed from OVHcloud’s Gravelines data centre. Based on user feedback, the service includes enhanced features such as better API key management, increased model stability, and a wider range of supported models.

The offering covers several model categories, including LLMs like Llama 3.3 70B and Mixtral 8x7B; small language models such as Mistral Nemo and Llama 3.1 8B; code models like Qwen 2.5 Coder 32B and Codestral Mamba; reasoning tools such as DeepSeek-R1; multimodal models like Qwen 2.5 VL 72B; image generation with SDXL; and speech-to-text (ASR) and text-to-speech (TTS) capabilities.

Pricing is offered on a pay-as-you-go basis, with costs calculated by the number of tokens processed per minute, depending on the selected model.

Hot this week

LG introduces Micro RGB evo TV ahead of CES 2026

LG unveils its first Micro RGB evo TV for CES 2026, promising wider colour gamut, higher brightness, and LCD performance closer to OLED.

Deel becomes Arsenal’s official HR platform partner in multi-year global deal

Deel signs a multi-year global partnership with Arsenal, becoming the club’s Official HR Platform Partner and supporting its global operations.

Delta Electronics Singapore signs MOU with NUS to advance sustainable data centre innovation

Delta Electronics Singapore and NUS partner to develop sustainable, AI-ready data centre technologies for tropical environments.

Google removes AI-generated Disney videos from YouTube after cease-and-desist

Google has removed AI-generated Disney character videos from YouTube after receiving a cease-and-desist letter over copyright claims.

Jobstreet by SEEK outlines key job market shifts and skills needed to thrive in Singapore in 2026

Jobstreet by SEEK highlights rising retrenchments, strong tech demand, and the growing importance of AI and skills-based hiring in Singapore.

The rise of agentic AI and what it means for enterprise leaders

Agentic AI is accelerating across Asia, pushing leaders to rethink productivity, governance, and the infrastructure needed for long-term competitiveness.

Apple explores iPhone-class chip for future MacBook, leaks suggest

Leaked Apple files hint at testing a MacBook powered by an iPhone-class chip, suggesting a possible lower-cost laptop in the future.

Delta Electronics Singapore signs MOU with NUS to advance sustainable data centre innovation

Delta Electronics Singapore and NUS partner to develop sustainable, AI-ready data centre technologies for tropical environments.

Zoom introduces AI Companion 3.0 with a web-based assistant and expanded task automation

Zoom launches AI Companion 3.0, adding a web-based assistant that automates tasks, drafts emails and reshapes the platform into an AI workspace.

Related Articles

Popular Categories