Thursday, 1 May 2025
26.7 C
Singapore
30 C
Thailand
21 C
Indonesia
28.6 C
Philippines

OVHcloud launches AI Endpoints to simplify access to open-source models

OVHcloud launches AI Endpoints to offer serverless access to over 40 open-source AI models across key global markets.

OVHcloud has launched AI Endpoints, a serverless solution designed to make open-source artificial intelligence models more accessible to developers and businesses. The platform offers over 40 models, including large language models (LLMs) and generative AI tools, supporting applications such as chatbots, speech transcription, and code generation.

With AI Endpoints, developers can integrate advanced AI capabilities into their applications without needing to manage infrastructure or possess deep machine learning expertise. The service is hosted on OVHcloud’s trusted cloud environment, allowing users to experiment with and deploy AI models securely and efficiently.

Support for diverse business use cases

AI Endpoints provides a sandbox environment for developers to test features before rolling them out across applications and business processes. The platform is suited for a wide range of AI applications, including real-time conversational agents, data extraction, speech recognition and synthesis, and coding assistance.

For example, LLMs can be embedded into applications to enhance customer service or user interaction. Text extraction capabilities help businesses process unstructured data, improving operational workflows. Through voice APIs, developers can incorporate both transcription and voice response features, supporting voice-based user interfaces. Additionally, coding tools such as Continue offer in-IDE support with code suggestions and error detection to streamline development.

Privacy, transparency, and environmental responsibility

The platform is built on OVHcloud’s energy-efficient infrastructure, which relies on water-cooled servers housed in environmentally friendly data centres. This approach helps minimise the environmental footprint of AI operations while maintaining performance.

A key differentiator of AI Endpoints is its focus on data sovereignty and transparency. By hosting the solution in Europe, OVHcloud ensures data is protected from non-European regulations. The use of open-weight AI models allows organisations to migrate or replicate these models across different infrastructures, giving them greater control over their data and applications.

“We are excited to launch AI Endpoints and are humbled by the incredible feedback we get from our amazing community. With support for the most diverse and sought after open source LLM models, AI Endpoints helps to democratise AI so developers can add to their apps the most cutting-edge models. Our solution enables them to do this easily in a trusted cloud environment with full confidence in OVHcloud’s sovereign infrastructure,” said Yaniv Fdida, Chief Product and Technology Officer at OVHcloud.

Flexible pricing and regional availability

Following an early preview phase, AI Endpoints is now live in Asia-Pacific, Canada, and Europe, with services deployed from OVHcloud’s Gravelines data centre. Based on user feedback, the service includes enhanced features such as better API key management, increased model stability, and a wider range of supported models.

The offering covers several model categories, including LLMs like Llama 3.3 70B and Mixtral 8x7B; small language models such as Mistral Nemo and Llama 3.1 8B; code models like Qwen 2.5 Coder 32B and Codestral Mamba; reasoning tools such as DeepSeek-R1; multimodal models like Qwen 2.5 VL 72B; image generation with SDXL; and speech-to-text (ASR) and text-to-speech (TTS) capabilities.

Pricing is offered on a pay-as-you-go basis, with costs calculated by the number of tokens processed per minute, depending on the selected model.

Hot this week

XPENG unveils AI-powered innovations and supercharged EVs at Auto Shanghai 2025

XPENG launches AI brain, 10-minute charging EV, and IRON humanoid robot at Auto Shanghai 2025, setting new mobility benchmarks.

Early cancer detection startup Craif raises US$22M to expand into the U.S.

Craif raises $22M to expand its microRNA early cancer detection technology into the U.S., aiming to make testing simple and accessible.

Commvault expands cyber recovery services through CrowdStrike partnership

Commvault and CrowdStrike expand partnership to offer integrated cyber recovery and incident response services for stronger cyber resilience.

Semperis launches Ready1 to boost cyber crisis response for Singapore businesses

Semperis unveils Ready1 to streamline cyber crisis management, with Singapore ranking among the most prepared yet still facing major response gaps.

Exclusive Networks: Are Singapore businesses ready for AI, cybersecurity and the 2025 digital landscape?

Explore how AI is transforming cybersecurity in Singapore, the impact of Budget 2025, workforce gaps, and risks facing ASEAN businesses.

You can get DOOM: The Dark Ages free with select Nvidia graphics cards

Get DOOM: The Dark Ages Premium Edition free with select Nvidia RTX 50 GPUs until May 21, including in-game extras and early access.

Xiaomi enters China’s AI race with new model to power smart devices

Xiaomi joins China’s AI race with its new MiMo model, aiming to power devices with smarter tech and compete with big tech firms.

Samsung chip profits fall sharply due to US export controls and price drops

Samsung chip profits dropped 40% due to US export rules and price cuts as the company raced to catch up in AI memory production.

Chinese AI and robotics start-ups back Xi’s push for technological self-reliance

Chinese AI and robotics start-ups vow self-reliance after Xi visits Shanghai, showcasing innovation and commitment to homegrown tech.

Related Articles

Popular Categories