Monday, 16 June 2025
29.3 C
Singapore
28.2 C
Thailand
20.1 C
Indonesia
28.7 C
Philippines

Microsoft supports new startups as it launches an AI processor that skips GPUs and expensive memory

Discover how d-Matrix's new AI processor, Corsair, backed by Microsoft, redefines AI inference with a GPU-free and cost-effective solution.

D-Matrix Inc., a hardware startup based in Santa Clara, California, has unveiled its first AI processor, Corsair, which aims to redefine AI inference. This innovative product does away with traditional GPUs and costly high-bandwidth memory (HBM), offering a more efficient and cost-effective solution.

Supported by Microsoft and embracing cutting-edge advancements, Corsair is available to early-access customers and is expected to reach broader availability by the second quarter of 2025.

What makes Corsair stand out?

Corsair is purpose-built to tackle demanding AI inference tasks, particularly those involving generative AI models. The processor achieves remarkable speeds, handling 60,000 tokens per second at 1 millisecond per token when running models like Llama3 8B on a single server.

For larger-scale applications, such as Llama3 70B, Corsair achieves 30,000 tokens per second at 2 milliseconds per token on a single rack. This performance significantly lowers energy consumption and operational costs compared to traditional GPU-based systems.

Built on d-Matrix’s Nighthawk and Jayhawk II tiles, Corsair utilises a 6nm manufacturing process. Each Nighthawk tile has four neural cores and a RISC-V CPU, optimised for large-model inference. It incorporates digital in-memory computation (DIMC) and supports versatile datatype processing, including block floating point (BFP).

Corsair’s chiplet packaging integrates memory and computation to boost efficiency. It adheres to the PCIe Gen5 full-height, full-length card form factor, making it compatible with DMX Bridge cards for scalable performance. The card is powered by 2400 TFLOPs of 8-bit peak computing power, 2GB of integrated performance memory, and 256GB of off-chip memory capacity.

A partnership with Nvidia’s key ally

Micron Technology, a significant partner of Nvidia, is collaborating with d-Matrix to support Corsair’s development and growth.

Although initially slated for release in late 2023, d-Matrix restructured its architecture to align with the growing demand for generative AI. This shift allowed Corsair to incorporate advanced features tailored for transformer models, agentic AI, and emerging applications like interactive video generation.

“Our vision for d-Matrix was to address the massive computing challenges of generative AI and transformers,” said Sid Sheth, co-founder and CEO of d-Matrix. “Corsair is a groundbreaking platform, delivering blazing-fast token generation for interactive AI applications, making generative AI commercially viable.”

Corsair’s focus on cost efficiency, scalability, and high-performance positions it as a promising solution in the rapidly evolving AI landscape.

Hot this week

Steam adds full native support for Apple Silicon Macs

Steam runs natively on Apple Silicon Macs, ditching Rosetta 2 for smoother performance and better gaming on M1 and M2 devices.

Atome secures US$75 million funding to boost financial inclusion in the Philippines

Atome secures US$75 million from Lending Ark to expand responsible digital credit access in the Philippines.

Commvault strengthens data protection with post-quantum cryptography capabilities

Commvault expands post-quantum cryptography support with HQC to protect long-term data from future quantum computing threats.

Redmagic 10S Pro launches in Singapore with faster gaming performance and exclusive offers

Redmagic 10S Pro lands in Singapore with overclocked performance, S$270 early bird deals, and a free cooling fan for a limited time.

OpenAI gives ChatGPT voice mode a big update for smoother and more lifelike conversations

OpenAI updates ChatGPT’s voice mode for more natural speech, better emotion, and real-time translation for all paid users.

Informatica deepens partnership with Databricks to support new Iceberg and OLTP services

Informatica joins Databricks as launch partner for new Iceberg and OLTP solutions, introducing AI tools to speed up GenAI development.

Hong Kong opens skies to larger drones in bid to grow low-altitude economy

Hong Kong will allow the testing of larger drones to boost its low-altitude economy and improve logistics, following mainland China's lead.

Hong Kong to build new AI supercomputing centre in bid to lead global tech race

Hong Kong plans a new AI supercomputing centre to boost its tech hub status and support growing start-ups across the Greater Bay Area.

Steam adds full native support for Apple Silicon Macs

Steam runs natively on Apple Silicon Macs, ditching Rosetta 2 for smoother performance and better gaming on M1 and M2 devices.

Related Articles

Popular Categories