Sunday, 2 November 2025
26.2 C
Singapore
24.7 C
Thailand
21.8 C
Indonesia
29 C
Philippines

You can now rent Google’s advanced AI chip: Trillium TPU powers Gemini 2.0 and challenges AMD and Nvidia

Google’s Trillium TPU is now available for rent. It offers unmatched AI training efficiency, energy savings, and powerful computing performance.

Google officially announced that its sixth-generation Tensor Processing Unit (TPU), Trillium, is now available for rent. After months of being offered in preview, this powerful AI chip is ready for general use. Designed to revolutionise AI infrastructure, Trillium has already proven its capabilities by training Gemini 2.0, Google’s cutting-edge AI model.

This news is big if you’re in the AI or tech industry. The chip boasts impressive features, including double the High Bandwidth Memory (HBM) capacity and double the Interchip Interconnect bandwidth compared to its predecessors. According to Google, Trillium offers up to a 2.5x improvement in training performance per dollar, making it an efficient choice for businesses aiming to optimise their AI operations.

Revolutionary performance with Trillium TPU

Trillium isn’t just an upgrade—it’s a leap forward. It delivers more than four times the training performance of its predecessor, while energy efficiency has increased by 67%. Regarding raw computing power, each chip’s peak performance is 4.7 times greater than earlier.

Google’s benchmarks reveal that Trillium also significantly enhances inference tasks. For image generation models like Stable Diffusion XL, throughput has increased thrice. Large language models, crucial in today’s AI landscape, see nearly double the throughput.

The chip’s architecture is also optimised for embedding-intensive models. Its third-generation SparseCore improves dynamic and data-dependent operations efficiency, ensuring smooth performance even under complex workloads.

Powering Google Cloud’s AI Hypercomputer

One of Trillium’s standout achievements is its role in Google Cloud’s AI Hypercomputer. This advanced system integrates over 100,000 Trillium chips, all connected through a Jupiter network fabric with an astounding 13 Petabits/sec bandwidth. The system combines this cutting-edge hardware with open-source software and well-known machine-learning frameworks like JAX, PyTorch, and TensorFlow.

What does this mean for you? Google Cloud customers can now use the same state-of-the-art hardware that trained the Gemini 2.0 AI model. With Trillium’s general availability, high-performance AI technology is no longer reserved for a select few. From image generation to complex language models, Trillium opens the door to countless applications, making it a valuable asset for businesses aiming to stay ahead in the AI race.

Hot this week

Thief VR: Legacy of Shadow launches on 4 December

The classic stealth series returns with Thief VR: Legacy of Shadow, launching 4 December on Meta Quest, PS VR, and SteamVR.

Disney Plus to release original Fortnite x The Simpsons animated shorts

Disney Plus releases four new Fortnite x The Simpsons shorts in November, also viewable within the game itself.

Jobstreet by SEEK launches verified education credentials to boost hiring confidence in Singapore

Jobstreet by SEEK adds verified education credentials to SEEK Pass, giving Singapore employers greater hiring confidence and efficiency.

Zoom and NVIDIA partner to deliver custom enterprise AI for the future

Zoom and NVIDIA partner to enhance enterprise AI with advanced model integration, improving speed, reasoning, and security across industries.

VoidZero secures US$12.5 million Series A to launch unified JavaScript toolchain Vite+

VoidZero raises US$12.5 million Series A to launch Vite+, a unified JavaScript toolchain aimed at boosting developer productivity.

Disney Plus to release original Fortnite x The Simpsons animated shorts

Disney Plus releases four new Fortnite x The Simpsons shorts in November, also viewable within the game itself.

Bluesky tests the dislike button and ‘social proximity’ to improve user interactions

Bluesky tests a private dislike button and ‘social proximity’ system to improve conversations and foster more meaningful online interactions.

Innovation drives legacy industries at TechInnovation 2025

Industry leaders at TechInnovation 2025 shared how innovation and collaboration are helping legacy businesses modernise for the future.

Informatica unveils Fall 2025 release to power the era of agentic AI

Informatica’s Fall 2025 release introduces new AI-driven data management tools to power agentic AI with trusted enterprise data.

Related Articles

Popular Categories