Sunday, 15 June 2025

Alibaba reveals Qwen3, a powerful new series of AI models

Alibaba launches Qwen3, a powerful open AI model family with hybrid reasoning and strong performance that rivals Google and OpenAI.

Chinese tech giant Alibaba has introduced Qwen3, a new family of artificial intelligence models that could rival top names like Google and OpenAI. Launched on April 29, the Qwen3 series offers a wide range of models and is available under an open licence on platforms such as Hugging Face and GitHub.

With models ranging from 0.6 billion to 235 billion parameters, Qwen3 covers a broad scope of problem-solving abilities. In AI, parameters are the numerical values a model adjusts as it learns from data, and generally, more parameters mean greater capacity and better performance. According to Alibaba, some of the Qwen3 models even surpass Google’s Gemini 2.5 Pro and OpenAI’s o3-mini on key benchmarks.

A hybrid approach to problem-solving

What makes Qwen3 stand out is its “hybrid” design: it can either think deeply about a task or provide a quick answer, depending on the situation. Given a complex question, the model takes time to “reason” through the answer; for simpler queries, it responds quickly. Users can therefore control how much computing power, or “thinking budget”, they want the model to use.

“We have seamlessly integrated thinking and non-thinking modes, offering users the flexibility to control the thinking budget,” the Qwen team explained in a blog post. “This design enables users to configure task-specific budgets with greater ease.”
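
As a rough illustration of what a configurable thinking budget could look like from the caller's side, here is a minimal Python sketch. The GenerationRequest class, the plan_request function, and the idea of measuring the budget in tokens are all hypothetical names and assumptions made for this example; they are not taken from Alibaba's API.

```python
from dataclasses import dataclass


@dataclass
class GenerationRequest:
    """Hypothetical request object; not part of any real Qwen3 API."""
    prompt: str
    enable_thinking: bool
    thinking_budget: int  # assumed unit: maximum tokens spent on reasoning


def plan_request(prompt: str, is_complex: bool, budget: int = 1024) -> GenerationRequest:
    """Pick thinking mode for complex prompts, non-thinking mode otherwise."""
    if is_complex:
        return GenerationRequest(prompt, True, budget)
    # Simple queries skip the reasoning phase, so no budget is reserved.
    return GenerationRequest(prompt, False, 0)


quick = plan_request("What is the capital of Thailand?", is_complex=False)
deep = plan_request("Prove that the square root of 2 is irrational.",
                    is_complex=True, budget=4096)
```

In this sketch the caller decides whether a prompt is complex; a real deployment might instead let the user set the mode and budget per request.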

Some Qwen3 models also use a “mixture of experts” (MoE) architecture. Rather than running the whole network for every input, an MoE model routes each piece of input to a few specialised sub-networks, or “experts”, and combines their outputs to generate an answer. Because only part of the model is active at any one time, these models work faster and more efficiently.
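
The general idea behind expert routing can be sketched in a few lines of Python. This is a toy illustration of top-k gating, a common way to build MoE layers, and not Alibaba's implementation; the experts, gate scores, and function names below are made up for the example.

```python
import math


def softmax(scores):
    """Turn raw gate scores into probabilities that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k=2):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)  # renormalise over the chosen experts
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k=2):
    """Run only the selected experts and blend their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in route_top_k(gate_scores, k))


# Hypothetical toy experts: each one simply scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
gate_scores = [0.1, 2.0, 0.3, 2.0]  # made-up scores favouring experts 1 and 3

selected = route_top_k(gate_scores, k=2)
output = moe_forward(10.0, experts, gate_scores, k=2)
```

Here only two of the four experts run for the input, which is where the efficiency gain comes from: the model holds many parameters but uses a fraction of them per input.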

Strong performance across many tests

The Qwen3 models support 119 languages and were trained using a dataset of nearly 36 trillion tokens. A token is a small piece of data the model uses to learn; around 1 million tokens equals roughly 750,000 words. The training data included everything from textbooks and code snippets to AI-generated content and question-answer pairs.
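
To put the dataset size in perspective, the token-to-word ratio quoted above can be applied to the full training set. The short Python calculation below assumes the ratio holds uniformly across all the data, which is only an approximation, and rounds "nearly 36 trillion" up to 36 trillion.

```python
# Figures from the article: 1 million tokens is about 750,000 words.
TOKENS_PER_SAMPLE = 1_000_000
WORDS_PER_SAMPLE = 750_000

training_tokens = 36 * 10**12  # "nearly 36 trillion", rounded up

# Integer arithmetic keeps the estimate exact for these round numbers.
approx_words = training_tokens * WORDS_PER_SAMPLE // TOKENS_PER_SAMPLE

print(f"Roughly {approx_words / 10**12:.0f} trillion words")
```

Under these assumptions, the training set works out to about 27 trillion words.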

According to Alibaba, this extensive training has made Qwen3 much more advanced than its predecessor, Qwen2. While none of the models are dramatically better than top offerings from Google or OpenAI, they are considered highly competitive.

One standout model is Qwen3-235B-A22B, the largest in the Qwen3 family. It slightly outperforms OpenAI’s o3-mini and Google’s Gemini 2.5 Pro on Codeforces, a popular coding competition site. It also scores higher on AIME, a tough maths test, and BFCL, a benchmark for function calling (a model’s ability to use external tools). However, this top-tier model isn’t available to the public for now.

The biggest publicly available model, Qwen3-32B, is still strong. It competes well with other AI tools, including models from DeepSeek, a Chinese AI lab. On several coding tests like LiveCodeBench, Qwen3-32B even beats OpenAI’s o1 model.

A growing role in the open-source AI landscape

Alibaba says Qwen3 does more than solve problems. It’s also good at calling tools, following instructions, and copying data formats. Besides downloading the models, you can access Qwen3 through cloud services like Fireworks AI and Hyperbolic.

Tuhin Srivastava, CEO and co-founder of cloud hosting company Baseten, sees Qwen3 as part of a larger trend. “The U.S. is doubling down on restricting sales of chips to China and purchases from China,” he said, “but models like Qwen3 that are state-of-the-art and open … will undoubtedly be used domestically.”

He added that businesses are now combining custom-built tools with ready-made models from companies like Anthropic and OpenAI. With Qwen3, Alibaba is showing that Chinese firms are catching up in AI and setting new standards.
