Saturday, 4 October 2025

DeepSWE, powered by Alibaba’s Qwen3-32B, outperforms rivals in global benchmark

Alibaba’s open-source Qwen model powers DeepSWE to global victory in AI agent rankings, signalling a shift in open-weight AI innovation.

You’ve likely heard a lot about artificial intelligence lately, but here’s a development that could shape where AI goes next. Alibaba Group’s open-source AI model, Qwen, has powered a new agentic framework, DeepSWE, to the top of a major global ranking. Developed by Agentica and San Francisco-based Together AI, DeepSWE achieved 59% accuracy on the SWE-bench Verified test, the best result among open-weight models and ahead of rivals including DeepSeek’s V3-0324.
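For a sense of scale, the headline score can be turned into an approximate count of solved tasks. The short Python sketch below assumes the SWE-bench Verified set contains 500 tasks and treats "accuracy" as the share of tasks resolved; the function name and the rounding are illustrative and are not taken from DeepSWE's own evaluation code.

```python
# Assumption: the SWE-bench Verified set contains 500 human-validated tasks.
VERIFIED_TASKS = 500


def resolved_tasks(score_percent: float, total_tasks: int = VERIFIED_TASKS) -> int:
    """Approximate number of tasks resolved for a reported percentage score."""
    if not 0 <= score_percent <= 100:
        raise ValueError("score_percent must be between 0 and 100")
    return round(total_tasks * score_percent / 100)


# 59 is the score reported in the article; the result is an estimate only.
deepswe_resolved = resolved_tasks(59)
print(f"Roughly {deepswe_resolved} of {VERIFIED_TASKS} tasks resolved")
```

Under that assumption, a 59% score corresponds to roughly 295 resolved tasks.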

This achievement is no small feat. DeepSWE is a software agent built on the Qwen3-32B large language model, which is part of Alibaba Cloud’s third-generation AI family. Agentic frameworks are software platforms that allow AI agents to act more like humans: planning, collaborating, making decisions, and solving problems independently.

Why does this matter to you?

If you use apps or tools that handle tasks automatically—whether it’s customer support, content creation, or software development—this win by DeepSWE is something to watch. AI agents like DeepSWE aren’t just trained to chat. They can write code, fix bugs, and work through complex problems on platforms like GitHub. In essence, they can act like digital assistants for developers, helping reduce the heavy lifting of software engineering.

What makes DeepSWE different? It’s been trained using Agentica’s rLLM, a modular reinforcement learning system. The team behind it didn’t just keep the results to themselves—they’ve open-sourced everything. That means developers worldwide can access the dataset, code, training process, and even the evaluation logs. This transparency enables other teams to build, improve, and scale their own AI agents more quickly.

Training DeepSWE wasn’t a weekend project. It ran for six days on powerful computers using Nvidia’s high-end H100 graphics processors. This intensive training enabled the model to learn and handle detailed and high-level software tasks efficiently.

Alibaba’s rise in open-source AI

This success adds to Alibaba’s growing influence in the global open-source AI scene. The company, based in Hangzhou, started releasing the Qwen models to the public in August 2023. By April 2024, it had already released more than 200 open-source Qwen models, which together had been downloaded over 300 million times and inspired 100,000 derivative models worldwide.

The Qwen3 series, released in April, supports platforms like Ollama, LM Studio, SGLang, and vLLM. According to tests run by Alibaba, some Qwen3 models—such as Qwen3-235B and Qwen3-4B—have matched or even beaten the likes of OpenAI’s o1, Google’s Gemini, and DeepSeek’s R1 in tasks like coding support, text generation, and solving complex mathematical problems.

Last month, Alibaba’s chairman, Joe Tsai, and CEO, Eddie Wu Yongming, stated in a letter to shareholders that Qwen is now the world’s largest open-source AI model family. They emphasised that this strategy is part of a broader push to promote global adoption of Chinese-developed AI systems.

Massive investment in AI’s future

Alibaba Cloud is not slowing down. On July 4, the company announced a fresh investment of more than US$60 million to accelerate AI innovation through its partner ecosystem before the end of its current financial year in March.

This follows CEO Wu’s February pledge to invest at least 380 billion yuan (US$53 billion) over the next three years in AI and cloud computing infrastructure. It’s set to be the largest computing project ever backed by a private company in China.
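The two figures in that pledge can be cross-checked with simple arithmetic. The Python sketch below derives the exchange rate implied by the article's numbers and an average yearly spend; the even split across three years is an assumption made for illustration, not something Alibaba has stated, and the function names are hypothetical.

```python
def implied_rate(yuan_billion: float, usd_billion: float) -> float:
    """Yuan per US dollar implied by a pair of quoted amounts."""
    return yuan_billion / usd_billion


def average_per_year(total_billion: float, years: int) -> float:
    """Average yearly amount, assuming an even split (illustrative only)."""
    return total_billion / years


# Figures quoted in the article: 380 billion yuan, US$53 billion, three years.
rate = implied_rate(380, 53)
yearly_usd = average_per_year(53, 3)

print(f"Implied rate: {rate:.2f} yuan per US dollar")
print(f"Average per year: US${yearly_usd:.1f} billion")
```

The implied rate works out to about 7.17 yuan per US dollar, and an even split would put the average at just under US$18 billion a year.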

So, what does this mean for you? Whether you’re a tech enthusiast, developer, or someone interested in how AI can make life easier, Alibaba’s Qwen-based DeepSWE is a clear sign that open-source models are not just catching up—they’re leading the way.
