Monday, 20 October 2025
28.9 C
Singapore
24.3 C
Thailand
21.6 C
Indonesia
28.5 C
Philippines

Anthropic’s newest Claude chatbot outperforms GPT-4o in benchmarks

Explore Claude 3.5 Sonnet, Anthropic's latest AI model, now available. It excels at understanding nuance and visual input, outpacing GPT-4o benchmarks.

On Thursday, Anthropic introduced its latest AI language model, Claude 3.5 Sonnet. This new version surpasses the company’s previous top-tier model, the Claude 3 Opus, while operating at twice the speed. You can now explore this enhanced chatbot, even with a free account.

Key features and performance

Claude 3.5 Sonnet is the first in the Claude 3.5 series and is considered Anthropic’s most balanced model. Future releases in this series will include Claude 3.5 Haiku, the fastest model, and Claude 3.5 Opus, the most powerful. These updates will roll out later this year while the current versions remain on Claude 3. The quick release of Sonnet, just months after the Claude 3 family, highlights the rapid pace at which AI companies are developing their technologies.

Anthropic claims that Claude 3.5 Sonnet significantly improves understanding of nuance, humour, and complex prompts, enabling it to write in a more natural tone. Benchmark tests indicate that the new model sets industry records for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency. It surpasses OpenAI’s GPT-4o in many of these benchmarks. However, it is worth noting that the latest models from Claude, ChatGPT, Gemini, and Llama are all closely matched, often scoring within a few percentage points of each other, reflecting the intense competition in the AI field.

Enhanced visual interpretation and a new workspace

The company asserts that Claude 3.5 Sonnet excels at interpreting visual input better than its predecessor, Claude 3.0 Opus. The new model can accurately transcribe text from imperfect images, a feature expected to attract retail, logistics, and financial services customers who require precise data interpretation from charts, graphs, and other visual cues.

Claude’s latest update also includes a new workspace feature called Artifacts. When you prompt the chatbot to generate content such as code, text documents, or web designs, a dedicated window appears next to the chat interface. This Artefacts window allows you to request changes, and it will update with the chatbot’s latest output. Anthropic sees artefacts as a step towards making Claude a hub for broader team collaboration. The company envisions a future where teams and entire organisations can securely centralise their knowledge, documents, and ongoing projects in one shared space, with Claude acting as an on-demand team member.

Availability and pricing

Claude 3.5 Sonnet is now available for anyone with an account to try on Anthropic’s website and through the Claude iOS app. Pro and team subscribers on these platforms will benefit from higher token counts. Additionally, you can access it via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. The cost remains the same as the previous model, at US$3 per million input tokens and US$15 per million output tokens.

Hot this week

Global mobile app demand remains resilient as APAC leads growth surge

Adjust’s 2025 Mobile App Growth Report shows global app demand rising, led by APAC’s strong growth in gaming and entertainment.

NetApp expands enterprise cloud capabilities with Google Cloud partnership

NetApp expands its Google Cloud collaboration with new block storage and AI capabilities in NetApp Volumes to accelerate enterprise transformation.

8BitDo unveils NES40 collection to mark 40 years of the Nintendo Entertainment System

8BitDo marks 40 years of the NES with a limited NES40 collection featuring redesigned controllers, a premium keyboard, and a modernised speaker.

Meta accelerates AI innovation in Singapore with Llama Incubator Program Demo Day

Meta’s Llama Incubator Demo Day highlights its push to support open-source AI innovation and strengthen Singapore’s digital economy.

Facebook reintroduces job listings with a focus on local work

Facebook is reintroducing job listings for local, entry-level, and trade work in the US, accessible through Marketplace and groups.

Oura redesigns app with enhanced stress tracking and hypertension study

Oura unveils redesigned app with advanced stress tracking and begins FDA-backed study to develop early hypertension detection features.

Shadow of the Colossus turns 20: Exploring the moral depth of gaming’s quietest hero

Shadow of the Colossus marks its 20th anniversary, celebrated for its quiet heroism, moral depth, and enduring emotional power.

Samsung partners with Nvidia to develop custom CPUs and XPUs for AI dominance

Nvidia partners with Samsung to develop custom CPUs and XPUs, expanding its NVLink Fusion ecosystem to strengthen its AI hardware dominance.

NVIDIA unveils first US-made Blackwell wafer as domestic chip production expands

NVIDIA unveils its first US-made Blackwell wafer at TSMC’s Arizona facility, marking a major milestone in domestic AI chip production.

Related Articles