Wednesday, 10 December 2025
26.8 C
Singapore
13.5 C
Thailand
22.6 C
Indonesia
26.8 C
Philippines

Anthropic’s newest Claude chatbot outperforms GPT-4o in benchmarks

Explore Claude 3.5 Sonnet, Anthropic's latest AI model, now available. It excels at understanding nuance and visual input, outpacing GPT-4o benchmarks.

On Thursday, Anthropic introduced its latest AI language model, Claude 3.5 Sonnet. This new version surpasses the company’s previous top-tier model, the Claude 3 Opus, while operating at twice the speed. You can now explore this enhanced chatbot, even with a free account.

Key features and performance

Claude 3.5 Sonnet is the first in the Claude 3.5 series and is considered Anthropic’s most balanced model. Future releases in this series will include Claude 3.5 Haiku, the fastest model, and Claude 3.5 Opus, the most powerful. These updates will roll out later this year while the current versions remain on Claude 3. The quick release of Sonnet, just months after the Claude 3 family, highlights the rapid pace at which AI companies are developing their technologies.

Anthropic claims that Claude 3.5 Sonnet significantly improves understanding of nuance, humour, and complex prompts, enabling it to write in a more natural tone. Benchmark tests indicate that the new model sets industry records for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency. It surpasses OpenAI’s GPT-4o in many of these benchmarks. However, it is worth noting that the latest models from Claude, ChatGPT, Gemini, and Llama are all closely matched, often scoring within a few percentage points of each other, reflecting the intense competition in the AI field.

Enhanced visual interpretation and a new workspace

The company asserts that Claude 3.5 Sonnet excels at interpreting visual input better than its predecessor, Claude 3.0 Opus. The new model can accurately transcribe text from imperfect images, a feature expected to attract retail, logistics, and financial services customers who require precise data interpretation from charts, graphs, and other visual cues.

Claude’s latest update also includes a new workspace feature called Artifacts. When you prompt the chatbot to generate content such as code, text documents, or web designs, a dedicated window appears next to the chat interface. This Artefacts window allows you to request changes, and it will update with the chatbot’s latest output. Anthropic sees artefacts as a step towards making Claude a hub for broader team collaboration. The company envisions a future where teams and entire organisations can securely centralise their knowledge, documents, and ongoing projects in one shared space, with Claude acting as an on-demand team member.

Availability and pricing

Claude 3.5 Sonnet is now available for anyone with an account to try on Anthropic’s website and through the Claude iOS app. Pro and team subscribers on these platforms will benefit from higher token counts. Additionally, you can access it via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. The cost remains the same as the previous model, at US$3 per million input tokens and US$15 per million output tokens.

Hot this week

HPE expands hybrid cloud portfolio with new virtualisation, security and AI capabilities

HPE expands its GreenLake cloud portfolio with new virtualisation, security and AI capabilities to support modern hybrid cloud demands.

ByteDance faces growing resistance as Chinese apps block its AI-driven smartphone

Chinese apps restrict ByteDance’s new AI smartphone as developers raise concerns over automation, security and privacy.

Sony introduces A7 V with updated sensor, faster processing, and improved stabilisation

Sony launches the A7 V with a new sensor, a faster processor, and upgraded stabilisation, targeting hybrid shooters with enhanced features.

HPE expands AI-native networking portfolio and outlines vision for self-driving IT operations

HPE expands its AI-native networking portfolio with new AIOps features, hardware, and hybrid cloud tools designed for self-driving IT operations.

Tech industry overlooks Auracast as momentum quietly builds

Auracast promises major improvements in wireless audio, but limited marketing and slow adoption mean many consumers still don't know it exists.

ByteDance faces growing resistance as Chinese apps block its AI-driven smartphone

Chinese apps restrict ByteDance’s new AI smartphone as developers raise concerns over automation, security and privacy.

Pudu Robotics unveils new robot dog as it expands global presence

Pudu Robotics unveils its new D5 robot dog in Tokyo as part of its global push into service and industrial robotics.

Nintendo launches official eShop and Switch Online service in Singapore

Nintendo launches the Singapore eShop and Switch Online service, giving local players full access to digital games, subscriptions, and regional deals.

2026 Predictions Part 1: The five forces reshaping Asia’s digital economy

Five forces are redefining Asia’s digital economy in 2026, from AI adoption and data sovereignty to new security and workforce demands.

Related Articles