Sunday, 13 July 2025
24.6 C
Singapore
30.5 C
Thailand
19.9 C
Indonesia
27.4 C
Philippines

Anthropic’s newest Claude chatbot outperforms GPT-4o in benchmarks

Explore Claude 3.5 Sonnet, Anthropic's latest AI model, now available. It excels at understanding nuance and visual input, outpacing GPT-4o benchmarks.

On Thursday, Anthropic introduced its latest AI language model, Claude 3.5 Sonnet. This new version surpasses the company’s previous top-tier model, the Claude 3 Opus, while operating at twice the speed. You can now explore this enhanced chatbot, even with a free account.

Key features and performance

Claude 3.5 Sonnet is the first in the Claude 3.5 series and is considered Anthropic’s most balanced model. Future releases in this series will include Claude 3.5 Haiku, the fastest model, and Claude 3.5 Opus, the most powerful. These updates will roll out later this year while the current versions remain on Claude 3. The quick release of Sonnet, just months after the Claude 3 family, highlights the rapid pace at which AI companies are developing their technologies.

Anthropic claims that Claude 3.5 Sonnet significantly improves understanding of nuance, humour, and complex prompts, enabling it to write in a more natural tone. Benchmark tests indicate that the new model sets industry records for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency. It surpasses OpenAI’s GPT-4o in many of these benchmarks. However, it is worth noting that the latest models from Claude, ChatGPT, Gemini, and Llama are all closely matched, often scoring within a few percentage points of each other, reflecting the intense competition in the AI field.

Enhanced visual interpretation and a new workspace

The company asserts that Claude 3.5 Sonnet excels at interpreting visual input better than its predecessor, Claude 3.0 Opus. The new model can accurately transcribe text from imperfect images, a feature expected to attract retail, logistics, and financial services customers who require precise data interpretation from charts, graphs, and other visual cues.

Claude’s latest update also includes a new workspace feature called Artifacts. When you prompt the chatbot to generate content such as code, text documents, or web designs, a dedicated window appears next to the chat interface. This Artefacts window allows you to request changes, and it will update with the chatbot’s latest output. Anthropic sees artefacts as a step towards making Claude a hub for broader team collaboration. The company envisions a future where teams and entire organisations can securely centralise their knowledge, documents, and ongoing projects in one shared space, with Claude acting as an on-demand team member.

Availability and pricing

Claude 3.5 Sonnet is now available for anyone with an account to try on Anthropic’s website and through the Claude iOS app. Pro and team subscribers on these platforms will benefit from higher token counts. Additionally, you can access it via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. The cost remains the same as the previous model, at US$3 per million input tokens and US$15 per million output tokens.

Hot this week

Embedded LLM and AMD launch TokenVisor to boost AI monetisation for GPU neoclouds

Embedded LLM and AMD launch TokenVisor, a platform enabling monetisation and management of AMD GPU clusters for LLM workloads.

BDx unveils Southeast Asia’s first hybrid quantum AI testbed in Singapore

BDx launches Southeast Asia’s first hybrid quantum AI testbed in Singapore, boosting AI innovation and sustainability at the SIN1 data centre.

ASUS IoT launches R680EA-IM-Z industrial motherboard with enhanced AI performance and durability

ASUS IoT has launched the R680EA-IM-Z motherboard, offering 21% faster AI performance and enhanced durability for industrial applications.

Epson launches WorkForce Pro EM-C8100/C8101 multifunction printers in Singapore

Epson launches the WorkForce Pro EM-C8100/C8101 in Singapore, offering high-speed printing and eco-friendly features for modern offices.

Microsoft to exit Pakistan after 25 years, shifting to reseller model

Microsoft ends its 25-year presence in Pakistan, shifting to a reseller model amid global cuts and broader industry challenges.

OpenAI preparing to launch AI-powered web browser to rival Chrome

OpenAI plans to launch a new AI-powered web browser, aiming to transform the browsing experience using ChatGPT technology.

Singapore to get Huawei’s 480kW ultra-fast EV charger by the end of 2025

Huawei brings 480kW ultra-fast EV charger to Singapore by late 2025, slashing charge times and boosting support for commercial vehicles.

Samsung, Google, and Qualcomm share their vision for where mobile AI is heading

Samsung, Google, and Qualcomm share how mobile AI will become more helpful, personal, and invisible in your everyday life.

Razer unveils DeathAdder V4 Pro with pro-level features and ultra-lightweight design

Razer’s DeathAdder V4 Pro lands with 8000Hz wireless polling, a lighter design, and esports-level precision for serious gamers.

Related Articles

Popular Categories