Saturday, 6 December 2025
27.8 C
Singapore
24.5 C
Thailand
21.4 C
Indonesia
26.8 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

Nvidia partners with Mistral AI to accelerate new open model family

Nvidia and Mistral AI launch the Mistral 3 model family to boost enterprise AI performance across cloud and edge platforms.

OpenAI enters circular ownership deal with Thrive Holdings

OpenAI enters a circular ownership deal with Thrive Holdings, deepening ties with private equity while expanding its AI reach.

UnionBank adopts Amazon Quick Suite to accelerate data-driven decision making

UnionBank deploys Amazon Quick Suite to expand access to data analytics and speed up decision making across its organisation.

DJI Osmo Pocket 4 leak suggests launch may be imminent

DJI’s Osmo Pocket 4 appears in FCC filings, hinting at an imminent launch amid rumours of new features and a possible US product ban.

Samsung introduces Galaxy Tab A11+ with larger display, AI features, and long-term software support

Samsung launches the Galaxy Tab A11+, an affordable 11-inch tablet with AI tools, long battery life, and seven years of software support.

Google highlights Singapore’s top trending searches in 2025

Google reveals Singapore’s top trending searches for 2025, highlighting SG60 celebrations, elections, pop culture and financial concerns.

HPE expands hybrid cloud portfolio with new virtualisation, security and AI capabilities

HPE expands its GreenLake cloud portfolio with new virtualisation, security and AI capabilities to support modern hybrid cloud demands.

EOY music, comics and arts festival returns with new venue and expanded programme

EOY 2025 returns with a new venue, international guests and expanded activities celebrating Japanese pop culture in Singapore.

Tiger Brokers: Bringing institutional-grade AI intelligence to global retail investors

AI is redefining retail investing as platforms like Tiger Brokers’ TigerAI integrate verified intelligence, personalisation, and long-term wealth management to empower global investors.

Related Articles

Popular Categories