Sunday, 6 July 2025
26.6 C
Singapore
28.6 C
Thailand
19.9 C
Indonesia
29 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

E Ink transforms laptop touchpads into smart e-reader displays for AI use

E Ink’s new touchpad brings e-reader tech to laptops, offering a low-power screen for AI apps and assistants right under your fingertips.

Blizzard winds down development for the Warcraft mobile game after layoffs

Blizzard will end new content for Warcraft Rumble after 100 staff were laid off, scaling down mobile ambitions amid broader Microsoft cuts.

X introduces AI bots to help write Community Notes

X lets AI bots write Community Notes, but humans still decide what appears on posts.

Sony brings louder bass and new designs to its Ult Power speaker lineup in 2025

Sony’s 2025 Ult Power speakers offer deeper bass, longer battery, and party features, launching in Singapore in Q3.

Secretlab teams up with Genshin Impact for first Liyue-inspired chair and desk collection

Secretlab reveals its first Genshin Impact collection, which includes Liyue-themed chairs and a desk inspired by Xiao, Ningguang, and the Lantern Rite.

China to invest in Brazil-led global forest fund, signalling shift in climate finance

China may invest in Brazil's global forest fund, signalling a shift in climate finance and broader support from emerging economies.

Trump says talks with China on TikTok deal to begin this week

Trump says TikTok deal talks with China will begin this week, with possible involvement from President Xi or his team.

DeepSWE, powered by Alibaba’s Qwen3-32B, outperforms rivals in global benchmark

Alibaba’s open-source Qwen model powers DeepSWE to global victory in AI agent rankings, signalling a shift in open-weight AI innovation.

E Ink transforms laptop touchpads into smart e-reader displays for AI use

E Ink’s new touchpad brings e-reader tech to laptops, offering a low-power screen for AI apps and assistants right under your fingertips.

Related Articles

Popular Categories