Tuesday, 23 December 2025
27 C
Singapore
25.6 C
Thailand
21.1 C
Indonesia
26.7 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

ChatGPT for Android may soon offer faster access to specific chats

ChatGPT for Android may add home-screen shortcuts that open specific chats directly, making repeat conversations easier to access.

Cut dialogue reveals how talkative Metroid Prime 4 nearly was

Cut dialogue reveals Metroid Prime 4 once planned over 30 minutes of extra NPC chatter, highlighting a controversial design choice.

The rise of agentic AI and what it means for enterprise leaders

Agentic AI is accelerating across Asia, pushing leaders to rethink productivity, governance, and the infrastructure needed for long-term competitiveness.

Apple Studio Display 2 tipped to add 120Hz refresh rate and HDR support

Apple Studio Display 2 is tipped to feature 120Hz refresh rates, HDR support, and possibly mini-LED technology, with a launch expected in 2026.

AI designs a Linux computer with 843 parts in a single week

Quilter reveals a Linux computer designed by AI in one week, hinting at a future where hardware development is faster and more accessible.

AI designs a Linux computer with 843 parts in a single week

Quilter reveals a Linux computer designed by AI in one week, hinting at a future where hardware development is faster and more accessible.

IATA raises concerns over potential 5G interference with aviation systems

IATA warns uneven global 5G rules could pose aviation risks, even as Singapore reports no interference with aircraft systems.

Thoughtworks: Singapore’s financial OS upgrade, agentic AI and the race for the future of wealth

How agentic AI could reshape wealth management in Singapore by enhancing personalisation, improving responsiveness and elevating the role of advisers.

Google delays Gemini takeover from Assistant on Android until 2026

Google has delayed replacing Google Assistant with Gemini on Android, extending the transition into 2026 as technical challenges persist.

Related Articles

Popular Categories