Sunday, 2 November 2025
28.6 C
Singapore
27.6 C
Thailand
28.1 C
Indonesia
28.9 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

AMD to power next-generation US supercomputers for sovereign AI and scientific research

AMD and the US Department of Energy unveil Lux and Discovery supercomputers to advance sovereign AI and scientific innovation.

OpenAI outlines major improvements and new features for ChatGPT Atlas

OpenAI announces major updates to ChatGPT Atlas, including tab groups, user profiles, improved sidebar tools, and enhancements to Agent mode.

OPPO launches Find X9 Series to redefine the premium smartphone experience

OPPO unveils the Find X9 Series globally, introducing AI-powered imaging and advanced design to redefine premium smartphones.

Denodo and ST Engineering partner to advance AI-driven sensemaking technologies

Denodo and ST Engineering partner to develop AI-driven data technologies to enhance decision-making and operational intelligence.

StarHub connects Singapore this festive season with shared experiences

StarHub celebrates the festive season with Halloween, football, and Christmas events to bring Singaporeans together.

Bluesky tests the dislike button and ‘social proximity’ to improve user interactions

Bluesky tests a private dislike button and ‘social proximity’ system to improve conversations and foster more meaningful online interactions.

Innovation drives legacy industries at TechInnovation 2025

Industry leaders at TechInnovation 2025 shared how innovation and collaboration are helping legacy businesses modernise for the future.

Informatica unveils Fall 2025 release to power the era of agentic AI

Informatica’s Fall 2025 release introduces new AI-driven data management tools to power agentic AI with trusted enterprise data.

Commvault launches Data Rooms to connect enterprise data with AI platforms securely

Commvault introduces Data Rooms, a secure platform enabling enterprises to safely activate and share backup data for AI use.

Related Articles

Popular Categories