Thursday, 21 August 2025
25 C
Singapore
28.7 C
Thailand
19.2 C
Indonesia
28.2 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

HyperX unveils new gaming headsets and microphones with extended battery life

HyperX launches new headsets and microphones, including the Cloud Alpha 2, which boasts 250 hours of battery life, as well as new streaming microphones.

Chinese AI start-up partners with Cherrypicks to expand overseas

Chinese AI start-up Zhongke WengAI partners with Hong Kong’s Cherrypicks to expand AI solutions overseas and support innovation.

Google Cloud unveils new AI security capabilities at Security Summit 2025

Google Cloud reveals new AI-powered security tools at Security Summit 2025 to protect AI systems and boost cyber defence.

China hosts the first world humanoid robot games with record-breaking performances

China hosts its first World Humanoid Robot Games, where 280 teams competed in events ranging from running to cleaning, achieving record-breaking results.

Sony expands INZONE gaming gear line-up with new headsets, keyboard and mouse

Sony expands its INZONE gaming range with new headsets, in-ear headphones, a keyboard, mouse, and mats launching in Singapore.

MoneyMe partners with SEON to strengthen fraud prevention and credit decisioning

MoneyMe partners with SEON to boost fraud prevention and credit decisioning as it scales lending operations securely.

Sekiro: Shadows Die Twice to be adapted into anime on Crunchyroll in 2026

Sekiro: Shadows Die Twice will be adapted into a hand-drawn anime, Sekiro: No Defeat, streaming on Crunchyroll in 2026.

Meta introduces an AI dubbing tool for Instagram and Facebook videos

Meta rolls out an AI dubbing tool for Instagram and Facebook reels, starting with English-Spanish translations for eligible creators.

Google moves closer to nuclear power deal with Kairos and TVA

Google partners with TVA and Kairos Power on a new reactor in Tennessee, aiming to supply data centres with nuclear energy by 2030.

Related Articles

Popular Categories