Tuesday, 29 April 2025
29.2 C
Singapore
30.3 C
Thailand
26.5 C
Indonesia
28.9 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

Anthropic aims to uncover how AI models think by 2027

Anthropic CEO Dario Amodei aims to understand how AI models work by 2027 and urges industry-wide action for safety and transparency.

Tenable uncovers critical privilege escalation flaw in Google Cloud Composer

Tenable exposes a GCP vulnerability in Cloud Composer that allows privilege escalation through interdependent cloud services.

Step inside Brooklyn’s cardboard coworking space for AI chatbots

Step inside Chat Haus, a clever cardboard coworking space for AI chatbots in Brooklyn. It offers a playful take on the future of creativity.

GumGum reports digital ads up to 90% more carbon efficient than industry average

GumGum cuts digital ad emissions by up to 90% versus industry norms, using global sustainability standards and Cedara’s carbon reporting tools.

Veeam report reveals nearly 70% of organisations still targeted by ransomware

Nearly 70% of organisations were hit by ransomware last year, says Veeam, urging stronger recovery strategies and proactive resilience.

India could manufacture all US-bound iPhones by the end of 2026

Apple plans to manufacture all iPhones for the US market in India by the end of 2026 to avoid China tariffs and secure its supply chain.

Razer Launches Pro Click V2 and V2 Vertical Mice: Blending Gaming and Productivity

Razer's new Pro Click V2 and V2 Vertical mice offer gaming precision and ergonomic comfort, with AI prompt access and long battery life, available now!

Nintendo Pop-Up Store and Mario Kart Fun Return to Jewel Changi Airport

Experience the magic of Nintendo at Jewel Changi Airport with the return of the Pop-Up Store and the exciting Mario Kart Jewel Circuit Challenge!

Lian Li’s new Lancool 207 Digital case brings a 6-inch LCD screen to your PC

Lian Li's Lancool 207 Digital PC case brings a bright 6-inch LCD screen to your setup, offering style, function, and full customisation.

Related Articles

Popular Categories