Sunday, 15 June 2025
34 C
Singapore
32.7 C
Thailand
24.5 C
Indonesia
29.8 C
Philippines

OpenAI launches GPT-4o: Fast, free, and versatile

OpenAI's GPT-4o offers near-human response times and versatile multimodal capabilities, challenging existing tools and enhancing developer access.

During its Spring update, OpenAI announced the release of GPT-4o (“o” for “omni”), its latest flagship generative AI model. This new model offers near-human response times, with audio inputs processed in just 232 milliseconds, similar to human conversational speeds.

GPT-4o is designed to handle queries in various formats, including text, audio, and images, and it generates responses in the same formats. In its demonstration, the model was showcased in voice mode, allowing users to speak directly to ChatGPT. The blog page also features six preset voices—three male and three female—that can read the page aloud.

Advanced recognition and summarisation

The model’s demos highlighted GPT-4o’s ability to recognise and respond to diverse inputs such as screenshots, videos, photos, documents, charts, facial expressions, and handwritten notes. Notably, GPT-4o can create detailed summaries of video presentations and meetings with multiple attendees from voice recordings, posing a potential challenge to transcription tools like Otter.ai.

Developer access and new applications

Developers can now access GPT-4o through the API as a text and vision model. This access is available at half the cost and with five times the rate limits compared to GPT-4 Turbo. Additionally, a new desktop app for Mac is available, with a Windows version on the way.

OpenAI’s release of GPT-4o underscores its dedication to advancing AI technology and making it more accessible. With its impressive speed and versatility, GPT-4o offers vast potential applications for both individuals and businesses.

Hot this week

OpenAI delays the release of new open model until later this summer

OpenAI delayed its new open AI model, now expected later this summer, aiming to rival Mistral and Qwen.

Amazon taps nuclear power to boost AWS cloud energy supply

Amazon signs a 1.92 GW nuclear energy deal with Talen to power AWS cloud and explore new small modular reactors in Pennsylvania.

Smart partners with Salesforce to launch AI-powered unified e-commerce platform

Smart partners with Salesforce to build a unified, AI-powered e-commerce site, bringing seamless online services to over 50 million users.

Singapore Airlines and PALO IT test generative AI for faster software development

Singapore Airlines and PALO IT successfully trial Gen-e2, an AI-first software development approach powered by GitHub Copilot.

Proofpoint opens new Singapore office to expand APAC operations and AI capabilities

Proofpoint opens new Singapore office to expand APAC presence and boost AI-led, human-centric cybersecurity efforts across the region.

Hong Kong opens skies to larger drones in bid to grow low-altitude economy

Hong Kong will allow the testing of larger drones to boost its low-altitude economy and improve logistics, following mainland China's lead.

Hong Kong to build new AI supercomputing centre in bid to lead global tech race

Hong Kong plans a new AI supercomputing centre to boost its tech hub status and support growing start-ups across the Greater Bay Area.

Steam adds full native support for Apple Silicon Macs

Steam runs natively on Apple Silicon Macs, ditching Rosetta 2 for smoother performance and better gaming on M1 and M2 devices.

Amazon taps nuclear power to boost AWS cloud energy supply

Amazon signs a 1.92 GW nuclear energy deal with Talen to power AWS cloud and explore new small modular reactors in Pennsylvania.

Related Articles

Popular Categories