Tuesday, 29 April 2025
26.7 C
Singapore
27 C
Thailand
18.9 C
Indonesia
27.9 C
Philippines

ChatGPT could soon gain the ability to see

ChatGPT’s Advanced Voice Mode might soon include a live camera feature, enabling AI to identify objects and interact visually.

ChatGPT’s Advanced Voice Mode, known for enabling real-time conversations with a chatbot, might soon include visual capabilities. Code uncovered in the latest beta version of the app hints at introducing a “live camera” feature. This discovery in ChatGPT v1.2024.317, as reported by Android Authority, suggests that the rollout of this exciting feature could be just around the corner. However, OpenAI has yet to confirm an official release date.

A glimpse into the feature’s early tests

The idea of ChatGPT having a visual edge has been introduced previously. During the initial alpha testing phase of Advanced Voice Mode in May, OpenAI demonstrated its potential visual capabilities. In one example, the chatbot used a phone’s camera to identify a dog, recognise its ball, and associate the two in the context of playing fetch. This ability to observe, understand, and link objects to real-world scenarios was widely praised by early testers.

Alpha testers were quick to explore the feature’s uses. A notable example came from a user on X (formerly Twitter), Manuel Sainsily, who utilised the camera to ask questions about his new kitten. This interactive capability showcased how the feature could provide fun and practical benefits.

When Advanced Voice Mode entered beta testing in September for ChatGPT Plus and Enterprise users, its visual functionality was notably absent. Despite this, the voice feature gained immense popularity for enabling natural, dynamic conversations. According to OpenAI, users could interrupt the chatbot at any moment, and it could even pick up on the speaker’s emotional tone.

What sets it apart from competitors?

ChatGPT could have a unique edge over rivals like Google and Meta if the live camera feature is introduced. Google’s conversational AI, Gemini Live, may speak over 40 languages but lacks visual processing capabilities. Similarly, Meta’s Natural Voice Interactions, showcased at the Connect 2024 event in September, cannot use camera inputs. While these systems are competent in their ways, OpenAI’s visual feature could redefine how AI assistants interact with the world.

Desktop users can now enjoy Advanced Voice Mode

In a related update, OpenAI announced that Advanced Voice Mode is now available to paid ChatGPT Plus users on desktop. Previously limited to mobile devices, this update means users can now access this feature directly on their laptops or PCs.

The introduction of the live camera could mark a significant leap forward, combining the ability to see and hear into one seamless AI experience. While the exact timing remains uncertain, the potential impact of this development is already generating excitement among users and industry experts alike.

Hot this week

Vulnerability exploitation spikes as Tenable joins Verizon to highlight patching delays

Tenable reveals critical CVEs remain unpatched for over 200 days, risking exploitation, as highlighted in Verizon’s 2025 DBIR.

Lenovo introduces new ThinkPad mobile workstations and business laptops for the AI-ready workforce

Lenovo refreshes its ThinkPad lineup with new AI-ready mobile workstations and business laptops, enhancing mobility, performance, and security.

Gitex Asia x Ai Everything Singapore highlights robotics, AI and next-gen tech at inaugural event

Gitex Asia x Ai Everything Singapore highlights robotics, AI, startups, and tech innovations, shaping Southeast Asia’s digital future.

ChatGPT joins forces with The Washington Post in new content partnership

OpenAI partners with The Washington Post to bring trusted news summaries to ChatGPT, offering better access to reliable information.

GitLab announces general availability of GitLab Duo with Amazon Q

GitLab announces the general availability of GitLab Duo with Amazon Q, combining DevSecOps and AI to accelerate secure software development.

Razer Launches Pro Click V2 and V2 Vertical Mice: Blending Gaming and Productivity

Razer's new Pro Click V2 and V2 Vertical mice offer gaming precision and ergonomic comfort, with AI prompt access and long battery life, available now!

Nintendo Pop-Up Store and Mario Kart Fun Return to Jewel Changi Airport

Experience the magic of Nintendo at Jewel Changi Airport with the return of the Pop-Up Store and the exciting Mario Kart Jewel Circuit Challenge!

Lian Li’s new Lancool 207 Digital case brings a 6-inch LCD screen to your PC

Lian Li's Lancool 207 Digital PC case brings a bright 6-inch LCD screen to your setup, offering style, function, and full customisation.

Google to end support for early Nest thermostats on October 25

Google will stop supporting first—and second-generation Nest thermostats on October 25 and end new Nest launches in Europe.

Related Articles

Popular Categories