Tuesday, 29 April 2025
29.2 C
Singapore
30.3 C
Thailand
26.5 C
Indonesia
28.9 C
Philippines

OpenAI’s new AI tool may soon take over your PC tasks

OpenAI's Operator tool may launch soon, promising AI-driven PC tasks. Leaked benchmarks show potential, but safety concerns remain.

OpenAI could be on the verge of releasing a groundbreaking AI tool that could potentially control your computer and carry out tasks on your behalf. This development is linked to the much-anticipated Operator tool, which has been rumoured to be in the works for some time.

Tibor Blaho, a software engineer known for accurately leaking AI product details, claims to have discovered fresh evidence of Operator’s progress. Earlier reports from outlets like Bloomberg hinted that Operator would be an “agentic” system, capable of automating tasks such as writing code or managing travel bookings.

Evidence points to a January release

According to a report by The Information, OpenAI is targeting a January release for Operator. Over the weekend, Blaho uncovered a hidden code in OpenAI’s ChatGPT macOS client, suggesting that the tool is nearing completion. The code includes options for shortcuts like “Toggle Operator” and “Force Quit Operator,” which remain disabled for now.

Blaho also reported that OpenAI’s website contains unpublished references to Operator, including performance tables that compare it to other computer-using AI systems. While these tables may be placeholders, they offer a glimpse into Operator’s potential capabilities and limitations.

The leaked benchmarks indicate mixed results. On OSWorld, a platform that evaluates AI performance in simulated computer environments, OpenAI’s “Computer Use Agent” (CUA)—believed to be the foundation for Operator—achieved a score of 38.1%. This surpasses the performance of a similar model by Anthropic but falls well below the human benchmark of 72.4%.

In web-based benchmarks, Operator performed exceptionally on WebVoyager, which measures an AI’s ability to navigate websites, but struggled on WebArena, another web interaction test. Specific tasks proved challenging; for example, Operator only managed a 60% success rate when setting up a virtual machine and a mere 10% success rate when creating a Bitcoin wallet.

Competitors and safety concerns

OpenAI’s move into the AI agent market comes amid growing competition from major players like Anthropic and Google. While still in their infancy, these AI-powered agents are being promoted as the next big advancement in artificial intelligence. Analysts at Markets and Markets predict that the AI agent market could grow to $47.1 billion by 2030.

However, the rise of AI agents has raised concerns about their reliability and safety. Leaked documents suggest Operators have undergone rigorous safety evaluations, including tests designed to prevent them from engaging in illicit activities or accessing sensitive personal data. Despite these efforts, some experts remain wary about the rapid development of such technology.

OpenAI has faced criticism for prioritising product releases over safety, with former employees and researchers expressing concerns about the potential risks. OpenAI co-founder Wojciech Zaremba recently criticised a competitor, Anthropic, for releasing an agent he claims lacks adequate safety features. “I can only imagine the negative reactions if OpenAI made a similar release,” Zaremba said in a post on X (formerly Twitter).

Looking ahead

OpenAI has yet to comment publicly on the leaks, and questions remain about how reliable and safe Operator will be in real-world applications. If the tool launches as planned, it could reshape how we interact with AI in everyday tasks, marking a significant step forward in developing autonomous systems.

As the tech world awaits further announcements, one thing is clear: the race to dominate the AI agent space is heating up, and Operator could be a game-changer.

Hot this week

Bitdefender launches GravityZone PHASR to combat stealthy endpoint threats

Bitdefender unveils GravityZone PHASR, a dynamic endpoint security tool that reduces attack surfaces using behaviour-based automation.

Neuralink’s journey towards merging minds with machines

Neuralink’s journey to merge human minds with machines marks a major milestone, offering transformative potential for healthcare, technology, and beyond.

Nintendo Pop-Up Store and Mario Kart Fun Return to Jewel Changi Airport

Experience the magic of Nintendo at Jewel Changi Airport with the return of the Pop-Up Store and the exciting Mario Kart Jewel Circuit Challenge!

Exclusive Networks: Are Singapore businesses ready for AI, cybersecurity and the 2025 digital landscape?

Explore how AI is transforming cybersecurity in Singapore, the impact of Budget 2025, workforce gaps, and risks facing ASEAN businesses.

SquareX secures US$20 million to transform browser security

SquareX raises US$20 million to strengthen browser security, offering enterprises an easy way to protect users without disrupting their workflows.

India could manufacture all US-bound iPhones by the end of 2026

Apple plans to manufacture all iPhones for the US market in India by the end of 2026 to avoid China tariffs and secure its supply chain.

Razer Launches Pro Click V2 and V2 Vertical Mice: Blending Gaming and Productivity

Razer's new Pro Click V2 and V2 Vertical mice offer gaming precision and ergonomic comfort, with AI prompt access and long battery life, available now!

Nintendo Pop-Up Store and Mario Kart Fun Return to Jewel Changi Airport

Experience the magic of Nintendo at Jewel Changi Airport with the return of the Pop-Up Store and the exciting Mario Kart Jewel Circuit Challenge!

Lian Li’s new Lancool 207 Digital case brings a 6-inch LCD screen to your PC

Lian Li's Lancool 207 Digital PC case brings a bright 6-inch LCD screen to your setup, offering style, function, and full customisation.

Related Articles

Popular Categories