Tuesday, 16 September 2025
27.3 C
Singapore
26.9 C
Thailand
19 C
Indonesia
27.1 C
Philippines

OpenAI’s new AI tool may soon take over your PC tasks

OpenAI's Operator tool may launch soon, promising AI-driven PC tasks. Leaked benchmarks show potential, but safety concerns remain.

OpenAI could be on the verge of releasing a groundbreaking AI tool that could potentially control your computer and carry out tasks on your behalf. This development is linked to the much-anticipated Operator tool, which has been rumoured to be in the works for some time.

Tibor Blaho, a software engineer known for accurately leaking AI product details, claims to have discovered fresh evidence of Operator’s progress. Earlier reports from outlets like Bloomberg hinted that Operator would be an “agentic” system, capable of automating tasks such as writing code or managing travel bookings.

Evidence points to a January release

According to a report by The Information, OpenAI is targeting a January release for Operator. Over the weekend, Blaho uncovered a hidden code in OpenAI’s ChatGPT macOS client, suggesting that the tool is nearing completion. The code includes options for shortcuts like “Toggle Operator” and “Force Quit Operator,” which remain disabled for now.

Blaho also reported that OpenAI’s website contains unpublished references to Operator, including performance tables that compare it to other computer-using AI systems. While these tables may be placeholders, they offer a glimpse into Operator’s potential capabilities and limitations.

The leaked benchmarks indicate mixed results. On OSWorld, a platform that evaluates AI performance in simulated computer environments, OpenAI’s “Computer Use Agent” (CUA)—believed to be the foundation for Operator—achieved a score of 38.1%. This surpasses the performance of a similar model by Anthropic but falls well below the human benchmark of 72.4%.

In web-based benchmarks, Operator performed exceptionally on WebVoyager, which measures an AI’s ability to navigate websites, but struggled on WebArena, another web interaction test. Specific tasks proved challenging; for example, Operator only managed a 60% success rate when setting up a virtual machine and a mere 10% success rate when creating a Bitcoin wallet.

Competitors and safety concerns

OpenAI’s move into the AI agent market comes amid growing competition from major players like Anthropic and Google. While still in their infancy, these AI-powered agents are being promoted as the next big advancement in artificial intelligence. Analysts at Markets and Markets predict that the AI agent market could grow to $47.1 billion by 2030.

However, the rise of AI agents has raised concerns about their reliability and safety. Leaked documents suggest Operators have undergone rigorous safety evaluations, including tests designed to prevent them from engaging in illicit activities or accessing sensitive personal data. Despite these efforts, some experts remain wary about the rapid development of such technology.

OpenAI has faced criticism for prioritising product releases over safety, with former employees and researchers expressing concerns about the potential risks. OpenAI co-founder Wojciech Zaremba recently criticised a competitor, Anthropic, for releasing an agent he claims lacks adequate safety features. “I can only imagine the negative reactions if OpenAI made a similar release,” Zaremba said in a post on X (formerly Twitter).

Looking ahead

OpenAI has yet to comment publicly on the leaks, and questions remain about how reliable and safe Operator will be in real-world applications. If the tool launches as planned, it could reshape how we interact with AI in everyday tasks, marking a significant step forward in developing autonomous systems.

As the tech world awaits further announcements, one thing is clear: the race to dominate the AI agent space is heating up, and Operator could be a game-changer.

Hot this week

How earables are evolving into health and lifestyle assistants

Earables are evolving beyond audio, offering health tracking, translation, and navigation—shaping new opportunities across Southeast Asia.

Apple brings major upgrades to Powerbeats Pro 2 with iOS 26

Apple adds heart rate, fitness, and smart usability upgrades to Powerbeats Pro 2 with iOS 26, launching on 15 September.

Southeast Asia startup funding sinks to six-year low as investors turn selective

Startup funding in Southeast Asia fell to a six-year low in H1 2025, though Vietnam, Malaysia and late-stage deals showed resilience.

Delta Electronics showcases future-ready building automation at MARVEX 2025

Delta Electronics showcases future-ready building automation solutions at MARVEX 2025, focusing on sustainability, efficiency, and smarter operations.

Keeper Security publishes back-to-school cybersecurity guide for schools and families

Keeper Security has launched a back-to-school cybersecurity guide to help schools and families strengthen digital safety against rising threats.

Biwin unveils Mini SSD, a tiny storage device that could replace microSD cards

Biwin launches Mini SSD, a tiny yet powerful storage device that could replace microSD cards if industry standards are adopted.

Apple brings major upgrades to Powerbeats Pro 2 with iOS 26

Apple adds heart rate, fitness, and smart usability upgrades to Powerbeats Pro 2 with iOS 26, launching on 15 September.

UltraGreen.ai secures US$188 million anchor investment at US$1.3 billion valuation

UltraGreen.ai secures US$188 million anchor investment led by 65EP, Vitruvian, and August, valuing the firm at US$1.3 billion.

ConnectingDNA launches AI-powered DNA wellness marketplace in Singapore

ConnectingDNA launches the world’s first AI-powered DNA wellness marketplace in Singapore, offering personalised health insights and secure data protection.

Related Articles

Popular Categories