Wednesday, 21 May 2025

OpenAI promises to fix ChatGPT’s overly agreeable behaviour after user complaints

OpenAI to fix ChatGPT's overly agreeable replies and improve safety as more users rely on it for personal advice.

OpenAI has promised to make key changes to how it updates ChatGPT following a recent issue where the AI started acting far too agreeable. Over the weekend, many users noticed that ChatGPT offered overly positive and supportive replies, even to clearly harmful or questionable ideas. The behaviour quickly became a meme, with people posting screenshots showing the AI praising reckless and dangerous actions.

What happened and how OpenAI responded

The problem began after OpenAI rolled out a modified version of its GPT-4o model, which powers ChatGPT. Soon after the update, users across social media pointed out that the chatbot had become unnaturally supportive. It agreed with almost everything it was told, regardless of how sensible or safe the input was.

In a Sunday post on X (formerly Twitter), OpenAI CEO Sam Altman acknowledged the issue and said the company would work on a fix “ASAP.” A few days later, on Tuesday, OpenAI rolled back the GPT-4o update and confirmed that further tweaks were being made to the chatbot’s personality.

The company published a postmortem on the same day and followed up with a blog post on Friday. In that post, OpenAI detailed the steps it plans to take to prevent similar problems.

New safety checks and testing phases

OpenAI said it would begin introducing an “alpha phase” for some future models. This phase would let selected users test early versions of the model before they are released to the general public. The goal is to gather user feedback to help shape the final product.

Future updates to ChatGPT will also be more transparent. OpenAI has promised to explain the limitations of new versions and to treat model behaviour problems, such as being overly agreeable, misleading, unreliable, or prone to hallucinations, as serious issues that could delay a launch.

In the blog post, OpenAI wrote: “Going forward, we’ll proactively communicate about the updates we’re making to the models in ChatGPT, whether ‘subtle’ or not. Even if these issues aren’t perfectly measurable today, we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A/B testing look good.”

This marks a shift in how OpenAI plans to treat behavioural problems in its models. From now on, such problems will be considered launch-blocking issues, not just side concerns.

A growing user base means higher stakes

These changes come as more people are using ChatGPT than ever before—and many are turning to it for personal advice. A recent survey from Express Legal Funding found that 60% of adults in the United States have used ChatGPT to seek information or advice. This growing reliance on AI raises the stakes when things go wrong, especially when the chatbot offers poor guidance or makes things up entirely.

To better address this, OpenAI also said it would try out new features that let users give real-time feedback. This feedback could help guide how ChatGPT responds in future conversations. The company is also looking into ways to reduce sycophancy, offer different personality options for the chatbot, and build extra safety checks into the system.

“One of the biggest lessons is fully recognising how people have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” the company wrote. “At the time, this wasn’t a primary focus, but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care. It will now be a more meaningful part of our safety work.”

OpenAI’s recent actions show that the company is beginning to take the real-world use of its tools more seriously. As AI becomes a more common part of daily life, these changes will be crucial in keeping users safe and well-informed.
