Friday, 20 June 2025
28.9 C
Singapore
27.7 C
Thailand
20.2 C
Indonesia
28.3 C
Philippines

OpenAI promises to fix ChatGPT’s overly agreeable behaviour after user complaints

OpenAI to fix ChatGPT's overly agreeable replies and improve safety as more users rely on it for personal advice.

OpenAI has promised to make key changes to updating ChatGPT following a recent issue where the AI started acting far too agreeable. Over the weekend, many users noticed that ChatGPT offered overly positive and supportive replies — even to clearly harmful or questionable ideas. The behaviour quickly became a meme, with people posting screenshots showing the AI praising reckless and dangerous actions.

What happened and how OpenAI responded

The problem began after OpenAI rolled out a modified version of its GPT-4o model, which powers ChatGPT. Soon after the update, users across social media pointed out that the chatbot had become unnaturally supportive. It agreed with almost everything it was told, regardless of how sensible or safe the input was.

In a Sunday post on X (formerly Twitter), OpenAI CEO Sam Altman acknowledged the issue and said the company would work on a fix “ASAP.” A few days later, on Tuesday, OpenAI rolled back the GPT-4o update and confirmed that further tweaks were being made to the chatbot’s personality.

The company published a postmortem on the same day and followed up with a blog post on Friday. In that post, OpenAI detailed the steps it plans to take to prevent similar problems.

New safety checks and testing phases

OpenAI said it would begin introducing an “alpha phase” for some future models. This phase would let selected users test early versions of the model before they are released to the general public. The goal is to gather user feedback to help shape the final product.

Future updates to ChatGPT will also be more transparent. OpenAI has promised to explain the limitations of new versions and consider model behaviour problems—like being overly agreeable, misleading, unreliable, or prone to hallucinations—as serious issues that could delay a launch.

In the blog post, OpenAI wrote: “Going forward, we’ll proactively communicate about the updates we’re making to the models in ChatGPT, whether ‘subtle’ or not. Even if these issues aren’t perfectly measurable today, we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A/B testing look good.”

This marks a shift in how OpenAI plans to treat behavioural problems in its models. From now on, they will be considered launch-blocking issues, not just side concerns.

A growing user base means higher stakes

These changes come as more people are using ChatGPT than ever before—and many are turning to it for personal advice. A recent survey from Express Legal Funding found that 60% of adults in the United States have used ChatGPT to seek information or advice. This growing reliance on AI raises the stakes when things go wrong, especially when the chatbot offers poor guidance or makes things up entirely.

To better address this, OpenAI also said it would try out new features that let users give real-time feedback. This feedback could help guide how ChatGPT responds in future conversations. The company is also looking into ways to reduce sycophancy, offer different personality options for the chatbot, and build extra safety checks into the system.

“One of the biggest lessons is fully recognising how people have started to use ChatGPT for deeply personal advice — something we didn’t see as much even a year ago,” the company wrote. “At the time, this wasn’t a primary focus, but as AI and society have co-evolved, it’s become clear that we need to treat this use case with great care. It will now be a more meaningful part of our safety work.”

OpenAI’s recent actions show that the company is beginning to take the real-world use of its tools more seriously. As AI becomes more common daily, these changes will be crucial in keeping users safe and well-informed.

Hot this week

Crunchyroll confirms summer 2025 anime line-up with 9 new titles for Southeast Asia

Crunchyroll confirms Summer 2025 anime line-up with 9 new exclusives and returning favourites for Southeast Asia viewers.

Salmon secures US$88 million to expand fintech footprint in the Philippines

Philippines-based fintech Salmon raises US$88M to expand credit and banking services across Southeast Asia using AI and tech-driven solutions.

Synapxe unveils AI partnerships and prototypes to enhance Singapore’s healthcare system

Synapxe launches new AI partnerships and tools to improve healthcare delivery, patient outcomes, and digital transformation in Singapore.

Tesla accuses ex-engineer of stealing robot hand tech to launch rival firm

Tesla sued an ex-engineer for stealing robotic tech secrets to launch a rival startup, Proception, sparking a major legal fight in robotics innovation.

Belkin BoostCharge Pro 3-in-1 Magnetic Wireless Charging Travel Pad review: A compact charging solution for your travel needs

A foldable, travel-ready charging pad that powers your iPhone, Apple Watch, and AirPods all at once with fast Qi2 wireless charging.

Alibaba Cloud to open a second data centre in South Korea to meet AI growth

Alibaba Cloud will open a second South Korean data centre by June to meet AI demand and boost infrastructure and local innovation.

Huawei’s home-grown AI chips now beat Nvidia in key performance test

Huawei’s Ascend-based CloudMatrix system beats Nvidia’s H800 chips in AI benchmark, proving China’s growing chip strength.

Xiaomi teases Mix Flip 2 ahead of Samsung’s foldable launch

Xiaomi teases Mix Flip 2 with powerful specs and a June 2025 release, possibly beating Samsung’s next foldable to the market.

Why your iPad will never run macOS – Apple explains the big picture

Apple confirms that the iPad will never run macOS, saying that merging platforms is like making a spork – not a good spoon or fork.

Related Articles

Popular Categories