Sunday, 30 November 2025
26.9 C
Singapore
14.5 C
Thailand
21.2 C
Indonesia
27.4 C
Philippines

New research highlights ChatGPT’s struggles in helping with coding tasks

New research from Purdue University reveals significant errors in ChatGPT's programming assistance, emphasising caution and calling for further study.

According to recent research, ChatGPT is still grappling with effectively assisting with programming issues despite becoming an overnight sensation. While many developers have turned to generative AI tools like GitHub’s Copilot to streamline their workflow and free up time for more productive tasks, a new study from Purdue University sheds light on significant shortcomings in ChatGPT’s performance.

Study reveals widespread errors

Researchers at Purdue University analysed 517 questions from Stack Overflow, comparing ChatGPT’s answers to those provided by human experts. The findings were startling: more than half (52%) of the responses generated by ChatGPT were incorrect. The breakdown of errors is as follows: 54% were conceptual misunderstandings, 36% were factual inaccuracies, 28% were logical mistakes in code, and 12% were terminology errors.

The study also highlighted that ChatGPT often produced unnecessarily lengthy and complex responses. This overabundance of detail can lead to confusion and distractions for developers seeking straightforward answers. Despite these issues, an ultra-small-scale poll involving 12 programmers revealed that one-third preferred ChatGPT’s articulate, textbook-like responses. This preference underscores how easily the AI’s seemingly authoritative tone can mislead coders.

Implications for the coding community

The implications of these findings are significant. Errors in coding can cascade, potentially causing problems across multiple departments or even entire organisations. The researchers emphasise the importance of caution when using ChatGPT for programming tasks.

They state, “Since ChatGPT produces many incorrect answers, our results emphasise the necessity of caution and awareness regarding the usage of ChatGPT answers in programming tasks.” This caution is vital to prevent minor coding errors from escalating into more significant, complex issues.

Call for further research and transparency

Beyond urging caution, the researchers advocate for further studies to identify and mitigate these errors. They also call for greater transparency and communication regarding the potential inaccuracies in ChatGPT’s responses. This openness is crucial for developers to make informed decisions about when and how to use AI tools in their workflows.

As the coding community continues to integrate AI into its practices, these findings serve as a reminder of the limitations and risks associated with relying too heavily on automated tools. While ChatGPT and similar technologies offer exciting possibilities, their current capabilities require scrutiny and responsible use to ensure they genuinely enhance productivity without introducing significant errors.

Hot this week

Global mobile gaming ads surge in 2025 as AI and interactivity reshape engagement

Mobile gaming ads grew strongly in 2025 as AI-driven optimisation and interactive formats reshaped global user acquisition strategies.

Sumsub reports sharp rise in synthetic personal data fraud in APAC

Sumsub reports a sharp rise in synthetic identity fraud and deepfake attacks across APAC as AI-driven scams become more sophisticated.

Belkin UltraCharge Pro 3-in-1 Magnetic Charging Dock with Qi2 25W review: Fast, quiet and convenient charging

Belkin UltraCharge Pro 3-in-1 Magnetic Charging Dock with Qi2 25W offers fast, quiet and convenient wireless charging for iPhone, Apple Watch and AirPods.

Google limits free Nano Banana Pro image generation due to high demand

Google is reducing free Nano Banana Pro and Gemini 3 Pro usage due to high demand, limiting daily access while paid plans remain unchanged.

Kaspersky reports surge in shopping phishing and gaming-related attacks in 2025

Kaspersky reports 6.4 million shopping phishing attempts and more than 20 million gaming-related attacks detected in 2025.

DeepSeek launches open AI model achieving gold-level scores at the Maths Olympiad

DeepSeek launches Math-V2, the first open AI model to achieve gold-level scores at the International Mathematical Olympiad.

AI browsers vulnerable to covert hacks using simple URL fragments, experts warn

Experts warn AI browsers can be hacked with hidden URL fragments, posing risks invisible to traditional security measures.

Slop Evader filters out AI content to restore pre-ChatGPT internet

Slop Evader filters AI-generated content online, restoring pre-ChatGPT search results for a more human web.

Lara Croft becomes gaming’s best-selling heroine amid new Tomb Raider rumours

Lara Croft becomes gaming’s best-selling heroine as new Tomb Raider rumours fuel excitement.

Related Articles

Popular Categories