Saturday, 28 June 2025
30.1 C
Singapore
32.7 C
Thailand
22.3 C
Indonesia
29.4 C
Philippines

New research highlights ChatGPT’s struggles in helping with coding tasks

New research from Purdue University reveals significant errors in ChatGPT's programming assistance, emphasising caution and calling for further study.

According to recent research, ChatGPT is still grappling with effectively assisting with programming issues despite becoming an overnight sensation. While many developers have turned to generative AI tools like GitHub’s Copilot to streamline their workflow and free up time for more productive tasks, a new study from Purdue University sheds light on significant shortcomings in ChatGPT’s performance.

Study reveals widespread errors

Researchers at Purdue University analysed 517 questions from Stack Overflow, comparing ChatGPT’s answers to those provided by human experts. The findings were startling: more than half (52%) of the responses generated by ChatGPT were incorrect. The breakdown of errors is as follows: 54% were conceptual misunderstandings, 36% were factual inaccuracies, 28% were logical mistakes in code, and 12% were terminology errors.

The study also highlighted that ChatGPT often produced unnecessarily lengthy and complex responses. This overabundance of detail can lead to confusion and distractions for developers seeking straightforward answers. Despite these issues, an ultra-small-scale poll involving 12 programmers revealed that one-third preferred ChatGPT’s articulate, textbook-like responses. This preference underscores how easily the AI’s seemingly authoritative tone can mislead coders.

Implications for the coding community

The implications of these findings are significant. Errors in coding can cascade, potentially causing problems across multiple departments or even entire organisations. The researchers emphasise the importance of caution when using ChatGPT for programming tasks.

They state, “Since ChatGPT produces many incorrect answers, our results emphasise the necessity of caution and awareness regarding the usage of ChatGPT answers in programming tasks.” This caution is vital to prevent minor coding errors from escalating into more significant, complex issues.

Call for further research and transparency

Beyond urging caution, the researchers advocate for further studies to identify and mitigate these errors. They also call for greater transparency and communication regarding the potential inaccuracies in ChatGPT’s responses. This openness is crucial for developers to make informed decisions about when and how to use AI tools in their workflows.

As the coding community continues to integrate AI into its practices, these findings serve as a reminder of the limitations and risks associated with relying too heavily on automated tools. While ChatGPT and similar technologies offer exciting possibilities, their current capabilities require scrutiny and responsible use to ensure they genuinely enhance productivity without introducing significant errors.

Hot this week

Microsoft lets you pin games and customise your Xbox Home screen with the latest update

Microsoft's latest update, which was released on June 24, allows you to pin games, hide apps, and customise your Xbox Home screen.

Adobe launches LLM Optimizer as AI replaces search engines in content discovery

Adobe unveils LLM Optimizer to help brands appear in AI chats like ChatGPT as AI becomes the new way people discover and shop.

Lenovo unveils new hybrid AI services and platforms to accelerate enterprise transformation

Lenovo expands its Hybrid AI Advantage with new services, solutions, and platforms to help enterprises scale and operationalise AI.

Samsung Galaxy Z Fold 7 dummy unit leak shows off its ultra-thin design

The Leaked Galaxy Z Fold 7 dummy unit reveals an ultra-slim 4.5mm thickness. The official launch is expected on July 9.

OIDIRE ODI-XDG10 Portable Baby Bottle Steriliser review: Compact, travel-friendly UV steriliser for modern parents

Compact and stylish, the OIDIRE ODI-XDG10 UV steriliser is a travel-friendly pick for modern parents who want clean bottles without the bulk.

Google adds precise Bluetooth tracking to Pixel Watch 3, but it’s not active yet

Pixel Watch 3 gets new Bluetooth tracking tech called Channel Sounding, which promises precise tracking but still needs full device support.

Meta may buy PlayAI to boost its voice cloning technology

Meta may buy AI voice cloning startup PlayAI to expand lifelike voice features in its apps, smart glasses, and AI assistants.

NVIDIA reveals RTX 5050 entry-level GPU – but is it worth your money?

NVIDIA’s RTX 5050 launches at US$249 with DLSS 3 and Blackwell tech, but better GPU options are only slightly more expensive.

OPPO Reno14 Pro launch offers a limited-time Dyson hairdryer bundle

From June 27 to July 6 in Singapore, get a free Dyson Supersonic hairdryer with OPPO Reno14 Pro or Reno14—this is a limited-time launch offer!

Related Articles

Popular Categories