AIAAIC - ChatGPT mostly gets programming questions wrong

Study: ChatGPT gets most programming questions wrong

Occurred: August 2023


ChatGPT wrongly answers over half of software engineering questions it receives, according to a research study.

Purdue University researchers analysed ChatGPT's responses to 517 Stack Overflow questions to assess the correctness, consistency, comprehensiveness, and conciseness of the chatbot's answers. They also ran linguistic and sentiment analyses and questioned a dozen volunteer participants.

The analysis showed that 52 percent of ChatGPT answers are incorrect, and 77 percent are verbose.

'Nonetheless', the researchers said, 'ChatGPT answers are still preferred 39.34 percent of the time due to their comprehensiveness and well-articulated language style.'

The paper raised questions about ChatGPT's ability to generate high-quality software engineering information.

System 🤖

ChatGPT

Operator: Stack Overflow; OpenAI
Developer: OpenAI
Country: USA
Sector: Technology
Purpose: Generate text
Technology: Chatbot; Generative AI; Machine learning
Issue: Accuracy/reliability

Research, advocacy 🧮

Kabir S. et al. (2023). Who Answers It Better? An In-Depth Analysis of ChatGPT and Stack Overflow Answers to Software Engineering Questions (pdf)
Borwankar S. (2023). Unraveling the Impact: An Empirical Investigation of ChatGPT's Exclusion from Stack Overflow

Page info
Type: Incident
Published: November 2023
