AIAAIC - ChatGPT's ability to generate accurate computer code plummets

Occurred: July 2023

ChatGPT has become less accurate at generating computer code and at performing other tasks, according to researchers at Stanford University and UC Berkeley.

The researchers compared the March and June 2023 versions of OpenAI's GPT-3.5 and GPT-4 large language models, which power ChatGPT, on tasks such as maths problem-solving, answering sensitive questions, code generation, and visual reasoning. They found that GPT-4's accuracy at identifying prime numbers fell from 97.6 percent in March to 2.4 percent in June.

However, some researchers found the study's methodology and findings unconvincing.

Princeton computer science professor Arvind Narayanan argued that the researchers failed to distinguish between ChatGPT's capabilities, which were acquired through pre-training, and its behaviour, which arises through regular fine-tuning.

System 🤖

ChatGPT

Operator:
Developer: OpenAI
Country: Global
Sector: Multiple
Purpose: Generate computer code
Technology: Chatbot; Generative AI; Machine learning
Issue: Accuracy/reliability

Research, advocacy 🧮

Chen L., Zaharia M., Zou J. How Is ChatGPT’s Behavior Changing over Time?

News, commentary, analysis 🗞️

https://www.tomshardware.com/news/chatgpt-response-quality-decline
https://www.popsci.com/technology/chatgpt-human-inaccurate/
https://www.businessinsider.com/chatgpt-ai-openai-research-gpt4-2023-7
https://fortune.com/2023/07/19/chatgpt-accuracy-stanford-study/
https://arstechnica.com/information-technology/2023/07/is-chatgpt-getting-worse-over-time-study-claims-yes-but-others-arent-sure/

Page info
Type: Incident
Published: November 2023
