ChatGPT's ability to generate accurate computer code plummets 

Occurred: July 2023

Can you improve this page?
Share your insights with us

ChatGPT has become less accurate at generating computer code and other tasks, according to Stanford University and UC Berkely researchers.

Using the March and June 2023 versions of OpenAI's GPT-3.5 and GPT-4 large language models - which power ChatGPT - on tasks such as maths problem-solving, answering sensitive questions, code generation, and visual reasoning, the researchers found GPT-4's ability to identify prime numbers declined significantly from an accuracy of 97.6 percent in March to 2.4 percent in June. 

However, the study's methodology and findings were said to be unconvincing by some researchers. Princeton computer science professor Arvind Narayanan argued that the researchers failed to distinguish between ChatGPT's capabilities, which were acquired through pre-training, and its behaviour, which arises through regular fine-tuning.

Databank

Operator:  
Developer: OpenAI
Country: Global
Sector: Multiple
Purpose: Generate computer code
Technology: Chatbot; NLP/text analysis; Neural network; Deep learning; Machine learning; Reinforcement learning
Issue: Accuracy/reliability
Transparency: Governance

System


Research, advocacy


News, commentary, analysis