DeepSeek V3 large language model thinks it is ChatGPT

Occurred: December 2024


A new large language model developed by the Chinese start-up DeepSeek has gained attention both for its impressive performance and for its peculiar habit of identifying itself as ChatGPT.

What happened

DeepSeek V3 has been reported to outperform established models like OpenAI's GPT-4 and Meta's Llama 3 on a number of benchmarks.

The model features 671 billion parameters and was trained at a notably low cost of approximately USD 5.58 million over two months.

However, it has also been observed that DeepSeek V3 sometimes claims to be a version of ChatGPT.

Why it happened

DeepSeek V3's tendency to identify itself as ChatGPT may stem from its training data, with commentators speculating that the system was trained on datasets that include outputs from ChatGPT.

DeepSeek has not explained the behaviour of its system.

What it means

DeepSeek V3 may perform strongly, but the start-up has been noticeably reluctant to discuss the sources of its training data, prompting questions about its ethics and integrity.

With AI models increasingly mimicking one another, their distinct characteristics and origins will become important differentiators.

Large language model

A large language model (LLM) is a computational model capable of language generation or other natural language processing tasks.

Source: Wikipedia

System

Operator:
Developer: DeepSeek
Country: China
Sector: Technology
Purpose: Provide information
Technology: Generative AI; Machine learning
Issue: Cheating/plagiarism; Transparency