AIAAIC - Chinese large language model thinks it is ChatGPT

DeepSeek V3 large language model thinks it is ChatGPT

Occurred: December 2024
Page published: January 2025

Report incident🔥| Improve page 💁| Access database 🔢

A new large language model developed by the Chinese start-up DeepSeek gained attention for its impressive performance, as well as its peculiar behaviour of identifying itself as ChatGPT, suggesting it had been trained on OpenAI's product.

What happened

DeepSeek V3 has been reported to outperform established models like OpenAI's GPT-4 and Meta's Llama 3 on a number of benchmarks.

The model features 671 billion parameters and was trained at a notably low cost of approximately USD 5.58 million over two months.

However, it has also been observed that DeepSeek V3 sometimes claims to be a version of ChatGPT.

Why it happened

The phenomenon of DeepSeek V3 identifying itself as ChatGPT may stem from the training data used for its development, with commentators speculating that the system may have been trained on datasets that include outputs from ChatGPT.

Deepseek has not explained the behaviour of its system.

What it means

Deepseek V3 may perform strongly, but the start-up has been noticeably reluctant to discuss the sources of its training data, prompting questions about its ethics and integrity.

With AI models increasingly mimicing one another, understanding their unique characteristics and origins will become important differentators.

Large language model

A large language model (LLM) is a computational model capable of language generation or other natural language processing tasks.

Source: Wikipedia 🔗

System 🤖

DeepSeek V3 🔗

Developer: Deepseek
Country: China
Sector: Technology
Purpose: Provide information
Technology: Generative AI; Machine learning
Issue: Cheating/plagiarism; Transparency

News, commentary, analysis 🗞️

Related 🌐

AIAAIC Repository ID: AIAAIC1856

Google Sites

Report abuse