AIs can guess Reddit users' age, location, and what they earn

Occurred: October-November 2023

Can you improve this page?
Share your insights with us

Large language models such as GPT-4 and are able to identify an individual's age, location, gender, and income with up to 85 per cent accuracy by analysing their posts on social media. 

ETH Zurich researchers discovered they were able to identify the place of birth, income bracket, gender, and location from information in the profiles or posts of 520 Reddit users using nine large language models.

OpenAI's GPT-4 was deemed the most accurate of the models, with an overall accuracy rate of 85 percent, and Meta's LlaMA-2-7b the least accurate model at 51 percent.

While personal details were explicitly stated in some posts, the findings raised concerns about the privacy implications of large language models and their chatbot counterparts such as ChatGPT

Databank

Operator: Alphabet/Google; Anthropic; Meta; OpenAI
Developer: Alphabet/Google; Anthropic; Meta; OpenAI
Country: Global
Sector: Media/entertainment/sports/arts
Purpose: Generate text
Technology: Chatbot; NLP/text analysis; Neural network; Deep learning; Machine learning; Reinforcement learning
Issue: Privacy
Transparency: Governance

System


Research, advocacy

News, commentary, analysis