LAION-5B links to photos of identifiable Brazilian children
LAION-5B links to photos of identifiable Brazilian children
Occurred: June 2024
Report incident 🔥 | Improve page 💁 | Access database 🔢
Dataset LAION-5B was found to contain personal photos and details of identifiable Brazilian children without their knowledge or consent, prompting concerns about privacy and its creator's governance and integrity.
Human Rights Watch (HRW) discovered over 170 photos of children across 10 Brazilian states in the dataset, including names, ages, locations, and other identifying information. Some of the photos dated back to the mid-1990s, while others are as recent as 2023. All images had been posted by families on social media.
In one instance, details revealed a 2-year-old girl and newborn sister, with their names and the hospital where the baby was born.
Human Rights Watch said it only reviewed 0.0001 percent of the 5.85 billion images on LAION-5B, suggesting many more such images could be present. The images violate children's privacy and enable malicious actors to create explicit deepfakes exploiting them, HRW said.
LAION, the non-profit organisation behind the dataset, temporarily removed the offending images and said it would implement filters. LAION-5B was used to train Stable Diffusion, among other models.
The incident was seen to highlight LAION's seemingly slapdash approach to protecting personal privacy. It also prompted concerns about the lack of comprehensive data privacy laws to safeguard children and others from violations by AI-powered systems.
Operator: Human Rights Watch
Developer: LAION
Country: Brazil
Sector: Private - individual
Purpose: Pair text and images
Technology: Database/dataset; Neural network; Deep learning; Machine learning
Issue: Ethics/values; Privacy; Transparency
Human Rights Watch (2024). Brazil: Children’s Personal Photos Misused to Power AI Tools
https://www.wired.com/story/ai-tools-are-secretly-training-on-real-childrens-faces/
https://petapixel.com/2024/06/10/brazilian-childrens-photos-and-personal-details-found-in-ai-training-data-set/
https://siliconangle.com/2023/12/20/researchers-find-csam-images-laion-5b-ai-training-dataset/
https://www.jurist.org/news/2024/06/hrw-report-reveals-pictures-of-brazil-children-misused-by-ai-tool/
Page info
Type: Incident
Published: June 2024