Image-generation AIs memorise training images

Occurred: February 2023

High-profile AI image generators such as DALL-E and Stable Diffusion memorise images from the data they are trained on, raising concerns about potential copyright and privacy violations.

Researchers at Google DeepMind, Princeton, and other US universities extracted over one thousand training images from DALL-E, Google's Imagen, and Stable Diffusion, including photographs, film stills, copyrighted press photos, and trademarked company logos, and found that many could be regenerated almost exactly.

The researchers got the models to reproduce over a hundred training images 'nearly identically', often with only barely perceptible differences such as added image noise. The findings raise concerns about the reproduction and distribution of copyrighted material, as well as privacy risks for people who do not want their images used to train AI systems.
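The idea of a 'nearly identical' match can be illustrated with a simple pixel-distance check: two images count as near-copies if their average per-pixel difference falls below a small threshold. This is a minimal sketch in Python, not the researchers' actual measure (the paper used a calibrated distance over image patches); the threshold value here is an illustrative assumption.

```python
import math
import random

def rms_distance(a, b):
    """Root-mean-square per-pixel distance between two equal-length
    flat pixel lists with values in [0, 1]."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))

def is_near_copy(a, b, threshold=0.1):
    """True if two images differ only at roughly noise level.

    The 0.1 threshold is an illustrative assumption, not the value
    used in the research.
    """
    return len(a) == len(b) and rms_distance(a, b) < threshold

random.seed(0)
# A 64x64 grayscale 'training image' as a flat pixel list.
original = [random.random() for _ in range(64 * 64)]
# The same image with faint noise added, like a near-identical generation.
noised = [min(1.0, max(0.0, p + random.gauss(0, 0.02))) for p in original]
# An unrelated image for comparison.
unrelated = [random.random() for _ in range(64 * 64)]

print(is_near_copy(original, noised))     # noise-level change: near-copy
print(is_near_copy(original, unrelated))  # unrelated image: not a copy
```

Under this kind of criterion, a generation that differs from a training image only by faint noise is flagged as a memorised copy, while genuinely novel images are not.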


Operator: Nicholas Carlini, Jamie Hayes, Milad Nasr, Matthew Jagielski, Vikash Sehwag, Florian Tramèr, Borja Balle, Daphne Ippolito, Eric Wallace
Developer: Alphabet/Google; OpenAI; Stability AI
Country: Global
Sector: Multiple
Purpose: Generate images
Technology: Text-to-image; Diffusion model; Neural network; Deep learning; Machine learning
Issue: Copyright; Privacy
Transparency: Governance

