Stanford University Brainwash cafe facial recognition dataset
Released: 2014
Brainwash is a dataset of 11,917 images containing 91,146 'labelled' people, created by Stanford University researchers at San Francisco's Brainwash Cafe. Its principal aim was to 'train and validate their algorithm’s effectiveness.'
The dataset was removed 'at the request of the depositor' from Stanford University's website in June 2019 following the publication of researcher Adam Harvey's Exposing.ai project and a Financial Times investigation into facial recognition data sharing.
Transparency, privacy
Video footage was recorded over three days in October and November 2014 without the awareness or consent of Brainwash Cafe customers - a matter the New York Times notes was not addressed in Stanford's research paper on the project.
The researchers behind Brainwash - Russell Stewart, Mykhaylo Andriluka, and Andrew Ng - refused to comment publicly on the nature or removal of the dataset.
Data sharing
The Brainwash dataset was published online and has been cited by high-profile organisations across the world, including researchers affiliated with China's National University of Defense Technology in two research projects on advancing object recognition capabilities.
It 'also appears in a 2018 research paper affiliated with Megvii (Face++) ... who has provided surveillance technology to monitor Uighur Muslims in Xinjiang.'
Clips from the dataset remain available on YouTube.
Operator: Beijing University of Technology; Delft University of Technology; Honeywell Technology Solutions; Huawei; IDIAP Research Institute; IIT Madras; Megvii; National University of Defense Technology, China; North University of China; Shenzhen University; Qualcomm; University of Electronic Science and Technology of China
Developer: Stanford University; Russell Stewart; Mykhaylo Andriluka; Andrew Ng
Country: USA; China
Sector: Research/academia
Purpose: Train facial recognition systems
Technology: Dataset; Facial recognition; Computer vision
Issue: Privacy; Dual/multi-use
Transparency: Privacy
Dataset
Investigations, assessments, audits
Harvey, A., LaPlace, J. (2019). Exposing.ai
Murgia M., Financial Times (2019). Who’s using your face? The ugly truth about facial recognition
Research, advocacy
Li Y., Dou Y., Liu X., Li T. (2016). Localized region context and object feature fusion for people head detection
Zhao X., Wang Y., Dou Y. (2017). A Replacement Algorithm of Non-Maximum Suppression Base on Graph Clustering
News, commentary, analysis
https://www.nytimes.com/2019/07/13/technology/databases-faces-facial-recognition-technology.html
https://www.wired.com/story/secret-history-facial-recognition/
https://www.ft.com/content/7d3e0d6a-87a0-11e9-a028-86cea8523dc2
https://mashable.com/article/police-facial-recognition-algorithms-activism
https://www.tijd.be/dossier/legrandinconnu/brainwash/10136670.html
Page info
Type: Data
Published: May 2022