VGG Face facial recognition dataset

VGG Face is a dataset created by University of Oxford researchers that comprises 2.6 million facial images of 2,622 people that was created to provide researchers working on facial recognition systems with access to biometric data.  

The dataset mostly comprises celebrities, public figures, actors, and politicians whose names were chosen 'by extracting males and females, ranked by popularity, from the Internet Movie Data Base (IMDB) celebrity list.' 

Information about ethnicity, age, and kinship was also collected from IMDB.

Dataset 🤖

Documents 📃

Operator: ChaLearn; Chinese Academy of Sciences; Delft University of Technology; Simula Research Laboratory; University of Applied Sciences & Arts Western Switzerland; University of California, Berkeley; Universitat Autònoma de Barcelona
Developer: University of Oxford

Country: UK

Sector: Research/academia

Purpose: Develop facial recognition systems

Technology: Database/dataset; Facial recognition  
Issue: Copyright; Ethics/values; Privacy

Transparency: Privacy

Risks and harms 🛑

The VGG Face dataset has raised significant ethical concerns and potential harms by collecting and distributing biometric data of over 2,600 individuals without their consent, potentially enabling privacy violations, surveillance, and the development of biased facial recognition technologies.

Transparency and accountability 🙈

The VGG Face dataset is seen to have several significant transparency limitations:

Investigations, assessments, audits 🧐

Page info
Type: Data
Published: January 2023
Last updated: June 2024