Labeled Faces in the Wild (LFW) dataset

Report incident πŸ”₯ | Improve page πŸ’ | Access database πŸ”’

Labeled Faces in the Wild (LFW) is an open source dataset aimed at researchers that was intended to establish a public benchmark for facial verification.

Created by the University of Massachusetts, Amherst, and publicly released in 2007, LFW comprises over 13,000 facial images with different poses and expressions, under different lighting conditions. Each face is labeled with the name of the person, with 1,680 people having two or more distinct photos in the set.

LFW became the most widely used facial recognition benchmark in the world, according to the Financial Times.

Dataset info πŸ”’

Operator:
Developer: University of Massachussets, Amherst

Country: USA

Sector: Research/academia; Technology

Purpose: Train facial recognition systems

Technology: Dataset; Computer vision; Deep learning; Facial recognition; Facial detection; Facial analysis; Machine learning; Neural network; Pattern recognition
Issue: Bias/discrimination - race, ethnicity, gender; Ethics/values; Privacy

Transparency: Governance; Privacy

Risks and harms πŸ›‘


The Labeled Data in the Wild dataset has been criticised for privacy abuse and bias, and its potential misuse for surveillance and other purposes.

Transparency and accountability πŸ™ˆ

The Labeled Faces in the Wild (LFW) dataset is seen to suffer from several transparency limitations.