The datasets (definition) detailed on this page are listed alphabetically.
See also Incidents and Systems.
80 Million Tiny Images
BDD100K
BookCorpus
Books3
Brainwash
C4
Common Crawl
Coronavirus Mask Image Dataset
DiveFace
Diversity in Faces (DiF)
DukeMTMC
GoEmotions
HRT Transgender Dataset
IARPA Janus Benchmark-C (IJB-C)
ImageNet
Labeled Faces in the Wild (LFW)
LAION-5B
LAION-400M
Large-scale CelebFaces Attributes (CelebA)
Library Genesis
MegaFace
Microsoft Celeb (MS-Celeb-1M)
NHS patient medical history data store
Oxford Town Centre
People in Photo Albums (PIPA)
People of Tinder
Prosecraft
Real-World Masked Face
Simulated Masked Face Recognition Dataset (SMFRD)
The Pile
Unconstrained College Students (UCCS)
VGG-Face
WILDTRACK
YouTube Subtitles
Z-Library