The datasets detailed on this page are listed alphabetically.
See also Incidents and Systems.
80 Million Tiny Images dataset
BookCorpus large language dataset
Books3 AI training dataset
C4 large language model dataset
Coronavirus Mask Image dataset
DukeMTMC facial recognition dataset
GoEmotions dataset mis-labelling
HRT Transgender dataset
IARPA Janus Benchmark-C (IJB-C) dataset
IBM Diversity in Faces (DiF) dataset
ImageNet image recognition dataset
Labeled Faces in the Wild (LFW) dataset
LAION image-text pairings datasets
LAION-400M image-text pairing dataset
LAION-5B image-text pairing dataset
Large-scale CelebFaces Attributes (CelebA) dataset
Library Genesis shadow library
MegaFace facial recognition dataset
Microsoft Celeb (MS-Celeb-1M) facial recognition dataset
NHS patient medical history data store
Oxford Town Centre dataset
People in Photo Albums (PIPA) dataset
People of Tinder dataset
Prosecraft fiction analytics
Real-World Masked Face dataset
Simulated Masked Face Recognition (SMFRD) dataset
Stanford University Brainwash cafe facial recognition dataset
Unconstrained College Students (UCCS) dataset
VGG-Face facial recognition dataset
WILDTRACK pedestrian detection dataset
Z-Library shadow library