The datasets detailed on this page are listed alphabetically.
See also Incidents and Systems.
80 Million Tiny Images dataset
BDD100K driving video dataset
BookCorpus large language dataset
Books3 AI training dataset
C4 large language model dataset
Coronavirus Mask Image dataset
DiveFace dataset
DukeMTMC facial recognition dataset
GoEmotions dataset
HRT Transgender dataset
IARPA Janus Benchmark-C (IJB-C) dataset
IBM Diversity in Faces (DiF) dataset
ImageNet image recognition dataset
Labeled Faces in the Wild (LFW) dataset
LAION-5B image-text pairing dataset
LAION-400M image-text pairing dataset
Large-scale CelebFaces Attributes (CelebA) dataset
Library Genesis shadow library
MegaFace facial recognition dataset
Microsoft Celeb (MS-Celeb-1M) facial recognition dataset
NHS patient medical history data store
Oxford Town Centre dataset
People in Photo Albums (PIPA) dataset
People of Tinder dataset
Prosecraft fiction analytics database
Real-World Masked Face dataset
Simulated Masked Face Recognition Dataset (SMFRD)
Stanford University Brainwash cafe facial recognition dataset
Unconstrained College Students (UCCS) dataset
VGG-Face facial recognition dataset
WILDTRACK pedestrian detection dataset
Z-Library shadow library