This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 591 tags of 591 total tags for 420 datasets (1.41) »
|417||Visual Lip Reading Feasibility (VRLF)||The VLRF database is designed with the aim to contribute to research in visual only speech recognition. A key difference of the VLRF database with respect to ex...||lip reading recognition speaker spanish language mouth face speech||link||2017-11-07||28|
|412||MegaAge Dataset||We introduce a new large-scale MegaAge dataset that consists of 41,941 faces annotated with age posterior distributions. We also provide the MegaAge-Asian datas...||Face Analysis, Age Estimation||link||2017-10-12||71|
|402||GeoFaces||A large dataset of geotagged face images collected from Flickr. The zip file contains text files containing urls of the images. Face2GPS: Estimating Geograph...||face localization geotagged classification gender age human||link||2017-09-06||104|
|364||ETH CVL IMDB WIKI Faces||Since the publicly available face image datasets are often of small to medium size, rarely exceeding tens of thousands of images, and often without age informat...||face imdb wikipedia detection recognition age biometry||link||2017-02-22||332|
|355||IMPART multi-modal/multi-view||The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The sourc...||multi-view multi-mode video rgbd lidar 3d model color indoor outdoor dynamic action face human emotion||link||2017-01-01||364|
|354||Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle||FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara...||Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval||link||2017-02-27||514|
|345||MMSE Heartrate||The MMSE heart rate dataset measures the visual heart rate from. faces by throwing darts at people. ...||face landmark emotion heart rate biology||n/a||2016-10-21||486|
|340||Ljubljana CVL Face Database||Database contains 798 images of 114 persons, with 7 images per person and is freely available for research purposes. All images were taken in supervised conditi...||face pedestrian person recognition biometry human illumination lighting||link||2017-02-22||451|
|329||Virginia Tech and Arab Academy for Science & Technology (VT-AAST) The VT-AAST Benchmarking Dataset||A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques. A new color face image database for ...||face, detection, skin, segmentation, benchmarking,||link||2016-07-11||564|
|314||WIDER FACE: A Face Detection Benchmark||WIDER FACE dataset is a large-scale face detection benchmark dataset with 32,203 images and 393,703 face annotations, which have high degree of variabilities in...||face detection scale pose occlusion||link||2016-02-11||1004|
|310||FASSEG - FAce Semantic Segmentation||The FAce Semantic SEGmentation (FASSEG) repository contains datasets for multi-class semantic face segmentation. The FASSEG repository is composed by two dat...||face, segmentation||link||2017-04-04||1005|
|290||UWO GCO Volume Segmentation||The Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc consistenc...||medical liver babyface bone abdomen adhead face segmentation binary optimization||link||2015-06-19||534|
|261||MPI Multi-View Collection GVV datasets||Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of the G...||video multiview tracking face mesh reconstruction depth human action pose||link||2014-12-10||754|
|257||FaceScrub||The FaceScrub dataset comprises a total of 107818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per person. M...||face detection recognition celebrity people human||link||2017-11-12||932|
|256||Multi-Task Facial Landmark (MTFL) dataset||This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ...||face, landmark detection, deep learning, cnn, attribute||link||2015-11-07||1745|
|254||ChokePoint Dataset||We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e...||human pedestrian identification recognition multiview sequence face detection real world surveillance clustering||link||2015-05-02||1202|
|220||3D Mask Attack Dataset||The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for both re...||3d biometry face recognition segmentation frontview emotion||link||2016-03-14||1074|
|211||POSTECH Labeled Faces in the Wild||POS Labeled Faces in the Wild, a collection of face which is proposed for studying face identification in unconstrained environment, its purpose is serving as a...||face identification wild recognition registration||link||2015-09-10||1162|
|192||Our Database of Faces||The Our Database of Faces (ORL) dataset contains ten different images of each of 40 distinct subjects. For some subjects, the images were taken at different tim...||face recognition illumination human expression||link||2013-09-23||978|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||2202|
|51||PN Learning||PN Learning - How does TLD work? Tracking estimates the object location as long as the object is visible. During tracking all observed patterns of the object...||single target tracking learning object pedestrian bike face||link||2017-11-28||739|
|50||Babenko tracking||The Babenko tracking dataset contains 12 video sequences for single object tracking. For each clip they provide (1) a directory with the original image s...||tracking single object animal face occlusion video||link||2016-08-08||2385|
|29||The Yale Face||The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to...||face, pedestrian, detection, pose, illumination||link||2015-06-23||837|
|28||CMU Faces - Frontal faces||The MIT + CMU frontal face dataset from H. Rowley contains 130 images with 507 labeled frontal faces from movie, portrait and media sources. It is mostly graysc...||frontview, face, detection object boundingbox||link||2015-06-19||870|
|27||Idiap/ETHZ Faces and Poses||Idiap/ETHZ Faces and Poses Dataset dataset by L. Jie, B. Caputo and V. Ferrari contains 1703 image-caption pairs. [author] Captions contain the names of some of...||face, pose, pedestrian, text||link||2013-03-11||831|