This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description of each dataset covering common problems, pitfalls and characteristics, and now a searchable tag cloud.
Plus, the list is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
«showing 668 tags of 668 total tags for 472 datasets (1.42) »
|471||CrowdFlower||Note: commercial annotation platform, not a publicly released dataset. Our Human-in-the-Loop Machine Learning platform transforms unstructured text, image, audio, ...||dataset benchmark annotation||link||2018-09-11||59|
|470||MVOR||MVOR is a Multi-view Multi-person RGB-D Operating Room Dataset for 2D and 3D Human Pose Estimation. We are pleased to announce the release of the MVOR datase...||medical clinical human annotation multiview pose estimation rgbd operation hospital||link||2018-10-08||26|
|444||Supervisely Person Dataset||The Supervisely Person Dataset consists of 5711 images with 6884 high-quality annotated person instances. All steps below are done inside Supervisely without a...||person pedestrian segmentation semantic mask supervisely annotation automatic dataset instance||link||2018-10-08||505|
|418||Udacity Annotated Driving Datasets||Udacity Annotated Driving Datasets comprise two datasets. Dataset 1: The dataset includes driving in Mountain View, California and neighboring cities during dayli...||classification segmentation urban street selfdriving autonomous udacity annotation california city daylight||link||2017-11-08||454|
|404||Zurich Summer Dataset||The Zurich Summer v1.0 dataset is a collection of 20 chips (crops), taken from a QuickBird acquisition of the city of Zurich (Switzerland) in August 2002. Quick...||satellite segmentation semantic aerial urban city zurich pan nir rgb gsd superpixel annotation||link||2017-09-12||376|
|398||Osnabrück - Gaze Tracking Data Set||Gaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with polygo...||segmentation, gaze data, polygon annotation, video, metadata||link||2018-02-13||324|
|396||ADE20k||Scene Parsing Benchmark. Scene parsing data and part segmentation data derived from the ADE20K dataset can be downloaded from the MIT Scene Parsing Benchmark. Images ...||segmentation semantic annotation benchmark scene recognition||link||2017-08-03||363|
|388||Open Images Dataset v4||Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ...||classification large-scale category real image deep annotation automatic benchmark boundingbox||link||2018-09-11||533|
|372||VOT2016 segmentation||The VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from the VOT2016 dataset. The annotation is in the form of BW image...||object tracking segmentation mask annotation visual||link||2017-04-17||432|
|354||Facial Expression Research Group Database (FERG-DB), University of Washington, Seattle||FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The chara...||Face, Facial expression, Animation, Stylization, annotation emotion, deep learning, anger, sad, joy, disgust, surprise, neutral, fear, cardinal classification, human transfer, image retrieval||link||2017-02-27||927|
|353||COCO-Stuff||COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks like sema...||semantic segmentation stuff things COCO captioning annotation groundtruth benchmark||link||2017-02-16||858|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||2988|