This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to the copyright of their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
«showing 562 tags of 562 total tags for 409 datasets (1.37) »
|388||Open Images Dataset||Today, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. We tried ...||classification large-scale category real image deep annotation automatic||link||2017-07-02||110|
|289||ETHZ CVL Clust||MICCAI 2015 Challenge on Liver Ultrasound Tracking, Munich, October 9, 2015 (Full Day). Outline: Ultrasound (US) imaging is a widely used medical imaging techn...||medical liver tracking ultrasound therapy human organ benchmark real||link||2015-06-19||504|
|262||PHOS (Evaluating illumination invariance)||Phos is a color image database of 15 scenes captured under different illumination conditions. Every scene of the database contains 15 different images: 9 images...||Illumination invariance, real lighting conditions, uneven illumination, shadows, feature detection||link||2017-03-20||663|
|254||ChokePoint Dataset||We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e...||human pedestrian identification recognition multiview sequence face detection real world surveillance clustering||link||2015-05-02||1101|
|253||Street View House Number (SVHN)||SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatti...||streetview number recognition classification urban streetside detection text real world||link||2016-08-24||931|
|174||Pittsburgh Fast-food Image dataset||The Pittsburgh Fast-food Image dataset (PFID) consists of 4545 still images, 606 stereo pairs, 303 360° videos for structure from motion, and 27 privacy-preservi...||food recognition classification reconstruction video laboratory real||link||2017-05-27||1613|