This website lists frequently used computer vision datasets. Wait, there is more!
Each entry also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, the site is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Showing 615 of 615 tags for 427 datasets (1.44 tags per dataset).
|347||MOCAT (TUB Multi-Object and Multi-Camera Tracking Dataset)||The TU Berlin Multi-Object and Multi-Camera Tracking Dataset (MOCAT) is a synthetic dataset to train and test tracking and detection systems in a virtual world....||synthetic tracking detection multi-class multi-view evaluation pedestrian vehicle animal||link||2016-11-02||664|
|316||Extreme Classification Repository||The Extreme Classification Repository: Multi-label Datasets & Code Kush Bhatia • Himanshu Jain • Prateek Jain • Manik Varma The objective in extreme multi...||machine learning multilabel classification benchmark evaluation||link||2017-10-25||814|
|230||FGVC-Aircraft||Fine-Grained Visual Classification of Aircraft (FGVC-Aircraft) is a benchmark dataset for the fine grained visual categorization of aircraft. Data, annotatio...||fine-grained classification recognition benchmark evaluation aircraft airplane||link||2017-02-16||1695|
|194||HCI 4D Lightfields||The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. For ma...||3d 4d lightfield benchmark depth reconstruction evaluation||link||2017-04-28||1323|
|177||SIPI textures||The Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenthesis (i...||texture segmentation classification benchmark synthetic evaluation||link||2013-08-20||1008|
|167||Text and Vision (TVGraz) Dataset||The Text and Vision (TVGraz) dataset is an annotated multi-modal dataset which currently contains 10 visual object categories, 4030 images and associated text. ...||text appearance classification evaluation||link||2018-01-03||1270|
|164||ICG Lab 6 (Multi-Camera Multi-Object Tracking)||The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came...||multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz||link||2017-12-05||1946|