This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 641 tags of 641 total tags for 457 datasets (1.4) »
|393||ZuBuD+||ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previous datase...||landmark, building, image retrieval, urban||link||2017-07-17||253|
|345||MMSE Heartrate||The MMSE heart rate dataset measures the visual heart rate from. faces by throwing darts at people. ...||face landmark emotion heart rate biology||n/a||2016-10-21||641|
|320||San Francisco Landmark Dataset for Mobile Landmark Recognition||The San Francisco Landmark Dataset for Mobile Landmark Recognition is a set of images and query images for localization. We present the San Francisco Landmar...||retrieval localization city urban sanfrancisco landmark calibration gps mobile||link||2016-03-04||812|
|303||1DSfM Landmarks||The 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler ground tru...||3d reconstruction landmark groundtruth benchmark urban city||link||2015-08-05||882|
|280||Yahoo Flickr Creative Commons 100M||Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ...||flickr landmark image recognition detection reconstruction 3d clustering social community internet||link||2015-09-24||1124|
|256||Multi-Task Facial Landmark (MTFL) dataset||This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ...||face, landmark detection, deep learning, cnn, attribute||link||2015-11-07||2016|
|208||Landmark 1000||The Landmark 1000 or 1k dataset is a collection of the top 1000 popular flickr landmarks mined from flickr. It is maintained by Noah Snavely and published in...||landmark 3d reconstruction pose estimation pointcloud world location||link||2013-11-05||1236|
|200||Landmark 3D||This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope it coul...||landmark recognition classification retrieval 3d reconstruction codebook matching feature flickr||link||2016-08-09||1211|
|161||ICG Annotated Facial Landmarks in the Wild (AFLW)||The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet...||face detection landmark pose age annotation||link||2017-07-25||2606|
|152||Colosseum and San Marco||The Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets are ...||3d, reconstruction, landmark, urban, sfm, aerial, street, flickr||link||2017-11-28||1518|
|135||Quad 6K||The Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth positions o...||reconstruction, sfm, urban, groundtruth, landmark, 3d gps||link||2013-11-05||1211|
|131||Dubrovnik6K and Rome16K||The Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dubrovnik6...||reconstruction, sfm, urban, landmark, dubrovnik, rome||link||2017-03-10||1165|
|127||Stable Structure from Motion||The Stable Structure from Motion datasets due to size limitations cannot put the images online. Instead here are the tracked image points and the final reconstr...||sfm, reconstruction, geometry, stability, robust, 3d, landmark, church||link||2013-08-08||1471|
|120||Samantha||The SAMANTHA (Structure-and-Motion Pipeline on a Hierarchical Cluster Tree) dataset contains 4 sequences for 3D reconstruction: Pozzoveggiani, Piazza Dante, Pia...||reconstruction, sfm, landmark, model, geometry||link||2013-03-12||1331|
|84||Aachen Retrieval||The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their SIFT ...||retrieval, aachen, landmark, sfm, reconstruction||link||2013-03-11||1084|
|63||Paris500k||The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box rather ...||retrieval, paris, landmark, geotag, flickr, panoramio, sfm, reconstruction||link||2016-12-23||1366|
|54||Notre Dame||The Notre Dame de Paris dataset used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame by Mic...||limited, flickr, landmark, sfm, paris, frontview, reconstruction, 3d, pointcloud||link||2015-06-19||1123|
|46||Paris Retrieval||The Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/paris_1....||retrieval, urban, paris, landmark||link||2016-10-11||939|
|45||Oxford Buildings||The Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford landmarks. T...||retrieval, urban, oxford, landmark||link||2017-04-17||1081|