This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also comes with a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by their respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
|355||IMPART multi-modal/multi-view||The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The sourc...||multi-view multi-mode video rgbd lidar 3d model color indoor outdoor dynamic action face human emotion||link||2017-01-01||144|
|331||EuRoC MAV Dataset||This web page presents visual-inertial datasets collected on-board a Micro Aerial Vehicle (MAV). The datasets contain stereo images, synchronized IMU measuremen...||aerial vehicles, indoor, global shutter, slam||link||2016-07-18||406|
|327||PIROPO Database: People in Indoor ROoms with Perspective and Omnidirectional cameras||The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor rooms, u...||people surveillance perspective omnidirectional fisheye indoor room detection human||link||2017-02-16||454|
|295||Rent3D||The Rent3D dataset comprises floorplans and images. The goal of this work is to enable a 3D virtual-tour of an apartment given a small set of monocular images o...||indoor building reconstruction layout floorplan apartment urban||link||2015-07-13||418|
|286||HDA Person Dataset - ISR Lisbon||The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De...||Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human||link||2017-01-26||1235|
|271||Labeling in 3D Scenes||This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: Detecti...||3d kinect reconstruction indoor depth object recognition||link||2015-03-16||579|
|270||B3DO: Berkeley 3D Object Dataset||For the first few decades of the field's existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with the a...||3d kinect reconstruction indoor depth object recognition||link||2015-03-16||497|
|181||All I Have Seen (AIHS)||The All I Have Seen (AIHS) dataset was created to study the properties of total visual input in humans. For around two weeks, Nebojsa Jojic wore a camera capturin...||video summary user study clustering similarity outdoor indoor scene 3d||link||2013-09-05||600|
|168||Mall Dataset||The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians were label...||detection tracking crowd counting pedestrian indoor video webcam||link||2016-12-06||1287|
|166||ICG Multi-Camera Datasets||The ICG Multi-Camera datasets consist of: Easy Data Set (just one person), Medium Data Set (3-5 persons, used for the experiments), Hard Data Set (crowded sc...||multiview pedestrian tracking detection object camera calibration graz indoor video multitarget||link||2015-06-19||987|
|163||TUGRAZ ICG Longterm Pedestrian Dataset||The Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It is used for adaptive detection and back...||pedestrian change detection background illumination robust indoor coffee graz multitarget||link||2015-06-19||807|
|104||Make3D Depth||The Make3D Depth dataset is designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data: Make3...||depth, learning, single view, outdoor, indoor||link||2013-03-12||1008|
|15||PETS 2006||The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train station scen...||frontview, indoor, pedestrian, detection, tracking, multitarget||link||2015-08-12||896|