This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at
Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.
Hey! If you're reading this, why not help and update the description of the dataset you're working on?
Add a new dataset
«showing 641 tags of 641 total tags for 459 datasets (1.4) »
|443||ApolloScape Semantic Segmentation||The ApolloScape Parsing dataset is provided by Baidu for the CVPR 2018 Workshop on Autonomous Driving Challenge. It is expected that the Scene Parsing dataset ...||segmentation semantic scene benchmark size urban autonomous driving camera calibration||link||2018-04-25||122|
|346||LASIESTA (Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms)||LASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving object det...||dataset groundtruth motion object detection foreground background subtraction challenge stationary camera||link||2017-09-12||537|
|332||Multi-FoV - Large Field-of-View Cameras for Visual Odometry||The Multi-FoV synthetic datasets are two synthetic scenes (vehicle moving in a city, and flying robot hovering in a confined room). For each scene, three differ...||visual odometry camera fov synthetic groundtruth blender||link||2016-08-11||667|
|286||HDA Person Dataset - ISR Lisbon||The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De...||Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human||link||2017-10-02||2190|
|226||Fish4Knowledge||The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and extracted...||classification animal fish video motion nature recognition water camera||link||2014-05-15||1158|
|215||WILD -Weather and Illumination Database||The Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seasons. It...||webcam light illumination camera video static change urban time depth estimation weather newyork||link||2016-04-19||1451|
|214||The Webcam Clip Art Dataset||This is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequences. As...||webcam light illumination camera video static change urban nature time||link||2014-02-01||941|
|205||GaTech VideoStab||The GaTech VideoStab dataset consists of N videos for the task of video stabilization. This code is implemented in Youtube video editor for stabilization. ...||video stabilization camera path||link||2013-10-09||1096|
|204||UCF Person and Car VideoSeg||The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big car, sm...||video segmentation object motion model camera groundtruth||link||2015-04-19||1212|
|203||GaTech VideoSeg||The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation annotat...||video segmentation object motion model camera||link||2013-10-09||1193|
|202||GaTech SegTrack||The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura...||video segmentation object proposal flow optical motion model camera stationary groundtruth||link||2013-10-09||1067|
|195||Yotta||The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driving. ...||semantic segmentation urban video camera 3d reconstruction classification||link||2013-09-30||1071|
|188||KTH Multiview Football||The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ...||multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget||link||2016-09-18||1727|
|185||Kung-Fu fighter Multi-View||The test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for testing...||multiview tracking segmentation camera action||link||2013-10-08||1077|
|180||Airport MotionSeg||The Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is challenging b...||motion segmentation airport video clustering camera zoom||link||2013-09-04||1152|
|166||ICG Multi-Camera Datasets||The ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowded sc...||multiview pedestrian tracking detection object camera calibration graz indoor video multitarget||link||2015-06-19||1420|
|165||ICG Multi-Camera and Virtual PTZ||The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph...||multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget||link||2017-08-19||1534|
|164||ICG Lab 6 (Multi-Camera Multi-Object Tracking)||The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came...||multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz||link||2017-12-05||2121|
|156||KUL Belgium Traffic Signs||BelgiumTS is a large dataset with 10000+ traffic sign annotations, thousands of physically distinct traffic signs. 4 video sequences recorded with 8 high resolu...||traffic sign classification urban road belgium camera calibration||link||2017-11-28||1513|
|105||MSR 3D Video||These sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitnick, ...||reconstruction, camera, segmentation, depth||link||2013-03-12||1040|