Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   accuracy   action   activity   address   adhead   adjustment   adult   aerial   aesthetics   affordance   age   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulation   artificial   aspect   atmospheric   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   berlin   bike   bilateral   bim   binary   biology   biometric   biometry   blender   blur   boat   body   bone   bottle   boundingbox   brain   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   california   caltech   camera   canada   caption   captioning   capture   car   cardinal   categorization   category   cats   cbir   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chicaco   chromaticity   church   circle   city   cityscapes   classification   clothing   cloud   clustering   clutter   cnn   co-localization   co-saliency   co-segmentation   co-skeletonization   coco   code   codebook   coffee   collaborative   color   community   comparison   computer   condition   constancy   context   contour   cooking   copyright   counting   cover   cow   crepe   crf   crop   cross-view   crowd   ct   cutting   daily   dance   dark   data   dataset   day   daylight   decomposition   deep   defocus   deformation   denoising   dense   depth   description   descriptor   detail   detection   dichromatic   disease   disgust   disparity   dogs   domain   dped   driving   drone   dubrovnik   duplicate   dynamic   ear   edge   egocentric   ellipse   emotion   empty   endtoend   enhancement   environment   estimation   evaluation   event   expertise   expression   eye   facade   face   facial   fake   fashion   fear   feature   field   fine-grained   fingerprint   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   fog   food   foot   footprint   foreground   fov   frames   frontview   fundus   gait   game   gan   gaze   gender   genetic   genome   geography   geometry   geoscience   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   grayscale   graz   ground   groundtruth   group   growth   gsd   hand   handwritten   hd   head   heart   heat   hierarchy   high-definition   high-resolution   highlight   highway   holes   horse   house   howto   human   identification   illumination   illuminiation   illusion   image   imagenet   images   imdb   imu   indoor   inertial   initialization   inserts   instance   intake   intensity   interaction   interactive   interest   internet   invariance   ir   isar   iso   joy   kaggle   kernels   keyframe   kimia   kinect   kitchen   kitti   label   labeling   laboratory   land   landmark   lane   language   large   large-scale   laser   lattice   layout   leaf   learning   letter   leuven   lidar   lifespan   light   lightfield   lighting   limited   line   lip   lisbon   liver   local   localization   location   logo   low   lowlevel   machine   makeup   manhattan   map   maritime   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   monitoring   mono   montage   motion   motorbike   mouse   mouth   movement   movie   mpeg   mser   mug   multi-camera   multi-class   multi-human   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multimedia   multimodal   multiple   multispectral   multitarget   multiview   naming   natural   nature   navigation   netherlands   network   neutral   newyork   night   nir   noise   normal   nude   number   object   occlusion   ocr   odometry   omnidirection   omnidirectional   online   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   pan   panchromatic   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   pedestrians   people   person   perspective   phase   photo   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   polygon   popularity   pornography   pose   potsdam   presentation   pressure   primitive   privacy   procedural   profile   project   proposal   pruning   ptz   quality   question   radar   random   rank   ranking   ransac   rate   ratio   re-identification   reading   real   real-world   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   remote   removal   rendering   repetition   resolution   restoration   retina   retinal   retrieval   rgb   rgbd   road   robot   robotic   robust   rome   room   ros   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   search   segmentation   selfdriving   semantic   sense   sensing   sequence   series   sfm   shadow   shape   sheffield   shoes   shots   shutter   sideview   sign   signs   similarity   simultaneous   single   singleview   size   skeleton   skeletonization   sketch   skin   sky   slam   smartphone   soccer   social   software   source   space   spain   spanish   speaker   speech   speed   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   structure   structured   study   stuff   style   stylization   subpixel   subtraction   summarization   summary   superpixel   superresolution   supervised   supervisely   surface   surgery   surprise   surveillance   swan   switzerland   sydney   symmetry   synthetic   table   target   taxonomy   temporal   text   textile   texture   texture-less   therapy   thermal   things   time   timelapse   tiny   tokyo   tool   tools   top-view   topcoder   tracking   tracklet   traffic   trajectory   transfer   transportation   trees   triangulation   truth   tuberculosis   turbulence   type   uas   uav   udacity   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   vessel   video   view   viewpoint   virtual   visible   vision   visual   voc   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wiki   wikipedia   wild   workflow   world   worldwide   xray   year   youtube   zoom   zurich  
«showing 653 tags of 653 total tags for 463 datasets (1.41) »


outdoor
DID Name Description Tags URL Date Views
355 IMPART multi-modal/multi-view The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The sourc... multi-view multi-mode video rgbd lidar 3d model color indoor outdoor dynamic action face human emotion link 2017-01-01 560
269 Daimler Urban Segmentation Dataset The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs with a r... semantic segmentation outdoor urban stereo motion link 2015-06-26 1516
260 Eurasian Cities dataset The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing points ... vanishing line point geometry pose urban reconstruction outdoor manhattan link 2018-01-11 1226
251 ETHZ CVL RueMonge 2014 This ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] and p... semantic segmentation 3d reconstruction architecture paris benchmark source code urban recognition classification outdoor pointcloud mesh link 2014-11-24 1661
212 Polo Instance Segmentation The Polo instance segmentation dataset is a semantic segmentation task for Hough transform based segmentation masks. It consists of supervised segmentation for ... semantic segmentation horse human outdoor mask scene understanding n/a 2016-01-21 1155
206 GaTech VideoContext The GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context evaluation i... video geometry context classification semantic segmentation unsupervised supervised outdoor urban nature link 2014-04-06 1137
191 Daimler Mono Pedestrian Classification Benchmark The Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedestrian- a... pedestrian classification outdoor urban object scale illumination link 2013-09-18 1057
190 Daimler Mono Pedestrian Detection Benchmark The Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (image cut-o... pedestrian detection outdoor urban mono scale object link 2013-09-18 1181
188 KTH Multiview Football The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ... multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget link 2018-06-28 1815
181 All I Have Seen (AIHS) The All I Have Seen (AIHS) dataset is created to study the properties of total visual input in humans, for around two weeks Nebojsa Jojic wore a camera capturin... video summary user study clustering similarity outdoor indoor scene 3d link 2013-09-05 979
165 ICG Multi-Camera and Virtual PTZ The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph... multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget link 2017-08-19 1582
117 YorkUrbanDB The York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the campus of... vanishing, point, pose, urban, reconstruction, outdoor, geometry, manhattan link 2013-09-18 884
104 Make3D Depth The Make3D Depth dataset s designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data: Make3... depth, learning, single view, outdoor, indoor link 2018-03-16 1595
100 Sowerby The Sowerby dataset contains 105 images for semantic segmentation.... semantic, segmentation, outdoor n/a 2014-09-26 1090
93 Street View Text The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challenging becaus... text, detection, recognition, classification, outdoor, urban link 2014-01-13 1349
89 Corel Photo Gallery This image database is a part of the "Corel Gallery Magic" (commercial product). It contains 80000 images divided into 800 categories of 100 images. These image... semantic, segmentation, outdoor n/a 2017-01-19 984
81 Zurich Hoengg Zurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the format is Ti... aerial, semantic, segmentation, outdoor link 2013-03-11 1019
79 LabelMe The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by visitin... segmentation, semantic, outdoor, detection, urban, software link 2013-03-14 1025
72 Acute3D Aiguille du Midi MVS Aiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 2006) ... sfm, reconstruction, mesh, large scale, outdoor link 2013-03-21 1111
69 HCI Robust Vision Estimate robust and reliable depth or motion fields on our challenging real world videos! ... flow, depth, stereo, outdoor link 2016-09-08 1110
37 MSRC vNIPS The MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Connected... segmentation, semantic, outdoor link 2013-03-11 890
36 MSRC v2 The MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accurate pixe... segmentation, semantic, outdoor link 2016-08-28 2411
35 MSRC v1 The MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is commonl... segmentation, semantic, outdoor link 2016-09-07 1889
16 PETS 2009 The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for person coun... frontview, outdoor, pedestrian, detection, tracking, overlap, occlusion multitarget, human link 2015-06-19 1680


total views: 30906 5 queries in 0.00012898445129395s 9.8943710327148E-5s 0.0001678466796875s 0.0001068115234375s 0.0012061595916748s and total 0.007133960723877s