Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
Each dataset also has a description covering common problems, pitfalls and characteristics, and there is now a searchable tag cloud.
Plus, this is open for crowd editing (if you pass the ultimate Turing test)! Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and images are subject to copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?




2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   activity   address   adhead   adjustment   aerial   aesthetic   aesthetics   age   aic   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulated   aspect   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   blur   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   caltech   camera   canada   captioning   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chromaticity   church   circle   cities   city   classification   clothing   clustering   clutter   cnn   co-segmentation   coco   code   codebook   coffee   color   community   comparison   conditions   constancy   context   contour   cooking   copyright   cosegmentation   counting   cover   cow   crepe   cross-view   crowd   ct   cutting   dance   data   dataset   day   decomposition   deep   defocus   deformation   dense   depth   description   descriptor   detail   detection   dichromatic   disgust   disparity   dogs   domain   driving   dubrovnik   duplicate   dynamic   ear   ecocentric   edge   egocentric   ellipses   emotion   endtoend   estimation   evaluation   event   expression   eye   facade   face   facial   fear   feature   field   fine-grained   fingerprints   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   foreground   fov   frames   frontview   fundus   gait   game   gaze   gender   genetic   genome   geography   geometry   geotag   geotagged   germany   
gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   groundtruth   group   hand   hands   handwritten   hd   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   human   identification   illumination   image   imagenet   images   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   kernels   keyframe   kimia   kinect   label   labeling   laboratory   landmark   lane   language   large   large-scale   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motion-capture-data   motorbike   mouse   movement   movie   mpeg   mug   multi-camera   multi-class   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multiple   multitarget   multiview   naming   natural   nature   navigation   network   neutral   newyork   night   noise   normal   nude   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   people   person   perspective   phase   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   polygon   popularity   pornography   pose   presentation   pressure   primitive   privacy   procedural   profile   proposal   ptz   quality   question   radar   randomnoise   rank   ranking   ransac   
rate   ratio   re-identification   real   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   reidentification   remote   removal   rendering   repetition   resolution   retina   retinal   retrieval   rgb   rgbd   road   robot   robust   rome   room   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   scenes   search   segmentation   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   shapes   sheffield   shoes   shots   shutter   sideview   sign   similarity   simultaneous   single   singletarget   singleview   skeleton   sketch   skin   sky   slam   soccer   social   software   source   space   spain   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   streetside   streetview   structure   structure-from-motion   structured   structures   study   stuff   stylization   subpixel   subtraction   summarization   summary   superresolution   supervised   surface   surgery   surprise   surveillance   swan   switzerland   symmetry   synthetic   table   target   taxonomy   temporal   text   texture   texture-less   therapy   thermal   things   time   time-series   tiny   tool   tools   top-view   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   video   videosurveillance   view   viewpoint   vision   visual   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   workflow   world   xray   year   zoom   zurich  
«showing 562 tags of 562 total tags for 411 datasets (1.37 tags per dataset)»


flow
DID | Name | Description | Tags | URL | Date | Views
279 | WWW Crowd | The Where Who Why (WWW) dataset provides 10,000 videos with over 8 million frames from 8,257 diverse scenes, therefore offering a superior comprehensive dataset... | surveillance, crowd, pedestrian, detection, recognition, flow, optical, video | link | 2015-05-27 | 992
249 | Image Sequence Analysis Test Site (EISATS) | The .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based driver as... | stereo, vision, optical, flow, motion, analysis, semantic, segmentation | link | 2014-09-30 | 1008
218 | VidPairs | The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same scene wi... | video, pair, matching, patch, description, flow, dense, optical | link | 2015-06-19 | 711
217 | Youtube-Objects dataset | The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for... | video, object, detection, segmentation, flow, optical | link | 2014-02-03 | 1014
202 | GaTech SegTrack | The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for accura... | video, segmentation, object, proposal, flow, optical, motion, model, camera, stationary, groundtruth | link | 2013-10-09 | 827
80 | Hopkins 155 | The Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It contains... | flow, stereo, motion, segmentation, urban | link | 2015-04-01 | 1014
71 | University of Tsukuba Stereo Flow | This dataset contains 1800 stereo pairs with ground truth disparity maps, occlusion maps and discontinuity maps that will help to further develop the state of t... | flow, depth, stereo, graphics, synthetic, optical | link | 2013-08-08 | 1294
70 | MPI Sintel Flow | The MPI Sintel Flow dataset is an optical flow / stereo dataset based on the Blender movie Sintel: http://www.sintel.org The goal of this project was to crea... | flow, depth, stereo, graphics, synthetic | link | 2013-08-08 | 832
69 | HCI Robust Vision | Estimate robust and reliable depth or motion fields on our challenging real world videos! ... | flow, depth, stereo, outdoor | link | 2016-09-08 | 864
68 | The KITTI Vision Benchmark Suite | We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste... | stereo, depth, flow, detection, tracking, reconstruction, sfm, odometry, segmentation, semantic, car, depth | link | 2014-02-10 | 1287
65 | Middlebury Stereo | Two-frame stereo depth estimation... | flow, depth, stereo | link | 2014-03-06 | 832
64 | Middlebury Flow | ... | flow, depth | link | 2013-03-11 | 678


total views: 11353