Yet Another Computer Vision Index To Datasets (YACVID)

This website provides a list of frequently used computer vision datasets. Wait, there is more!
There is also a description containing common problems, pitfalls and characteristics and now a searchable TAG cloud.
Plus, this is open for crowd editing (if you pass the ultimate turing test)! - Questions? yacvid [at] hayko [dot] at

Content, Design and Idea © by Hayko Riemenschneider, 2011-2016. Texts and Images are subject of copyright by the respective authors.

Hey! If you're reading this, why not help and update the description of the dataset you're working on?

Add a new dataset



2d   3d   4d   aachen   abdomen   abrupt   accelerometer   action   activity   address   adhead   adjustment   aerial   aesthetic   aesthetics   age   aic   aircraft   airplane   airport   alignment   amazon   ambiguous   analysis   and   anger   animal   animation   annotation   anomaly   apartment   api   appearance   applelogo   architecture   articulated   aspect   attention   attribute   attributes   authentication   automatic   autonomous   avoid   axis   babyface   background   balance   baseline   behavior   belgium   benchmark   benchmarking   bike   bilateral   binary   biology   biometric   biometry   blender   blur   body   bone   bottle   boundingbox   brand   bremen   buffy   building   bullseye   bundle   bunny   byu   cad   calibration   caltech   camera   canada   captioning   capture   car   cardinal   categorization   category   celebrity   cell   centered   chair   challenge   change   chemistry   chest   chromaticity   church   circle   cities   city   classification   clothing   clustering   clutter   cnn   co-segmentation   coco   code   codebook   coffee   color   community   comparison   conditions   constancy   context   contour   cooking   copyright   cosegmentation   counting   cover   cow   crepe   cross-view   crowd   ct   cutting   dance   data   dataset   day   decomposition   deep   defocus   deformation   dense   depth   description   descriptor   detail   detection   dichromatic   disgust   disparity   dogs   domain   driving   dubrovnik   duplicate   dynamic   ear   ecocentric   edge   egocentric   ellipses   emotion   endtoend   estimation   evaluation   event   expression   eye   facade   face   facial   fear   feature   field   fine-grained   fingerprints   fingertip   first-person   fish   fisheye   fitting   flickr   flight   floorplan   flow   fly   flying   food   foot   foreground   fov   frames   frontview   fundus   gait   game   gaze   gender   genetic   genome   geography   geometry   geotag   geotagged   germany   gesture   getry   gif   giraffe   gis   global   google   gps   grammar   graphics   graz   ground   groundtruth   group   hand   hands   handwritten   hd   head   heart   heat   hierarchy   high-definition   highlight   highway   holes   horse   human   identification   illumination   image   imagenet   images   imdb   indoor   inertial   initialization   inserts   instance   intake   interaction   interactive   interest   internet   invariance   ir   isar   joy   kernels   keyframe   kimia   kinect   label   labeling   laboratory   landmark   lane   language   large   large-scale   laser   lattice   layout   learning   letter   leuven   lidar   light   lightfield   lighting   limited   line   lisbon   liver   local   localization   location   logo   lowlevel   machine   manhattan   map   mask   match   matching   material   medial   medical   medicine   memorability   mesh   metadata   milling   mirror   mobile   model   modeling   modelling   monitoring   mono   montage   motion   motion-capture-data   motorbike   mouse   movement   movie   mpeg   mug   multi-camera   multi-class   multi-mode   multi-sensor   multi-spectral   multi-view   multilabel   multiple   multitarget   multiview   naming   natural   nature   navigation   network   neutral   newyork   night   noise   normal   nude   number   object   objects   occlusion   ocr   odometry   omnidirection   omnidirectional   open-view   operation   optical   optimization   organ   original   osnabrueck   outdoor   overhead   overlap   oxford   pair   pairwise   panorama   panoramio   parallel   paris   parsing   part   partial   pasadena   pascal   patch   path   pattern   pedestrian   people   person   perspective   phase   photogrammetry   physics   pittsburgh   place   plane   planning   point   pointcloud   polygon   popularity   pornography   pose   presentation   pressure   primitive   privacy   procedural   profile   proposal   ptz   quality   question   radar   randomnoise   rank   ranking   ransac   rate   ratio   re-identification   real   realism   recipe   recognition   reconstruction   rectification   rectified   reflection   registration   regression   regular   reidentification   remote   removal   rendering   repetition   resolution   retina   retinal   retrieval   rgb   rgbd   road   robot   robust   rome   room   rotation   sad   saliency   sampling   sanfrancisco   satellite   scale   scan   scanner   scene   scenes   search   segmentation   semantic   sense   sensing   sequence   sfm   shadow   shadows   shape   shapes   sheffield   shoes   shots   shutter   sideview   sign   similarity   simultaneous   single   singletarget   singleview   skeleton   sketch   skin   sky   slam   soccer   social   software   source   space   spain   sphere   sport   stability   stabilization   static   stationary   stereo   stereovision   stochastic   street   streetside   streetview   structure   structure-from-motion   structured   structures   study   stuff   stylization   subpixel   subtraction   summarization   summary   superresolution   supervised   surface   surgery   surprise   surveillance   swan   switzerland   symmetry   synthetic   table   target   taxonomy   temporal   text   texture   texture-less   therapy   thermal   things   time   time-series   tiny   tool   tools   top-view   tracking   traffic   trajectory   transfer   transportation   triangulation   truth   tuberculosis   type   uas   ultrasound   understanding   uneven   unmanned   unsupervised   urban   user   vanishing   variation   vehicle   vehicles   video   videosurveillance   view   viewpoint   vision   visual   volleyball   vqa   vt   water   wavelength   weakly   wear   wearable   weather   webcam   white   wide   wikipedia   wild   workflow   world   xray   year   zoom   zurich  
«showing 562 tags of 562 total tags for 411 datasets (1.37) »


detection
DID Name Description Tags URL Date Views
385 WildLife Documentary (WLD) Dataset The dataset contains 15 documentary films that are downloaded from YouTube, whose durations vary from 9 minutes to as long as 50 minutes, and the total number o... Video object detection link 2017-06-23 87
380 CERTH Image Blur Dataset The CERTH image blur dataset consists of 2450 digital images, 1850 out of which are photographs captured by various camera models in different shooting conditio... blur motion defocus detection quality image link 2017-05-24 153
369 Nude Detection Dataset — Images (NPDI/DCC/UFMG) The database contains 180 images collected from the Web. If you make use of our database, please cite the following reference: LOPES, Ana; AVILA, Sandra... nude detection, images link 2017-07-28 389
368 Nude Detection Dataset — Videos (NPDI/DCC/UFMG) The database of nude and non-nude videos contains a collection of 179 video segments collected from the following movies: Alpha Dog, Basic Instinct, Before The ... nude detection, video, movie link 2017-09-12 215
364 ETH CVL IMDB WIKI Faces Since the publicly available face image datasets are often of small to medium size, rarely exceeding tens of thousands of images, and often without age informat... face imdb wikipedia detection recognition age biometry link 2017-02-22 305
357 udacity self-driving-car At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing and usef... car robot driving autonomous street urban video recognition detection classification segmentation time synthetic link 2017-03-15 535
356 The Oxford RobotCar Dataset The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset captures ... car robot driving autonomous street urban video recognition detection classification segmentation time year link 2017-01-04 458
348 Global Symmetry AVA Dataset Global Symmetry Ground-truth for AVA dataset Release Date: 2016 For detailed information, please refer to: Elawady, Mohamed, Cécile Barat, Christophe Duc... Global Bilateral Symmetry Detection Aesthetic Reflection Mirror link 2016-11-02 283
347 MOCAT (TUB Multi-Object and Multi-Camera Tracking Dataset) The TU Berlin Multi-Object and Multi-Camera Tracking Dataset (MOCAT) is a synthetic dataset to train and test tracking and detection systems in a virtual world.... synthetic tracking detection multi-class multi-view evaluation pedestrian vehicle animal link 2016-11-02 520
346 LASIESTA (Labeled and Annotated Sequences for Integral Evaluation of SegmenTation Algorithms) LASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving object det... dataset groundtruth motion object detection foreground background subtraction challenge stationary camera link 2017-09-12 353
330 Cityscapes We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high quality... stereo video urban cities semantic segmentation detection car person pedestrian weakly link 2016-07-19 1023
329 Virginia Tech and Arab Academy for Science & Technology (VT-AAST) The VT-AAST Benchmarking Dataset A New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques​. A new color face image database for ... face, detection, skin, segmentation, benchmarking, link 2016-07-11 523
327 PIROPO Database: People in Indoor ROoms with Perspective and Omnidirectional cameras The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor rooms, u... people surveillance perspective omnidirectional fisheye indoor room detection human link 2017-02-16 732
326 Desk3D (Cambridge University) Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., & Cipo... depth instance pose detection link 2016-04-15 516
322 Kendall Square Webcam The Kendall Square webcam dataset consists of two streams for one sunny day and one cloudy day of a city square. It is used for tracking and analyzing color cha... webcam color weather change detection appearance sky link 2016-03-02 652
317 NYU Symmetry Database The mirror symmetry database contains 176 single-symmetry and 63 multyple-symmetry images (.png files) with accompanying ground-truth annotations (.mat files). ... symmetry detection mirror groundtruth link 2016-04-15 450
314 WIDER FACE: A Face Detection Benchmark WIDER FACE dataset is a large-scale face detection benchmark dataset with 32,203 images and 393,703 face annotations, which have high degree of variabilities in... face detection scale pose occlusion link 2016-02-11 942
309 Coutour patches The contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role of Im... patch image match contour edge lowlevel detection segmentation link 2015-09-29 513
307 HandNet annotated hand dataset The HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 214971 a... hand articulation segmentation classification detection pose fingertip rgbd video link 2017-09-12 883
302 CMP map2photo The CMP map2photo dataset consists of 6 pairs, where one image is satellite photo and second image is a map of the same area. The task is to match these images... feature detection description matching map remote sensing wide baseline link 2015-08-13 657
301 CMP Extreme Zoom Dataset The Extreme Zoom Dataset. EZD is a 6 image sets with incleasing zoom factor from general scene view to focusing on single detail. MODS: Fast and Robust Metho... feature detection description matching viewpoint zoom link 2015-07-15 561
300 CMP WxBS dataset The Wide (multiple) Baseline Dataset. 31 image pairs, simultaneously combining several nuisance factors: geometry, illumination, IR-visible, etc. WxBS: Wide ... feature detection description matching viewpoint IR day night link 2015-07-15 888
288 Berkeley Urban Street tracking The UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted on a ca... tracking detection segmentation multitarget recognition video pedestrian urban human link 2015-07-14 1141
287 INRIA Lafarge Benchmarks Some datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting Line-ne... 3d surface reconstruction groundtruth pointcloud object detection line road network urban crowd pedestrian counting link 2015-06-18 854
286 HDA Person Dataset - ISR Lisbon The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestrian De... Video Surveillance Pedestrian Detection Re-Identification Multiview Tracking Benchmark Indoor High-Definition Camera Network lisbon human link 2017-10-02 1667
284 TRANCOS Overlapping Car Crowds The TRaffic ANd COngestionS (TRANCOS) dataset, a novel benchmark for (extremely overlapping) vehicle counting in traffic congestion situations. It consists of 1... object detection car transportation vehicle highway urban spain traffic link 2015-06-16 945
280 Yahoo Flickr Creative Commons 100M Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. All the ... flickr landmark image recognition detection reconstruction 3d clustering social community internet link 2015-09-24 867
279 WWW Crowd The Where Who Why (WWW) dataset provides 10,000 videos with over 8 million frames from 8,257 diverse scenes, therefore offering a superior comprehensive dataset... surveillance crowd pedestrian detection recognition flow optical video link 2015-05-27 995
275 TST fall detection It is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, with diff... action recognition detection depth kinect wearable accelerometer human video link 2017-03-14 904
273 SBMI 2015 Scene Background Initialization (SBI) dataset The SBI dataset has been assembled in order to evaluate and compare the results of background initialization al... change detection background initialization foreground benchmark link 2015-05-02 539
272 Stanford 40 Actions The Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performing the ac... human action recognition detection boundingbox link 2015-06-19 860
263 Crowd Dataset The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense crowd in t... crowd video detection anomaly scene understanding human pedestrian link 2017-09-19 1471
262 PHOS (Evaluating illumination invariance) Phos is a color image database of 15 scenes captured under different illumination conditions. Every scene of the database contains 15 different images: 9 images... Illumination invariance, real lighting conditions, uneven illumination, shadows, feature detection link 2017-03-20 687
257 FaceScrub The FaceScrub dataset comprises a total of 107818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per person. M... face detection recognition celebrity people human link 2014-11-24 884
256 Multi-Task Facial Landmark (MTFL) dataset This dataset contains 12,995 face images which are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose. ... face, landmark detection, deep learning, cnn, attribute link 2015-11-07 1565
254 ChokePoint Dataset We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions using e... human pedestrian identification recognition multiview sequence face detection real world surveillance clustering link 2015-05-02 1139
253 Street View House Number (SVHN) SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatti... streetview number recognition classification urban streetside detection text real world link 2016-08-24 952
252 Volleyball Activity Dataset 2014 This dataset contains 7 challenging volleyball activity classes annotated in 6 videos from professionals in the Austrian Volley League (season 2011/12). A total... action activity sport volleyball detection recognition video analysis link 2017-07-05 1440
248 VIDEO datasets overview Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy spread... video benchmark recognition classification detection object action link 2014-09-30 1100
247 PASCAL VOC Parts The PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask for 2... detection recognition pascal object part pedestrian human segmentation semantic link 2014-09-30 1210
242 Stanford Dogs Dataset The Stanford Dogs dataset contains images of 120 breeds of dogs from around the world. This dataset has been built using images and annotation from ImageNet for... classification, detection, fine-grained categorization, dogs link 2015-07-29 1276
240 Microsoft COCO The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features: Mo... object context segmentation detection recognition benchmark semantic link 2015-05-02 1445
239 CUHK crowd dataset CUHK crowd dataset introduces the largest publicly available crowd dataset of 474 videos from 215 crowded scenes. It has been used in the paper: Scene-Ind... crowd analysis, group detection and analysis, scene understanding link 2016-09-14 1271
232 Pratheepan Human Skin Detection Dataset The images in this dataset are downloaded randomly from Google for human skin detection research. It has been used in the paper: W.R. Tan, C.S. Chan, Y. Prathee... skin detection, skin segmentation, human detection, skin dataset link 2017-09-14 2341
227 Omnidirectional and panoramic image dataset We share our omnidirectional and panoramic image dataset (with annotations) to be used for human and car detection. Please reach through: http://cvrg.iyte.edu.... panorama detection car omnidirection human recognition link 2017-01-13 1257
225 California-ND An Annotated Dataset For Near-Duplicate Detection In Personal Photo Collections Managing photo collections involves a variety of image quality assessment tas... retrieval duplicate copyright groundtruth detection link 2014-03-19 705
224 CMP Extreme View Dataset 15 wide baseline stereo image pairs with large viewpoint change, provided ground truth homographies. Image size (~1000x700 pixels, RGB) D. Mishkin and M. ... feature detection description matching viewpoint link 2015-07-15 920
222 Ford Car Dataset The Ford Car dataset is joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects). ... car detection lidar 3d groundtruth sfm link 2014-04-16 1817
221 EPFL Multi-View Cars Th EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using the time o... pose multiview car detection estimation rotation link 2014-02-10 997
217 Youtube-Objects dataset The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 videos for... video object detection segmentation flow optical link 2014-02-03 1016
216 CVC Partial Occlusion Virtual Pedestrian The CVC Partial Occlusion Virtual Pedestrian datasets (CVC-01 to CVC-06) cover a range of scenarios of occluded pedestrians generated in a virtual and real envi... detection classification tracking pedestrian synthetic urban occlusion link 2016-03-15 1276
213 ChairGest Gestures ChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 Xsens Ine... benchmark recognition kinect gesture detection human link 2014-06-06 651
210 Traffic Video dataset The Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be downloaded ... urban traffic tracking detection overhead view road video link 2014-02-03 2655
201 50 Salads The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities correspo... action activity recognition classification detection tracking video link 2013-10-05 874
199 THUR15000 We introduce a labeled dataset of categorized images for evaluating sketch based image retrieval. Using Flickr, we downloaded about 3000 images for each of the ... group saliency object detection visual attention sketch shape retrieval internet link 2013-10-08 915
198 THUS10000 The THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotated with p... segmentation saliency object detection visual attention link 2015-01-11 1053
193 City planar and non-planar The city planar and non-planar datset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (University)... plane detection 3d urban building estimation link 2013-09-23 702
190 Daimler Mono Pedestrian Detection Benchmark The Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (image cut-o... pedestrian detection outdoor urban mono scale object link 2013-09-18 941
188 KTH Multiview Football The KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body joints. ... multiview pedestrian tracking detection object camera outdoor game soccer pose recognition multitarget link 2016-09-18 1356
187 Aspect Layout dataset The Aspect Layout dataset is designed to allow evaluation of object detection for aspect ratios in perspective images. Author text: In this project we see... detection object aspect ratio perspective layout link 2013-09-06 609
182 MSR Action The MSR Action datasets is a collection of various 3D datasets for action recognition. See details http://research.microsoft.com/en-us/um/people/zliu/action... video action recognition detection reconstruction 3d link 2013-09-05 926
169 QMUL Junction Dataset The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames) Fra... detection tracking crowd counting pedestrian video motion behavior link 2016-12-06 1331
168 Mall Dataset The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians were label... detection tracking crowd counting pedestrian indoor video webcam link 2016-12-06 1593
166 ICG Multi-Camera Datasets The ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowded sc... multiview pedestrian tracking detection object camera calibration graz indoor video multitarget link 2015-06-19 1172
165 ICG Multi-Camera and Virtual PTZ The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from a sph... multiview pedestrian tracking detection object camera calibration graz network video panorama crowd outdoor multitarget link 2017-08-19 1262
164 ICG Lab 6 (Multi-Camera Multi-Object Tracking) The ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1347 came... multiview pedestrian tracking detection object laboratory camera calibration evaluation segmentation graz link 2017-09-18 1711
163 TUGRAZ ICG Longterm Pedestrian Dataset The Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It used for adaptive detection and back... pedestrian change detection background illumination robust indoor coffee graz multitarget link 2015-06-19 990
161 ICG Annotated Facial Landmarks in the Wild (AFLW) The Annotated Facial Landmarks in the Wild (AFLW) consists of a large-scale collection of annotated face images gathered from the web, exhibiting a large variet... face detection landmark pose age annotation link 2017-07-25 2052
160 Caltech Lanes Dataset The Caltech Lanes dataset includes four clips taken around streets in Pasadena, CA at different times of day. The archive below includes 1225 individual frame... urban road lane detection caltech pasadena link 2013-08-08 980
157 Background Models Challenge (BMC) Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern: -... background modeling change motion detection surveillance video segmentation link 2016-02-24 1383
151 People in WBCN This dataset is for people tracking in wide baseline camera networks and was designed as a contest at ICPR 2012. The contest consists of two challenges: ... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-08-02 1264
150 SDHA Contest The Semantic Description of Human Activities (SDHA) was a contest at ICPR 2010. The contest is composed of three different types of activity recognition cha... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-07-31 934
147 FlickrLogos-32 The FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval methods... flickr, logo, detection, retrieval, image, object recognition, machine learning, classification brand boundingbox link 2017-05-27 1147
142 German Traffic Sign Recognition Benchmark The German Traffic Sign Recognition Benchmark is a dataset for multi-class detection problem in natural images and do cordially invite you to participate. The b... detection, traffic, urban, recognition link 2016-08-15 1455
138 Buffy The Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for training... segmentation, detection, buffy, movie, human link 2015-02-07 715
113 Penn-Fudan Pedestrian Penn-Fudan Pedestrian Detection and Segmentation... pedestrian detection segmentation background motion link 2013-08-08 954
107 BIWI Pedestrians We provide the three datasets used for testing our system for our ICCV 2007 publication, including annotations. Data was recorded using a pair of AVT Marlins mo... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion link 2013-03-12 1119
106 BIWI Walking Pedestrians (EWAP) The BIWI Walking Pedestrians (EWAP) dataset shows walking pedestrians in busy scenarios from a bird eye view. Manually annotated. Data used for training in our ... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion, aerial link 2013-08-02 1597
99 BSDS500 This new dataset is an extension of the BSDS300, where the original 300 images are used for training / validation and 200 fresh images, together with human anno... segmentation, edge, contour, detection link 2013-03-12 854
98 BSDS300 The goal of this work is to provide an empirical basis for research on image segmentation and boundary detection. To this end, we have collected 12,000 hand-la... segmentation, edge, contour, detection link 2013-03-12 857
95 Stroke Width Transform Text Stroke Width Transform Text dataset is by Boris Epstein and consists of 307 images and XXX text instances. Detecting Text in Natural Scenes with Stroke Wid... text, detection, recognition, classification link 2015-04-24 988
94 Chars74K The Chars74K dataset consists of 64 classes (0-9, A-Z, a-z), 7705 characters obtained from natural images, 3410 hand drawn characters using a tablet PC, 62992 s... text, detection, recognition, classification link 2017-08-03 1465
93 Street View Text The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challenging becaus... text, detection, recognition, classification, outdoor, urban link 2014-01-13 1066
92 ICDAR 2011 This challenge is set up around three tasks: Text Localisation, Text Segmentation and Word Recognition. Participation in any or all tasks is welcome. Check the ... text, detection, recognition, classification link 2016-06-01 792
91 ICDAR 2003 The ICDAR 2003 datasets available for download on this site: Robust Reading , Robust Word Recognition , Robust OCR , Text Locating and Cursive Script . Pleas... text, detection, recognition, classification link 2017-08-15 975
88 Change Detection The dataset folder contains 7 folders (one for each category). Each category folder contains 4 to 6 folders (one for each video). Each video folder contains: ... change, detection, background link 2013-03-13 714
86 ICG Graz240 The ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult due to ... segmentation, detection, semantic, urban, graz link 2016-03-29 957
79 LabelMe The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by visitin... segmentation, semantic, outdoor, detection, urban, software link 2013-03-14 805
78 Caltech Pedestrian The Caltech Pedestrian Dataset consists of approximately 10 hours of 640x480 30Hz video taken from a vehicle driving through regular traffic in an urban environ... pedestrian, detection, urban link 2016-06-07 1754
77 Daimler Stereo Pedestrian Daimler Stereo Pedestrian Detection Benchmark C. Keller, M. Enzweiler, and D. M. Gavrila, A New Benchmark for Stereo-based Pedestrian Detection, Proc. of th... pedestrian, detection, urban link 2013-03-13 768
76 Daimler Pedestrian Classification Daimler Multi-Cue, Occluded Pedestrian Classification Benchmark Training and test samples have a resolution of 48 x 96 pixels with a 12-pixel border around t... detection, classification, pedestrian, urban link 2013-03-11 834
75 ETHZ Shape The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-class ... shape, detection, matching, segmentation, clutter, applelogo, bottle, giraffe, nature, swan, mug link 2014-02-11 815
68 The KITTI Vision Benchmark Suite We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest are: ste... stereo, depth, flow, detection tracking, reconstruction, sfm, odometry, segmentation, semantic car depth link 2014-02-10 1293
62 Deformed Lattice Detection The Deformed Lattice Detection In Real-World Images dataset is used for regular grid detection. The authors have developed a robust and fast lattice detection a... texture, segmentation, symmetry, lattice, detection, urban link 2013-03-11 765
60 PSU HUB The PSU HUB dataset is a detection, tracking dataset. Ground truth trajectory and grouping information for pedestrians walking in the PSU student union building... detection, tracking, pedestrian, trajectory, crowd, overlap, occlusion link 2013-07-19 991
58 INRIA Horses The INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and cluttered ba... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 761
57 Weizmann Horses The multi-scale Weizmann horses (originally from Eran Borenstein, adapted by Jamie Shotton) consists of 656 images which is split into 50+50training, 50+50 vali... detection, shape, segmentation, clutter, nature, horse link 2013-03-11 1051
56 ETHZ Extended Shape The ETHZ Extended Shape classes dataset from Konrad Schindler is larger dataset of shape categories, created by merging ETHZ shape classes with Konrad Schindler... detection, shape, segmentation, clutter link 2013-03-11 724
53 DTU Robot The DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For each s... feature, detection, description, matching, sfm, reconstruction, illumination link 2016-05-15 836
52 Graffiti The Graffiti dataset by Krystian Mikolajczyk and Cordelia Schmid contains 48 images split into 8 sequences with 6 images each showing different structured and t... feature, detection, description, rectification, benchmark link 2017-02-23 814
29 The Yale Face The Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (leading to... face, pedestrian, detection, pose, illumination link 2015-06-23 801
28 CMU Faces - Frontal faces The MIT + CMU frontal face dataset from H. Rowley contains 130 images with 507 labeled frontal faces from movie, portrait and media sources. It is mostly graysc... frontview, face, detection object boundingbox link 2015-06-19 819
25 PASCAL VOCs The PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. The late... detection segmentation pose pedestrian chair animal car building airplane link 2017-03-09 997
24 UIUC Cars This UIUC Cars dataset by Shivani Agarwal, Aatif Awan and Dan Roth contains images of side views of cars for use in evaluating object detection algorithms. The ... car, sideview, detection, scale, recognition, urban, scale link 2013-10-08 1046
23 Graz02 The Graz02 dataset by Andreas Opelt and Axel Pinz contains four categories of images: bikes, people, cars and a single background class. The annotation has been... bike, pedestrian, background, detection, clutter, car, graz link 2014-04-24 1047
22 Graz01 The Graz01 dataset by Andreas Opelt and Axel Pinz contains four types of images: bikes, people, background with no bikes, background with no people.... bike, pedestrian, background, detection, clutter, graz, occlusion link 2013-08-08 1069
18 Leeds Cows The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of different backg... detection segmentation cow video background animal link 2013-08-08 901
17 TUD Motorbike The TUD Motorbike dataset from Bastian Leibe contains 115 images collected from the internet. Each image contains one or more motorbikes at different scales and... motorbike, detection, pascal link 2013-08-08 950
16 PETS 2009 The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for person coun... frontview, outdoor, pedestrian, detection, tracking, overlap, occlusion multitarget, human link 2015-06-19 1350
15 PETS 2006 The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train station scen... frontview, indoor, pedestrian, detection, tracking, multitarget link 2015-08-12 1079
14 INRIA People The INRIA People dataset from Navneet Dalal and Bill Triggs [DalalCVPR2005] consists of training and testing data. The training contains 1805 images and X peopl... detection, pedestrian, sideview, frontview, human, boundingbox link 2015-06-19 1285
13 CBCL / MIT Pedestrian MIT Pedestrian dataset from Papageorgiou and Poggio [IJCV2000] contains 509 training and 200 test images of pedestrians in city scenes (plus left-right reflecti... pedestrian, frontview, detection, urban, people, boundingbox link 2015-06-19 965
9 TUD Crossing tracking The TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with significant va... tracking detection segmentation multitarget pedestrian sideview overlap urban link 2015-06-19 1857


total views: 112565 5 queries in 0.00012898445129395s 0.00011897087097168s 0.00017380714416504s 0.00011181831359863s 0.0014679431915283s and total 0.0081160068511963s