Description (include details on usage, files and paper references) | The Chars74K dataset consists of 64 classes (0-9, A-Z, a-z), 7705 characters obtained from natural images, 3410 hand drawn characters using a tablet PC, 62992 synthesised characters from computer fonts. This gives a total of over 74K images (which explains the name of the dataset).
In the English language, Latin script (excluding accents) and Hindu-Arabic numerals are used. For simplicity we call this the English characters set.
T. E. de Campos, B. R. Babu and M. Varma. Character recognition in natural images. In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP), Lisbon, Portugal, February 2009.
Bibtex | Abstract | PDF |
|