arXiv Analytics

arXiv:1711.07245 [cs.CV]

Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application

Konkimalla Chandra Prakash, Y. M. Srikar, Gayam Trishal, Souraj Mandal, Sumohana S. Channappayya

Published 2017-11-20, Version 1

Telugu is a Dravidian language spoken by more than 80 million people worldwide. Optical character recognition (OCR) of the Telugu script has wide-ranging applications, including education, healthcare, and administration. The beautiful Telugu script, however, is very different from the Latin script used by Germanic languages such as English and German. This makes applying transfer learning from OCR solutions built for those languages to Telugu a non-trivial task. To address the challenge of OCR for Telugu, we make three contributions in this work: (i) a database of Telugu characters, (ii) a deep-learning-based OCR algorithm, and (iii) a client-server solution for the online deployment of the algorithm. For the benefit of the Telugu people and the research community, we will make our code freely available at https://gayamtrishal.github.io/OCR_Telugu.github.io/
