arXiv:1711.07245 Abstract | arXiv Analytics

arXiv:1711.07245 [cs.CV]Abstract References Reviews Resources

Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application

Konkimalla Chandra Prakash, Y. M. Srikar, Gayam Trishal, Souraj Mandal, Sumohana S. Channappayya

Published 2017-11-20Version 1

Telugu is a Dravidian language spoken by more than 80 million people worldwide. The optical character recognition (OCR) of the Telugu script has wide ranging applications including education, health-care, administration etc. The beautiful Telugu script however is very different from Germanic scripts like English and German. This makes the use of transfer learning of Germanic OCR solutions to Telugu a non-trivial task. To address the challenge of OCR for Telugu, we make three contributions in this work: (i) a database of Telugu characters, (ii) a deep learning based OCR algorithm, and (iii) a client server solution for the online deployment of the algorithm. For the benefit of the Telugu people and the research community, we will make our code freely available at https://gayamtrishal.github.io/OCR_Telugu.github.io/

Comments: Submitted to NCC 2018

Categories: cs.CV

Keywords: optical character recognition, application, million people worldwide, client server solution, dravidian language spoken

Related articles: Most relevant | Search more

arXiv:1603.02466 [cs.CV] (Published 2016-03-08)

A non-extensive entropy feature and its application to texture classification

Seba Susan, Madasu Hanmandlu

arXiv:1702.03515 [cs.CV] (Published 2017-02-12)

Sparse Representation based Multi-sensor Image Fusion: A Review

Qiang Zhang, Yi Liu, Rick S. Blum, Jungong Han, Dacheng Tao

arXiv:1301.2351 [cs.CV] (Published 2013-01-10)

Application of Hopfield Network to Saccades