DSpace Repository

Optical Character Recognition for Sanskrit Using Convolution Neural Networks

Show simple item record

dc.contributor.author Goyal, Navneet
dc.date.accessioned 2022-12-26T06:31:59Z
dc.date.available 2022-12-26T06:31:59Z
dc.date.issued 2018
dc.identifier.uri https://ieeexplore.ieee.org/document/8395237/authors#authors
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8116
dc.description.abstract Ancient Sanskrit manuscripts are a rich source of knowledge about Science, Mathematics, Hindu mythology, Indian civilization, and culture. It therefore becomes critical that access to these manuscripts is made easy, to share this knowledge with the world and to facilitate further research on this Ancient literature. In this paper, we propose a Convolutional Neural Network (CNN) based Optical Character Recognition system (OCR) which accurately digitizes Ancient Sanskrit manuscripts (Devanagari Script) that are not necessarily in good condition. We use an image segmentation algorithm for calculating pixel intensities to identify letters in the image. The OCR considers typical compound characters (half letter combinations) as separate classes in order to improve the segmentation accuracy. The novelty of the OCR is its robustness to image quality, image contrast, font style and font size, which makes it an ideal choice for digitizing soiled and poorly maintained Sanskrit manuscripts. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject Computer Science en_US
dc.subject Devanagari Script en_US
dc.subject Sanskrit en_US
dc.subject Hindi en_US
dc.subject Deep Learning en_US
dc.subject OCR en_US
dc.subject Optical character recognition en_US
dc.title Optical Character Recognition for Sanskrit Using Convolution Neural Networks en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account