Optical Character Recognition for Sanskrit Using Convolution Neural Networks

dc.contributor.authorGoyal, Navneet
dc.date.accessioned2022-12-26T06:31:59Z
dc.date.available2022-12-26T06:31:59Z
dc.date.issued2018
dc.description.abstractAncient Sanskrit manuscripts are a rich source of knowledge about Science, Mathematics, Hindu mythology, Indian civilization, and culture. It therefore becomes critical that access to these manuscripts is made easy, to share this knowledge with the world and to facilitate further research on this Ancient literature. In this paper, we propose a Convolutional Neural Network (CNN) based Optical Character Recognition system (OCR) which accurately digitizes Ancient Sanskrit manuscripts (Devanagari Script) that are not necessarily in good condition. We use an image segmentation algorithm for calculating pixel intensities to identify letters in the image. The OCR considers typical compound characters (half letter combinations) as separate classes in order to improve the segmentation accuracy. The novelty of the OCR is its robustness to image quality, image contrast, font style and font size, which makes it an ideal choice for digitizing soiled and poorly maintained Sanskrit manuscripts.en_US
dc.identifier.urihttps://ieeexplore.ieee.org/document/8395237/authors#authors
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/8116
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectComputer Scienceen_US
dc.subjectDevanagari Scripten_US
dc.subjectSanskriten_US
dc.subjectHindien_US
dc.subjectDeep Learningen_US
dc.subjectOCRen_US
dc.subjectOptical character recognitionen_US
dc.titleOptical Character Recognition for Sanskrit Using Convolution Neural Networksen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: