DSpace Repository

Comparative study of preprocessing and classification methods in character recognition of natural scene images

Show simple item record

dc.contributor.author Sinha, Yash
dc.date.accessioned 2025-08-25T10:44:40Z
dc.date.available 2025-08-25T10:44:40Z
dc.date.issued 2025-01
dc.identifier.uri https://link.springer.com/chapter/10.1007/978-81-322-2625-3_11
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/19230
dc.description.abstract This paper presents an approach to character recognition in natural scene images. Recognizing such text is a challenging problem in the field of Computer Vision, more than the recognition of scanned documents due to several reasons. We propose a classification technique for classifying characters based on a pipeline of image processing operations and ensemble machine learning techniques. This pipeline tackles problems where Optical Character Recognition (OCR) fails. We present a framework that comprises a sequence of operations such as resizing, grey scaling, thresholding, morphological opening and median filtering on the images to handle background clutter, noise, multi-sized and multi-oriented characters and variance in illumination. We used image pixels and HOG (Histogram of Oriented Gradients) as features to train three different models based on Nearest-Neighbour, Random Forest and Extra Tree classifiers. When the input images were pre-processed, HOG features were extracted and fed into extra tree classifier, and the model classified the characters with maximum accuracy, among the other models that we tested. The proposed steps have been experimentally proven to yield better accuracy than the present state-of-the-art classification techniques on the Chars74k dataset. In addition, the paper includes a comparative study elaborating on various image processing operations, feature extraction methods and classification techniques. en_US
dc.language.iso en en_US
dc.publisher Springer en_US
dc.subject Computer Science en_US
dc.subject Character recognition en_US
dc.subject Natural scene text en_US
dc.subject Optical character recognition (OCR) en_US
dc.subject Image preprocessing en_US
dc.subject Histogram of oriented gradients (HOG) en_US
dc.title Comparative study of preprocessing and classification methods in character recognition of natural scene images en_US
dc.type Book chapter en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account