
Character segmentation of Assamese printed and handwritten words using classifier-based sliding window technique

Amlan Jyoti Basumatari

Abstract


Development of optical character recognition for Indian scripts has been an active area of research. Despite an ample amount of independent research, only a few commercial applications are available. The reason is the complex nature of these scripts, which leads to poor segmentation accuracy even when isolated character recognition accuracy is very high. This paper explores the area of character segmentation and proposes an innovative character segmentation scheme for Assamese word images, both printed and handwritten, which uses a sliding-window mechanism aided by a classifier. The method extends the conventional role of Support Vector Machine (SVM) classifiers and makes them useful for segmentation as well. Here, a small window is passed over the word image and the word segment inside the current window is fed to the trained SVM. Segmentation points are determined from the probability estimate given by the SVM. When the probability estimate is higher than a predefined threshold, it is assumed that the current window holds a segmentation point. Otherwise, the window size is incremented and the enlarged segment is fed to the SVM again. This process is repeated until the window has passed over the entire word. When tested on self-made datasets, the system achieved character-level accuracies of 87.48% and 82.07% for printed and handwritten words, respectively. The technique fails when slanted characters are present.
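
The window-growing loop described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the paper's implementation: the function names, the threshold of 0.8, the minimum window width, the growth step, and the rule that the window restarts at the most recent cut are all assumptions, since the abstract does not specify them. The `toy_prob` scorer is a hypothetical stand-in for the trained SVM's probability estimate; in practice it would be replaced by a call to a real classifier on features extracted from the window.

```python
from typing import Callable, List, Sequence

Column = Sequence[int]  # one pixel column of a binarized word image (1 = ink)


def segment_word(
    columns: Sequence[Column],
    prob_fn: Callable[[Sequence[Column]], float],
    threshold: float = 0.8,  # assumed value; the abstract only says "predefined"
    min_width: int = 2,      # assumed initial window width, in columns
    step: int = 1,           # assumed growth increment, in columns
) -> List[int]:
    """Return the column indices at which the word image is cut."""
    cuts: List[int] = []
    start, width = 0, min_width
    while start + width <= len(columns):
        window = columns[start:start + width]
        if prob_fn(window) > threshold:
            # The classifier is confident: place a cut at the window's right edge
            # and restart a small window from there (assumed restart rule).
            cut = start + width
            cuts.append(cut)
            start, width = cut, min_width
        else:
            # Not confident yet: grow the window and ask again.
            width += step
    return cuts


def toy_prob(window: Sequence[Column]) -> float:
    """Hypothetical scorer standing in for the SVM probability estimate.

    It is confident only when the window contains ink and ends on a blank column.
    """
    has_ink = any(any(col) for col in window)
    ends_blank = not any(window[-1])
    return 1.0 if has_ink and ends_blank else 0.0


# Toy one-row "word": three ink runs separated by single blank columns.
word = [[1], [1], [1], [0], [1], [1], [0], [1], [1], [1]]
cuts = segment_word(word, toy_prob)
print(cuts)  # [4, 7]
```

In this sketch the last character is whatever remains after the final cut, so the loop simply stops once the window can no longer fit inside the word.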

Keywords


Character segmentation, Support vector machine, Sliding window technique, Assamese script.


