
Border Noise Removal from the Document Image Using X-Y Cut and Filtering Technique Based on Morphological Operation

Marian Wagdy, Ibrahima Faye, and Dayang Rohaya

Abstract


Noise is a common problem in most image understanding and analysis tasks. Border noise is a common type of document noise, introduced when scanning or photographing thick or old books. It degrades the accuracy of document image analysis systems such as OCR engines. In this paper we introduce an algorithm that effectively removes border noise from a document image by using X-Y cut and a filtering technique based on morphological operations. We evaluate our technique on the CBDAR 2007 document dewarping dataset and compare it with other state-of-the-art methods. The experimental results show that the algorithm performs well in removing border noise (both textual and non-textual) from a variety of documents with different levels of complexity and different structures. The test documents include diagrams, figures, and formulas, and cover several languages (English, Arabic, Chinese, and Japanese).
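
To illustrate the general idea behind an X-Y cut, the following is a minimal sketch, not the algorithm evaluated in the paper. It assumes a binarized page stored as a list of rows (1 = ink, 0 = background) and assumes that border noise is separated from the page content by a whitespace gap of at least `min_gap` pixels. The function names (`profile`, `runs`, `content_box`) and the parameters `min_gap` and `passes` are hypothetical, and the morphological filtering step mentioned in the abstract is not shown.

```python
def profile(img, axis):
    """Count ink pixels per row (axis=0) or per column (axis=1)."""
    if axis == 0:
        return [sum(row) for row in img]
    return [sum(col) for col in zip(*img)]


def runs(prof, min_gap):
    """Return (start, end) spans of non-empty bins, end exclusive.

    Empty gaps shorter than min_gap do not split a span.
    """
    spans, start, gap = [], None, 0
    for i, value in enumerate(prof):
        if value > 0:
            if start is None:
                start = i
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                spans.append((start, i - gap + 1))
                start, gap = None, 0
    if start is not None:
        spans.append((start, len(prof) - gap))
    return spans


def content_box(img, min_gap=2, passes=2):
    """Alternate horizontal and vertical cuts, keeping the span with most ink.

    Returns (top, bottom, left, right) with exclusive bottom/right,
    or None if the image contains no ink.
    """
    top, bottom, left, right = 0, len(img), 0, len(img[0])
    for _ in range(passes):
        for axis in (0, 1):
            region = [row[left:right] for row in img[top:bottom]]
            prof = profile(region, axis)
            spans = runs(prof, min_gap)
            if not spans:
                return None
            lo, hi = max(spans, key=lambda s: sum(prof[s[0]:s[1]]))
            if axis == 0:
                top, bottom = top + lo, top + hi
            else:
                left, right = left + lo, left + hi
    return top, bottom, left, right


# Toy page: a noisy dark strip in column 0 and a text block in rows 2-5, columns 3-8.
page = [[0] * 10 for _ in range(8)]
for r in range(8):
    page[r][0] = 1
for r in range(2, 6):
    for c in range(3, 9):
        page[r][c] = 1

print(content_box(page))  # (2, 6, 3, 9)
```

Choosing the span with the most ink is a deliberately crude heuristic for this sketch. It fails when border noise touches the text or contains more ink than the page content, which is one reason a practical method would combine the cuts with additional filtering.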

Keywords


Border noise, X-Y Cut algorithm, Morphological Operations.


