Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

This is an Open Access publication published under CSC-OpenAccess Policy.
Publications from CSC-OpenAccess Library are being accessed from over 74 countries worldwide.
Separation of mixed Document Images in Farsi Scanned Documents Using Blind Source Separation
Hossein Ghanbarloo, Farbod Razzazi, Shahpur Alirezaei
Pages - 421 - 435     |    Revised - 30-08-2010     |    Published - 30-10-2010
Volume - 4   Issue - 4    |    Publication Date - October 2010  Table of Contents
Blind source Separation, Independent component Analysis, show-through, feed-through, background removing, scanned documents processing
In the field of mixed scanned documents separation, various studies have been carried out to reduce one (or more) unwanted artifacts from the document. Most of the approaches are based on comparison of the front and back sides of the documents. In some cases, it has been suggested to analyze the colored images; however, because of the calculation complexity of the approaches, they are not very applicable in practical applications. Furthermore none of them are tested on Farsi/Arabic documents. In this paper, an applicable approach to large size images is presented which is based on image block segmentation (mosaicing). The advantages of this approach are less memory usage, combining of simultaneous and ordinal blind source separation methods in order to increase the algorithm efficiency, reducing calculation complexity of the algorithm into about twenty percents of the basic algorithm, and high stability in noisy images. In noiseless conditions, the average signal to noise ratio of the output images is reached up to 28.75 db. Furthermore, all of these cases have been tested on Farsi official documents. By applying the suggested ideas, considerable accuracy is achieved in the results, at minimum time. In addition, various parameters of the proposed algorithm (e.g. the size of each block, appropriate initial point, and number of iterations) were optimized.
1 Google Scholar 
2 CiteSeerX 
3 iSEEK 
4 Socol@r  
5 Scribd 
6 SlideShare 
8 PdfSR 
1 N. K. Kishore and P. P. Rege. “Adaptive enhancement of historical document images”. In Proceedings of IEEE International Symposium on Signal Processing and Information Technology. Giza, Dec 2007
2 B. Nija, N. G. Preethi, and S. S. Shylaja. “Degraded document image enhancement using Hybrid Thresholding and Mathematical Morphology”. In Proceedings of sixth Indian Conference on Computer Vision, Graphics & Image Processing. Bhubaneswar, Dec 2008
3 J. Kennard and W. A. Barrett. “Separating lines of text in free-form handwritten historical documents”. In Proceedings of Second International Conference on Document Image Analysis for Libraries (DIAL'06). Lyon, France, April 2006
4 F. Drira. “Towards restoring historic documents degraded over time”. In Proceedings of the Second International Conference on Document Image Analysis for Libraries (DIAL'06), Pages 350-357. Lyon, France. April 2006
5 R. Farrahi moghaddam and M. Cheriet. “Restoration of single sided low quality document images”. In Proceedings of Pattern Recognition, Volume 42, Issue 12, Pages 3355-3364. Dec 2009
6 M. Feng, Y. Tan. “Adaptive binarization method for document image analysis”. In Proceedings of the IEEE International Conference on Multimedia and Expo, ICME. Taipei, Taiwan. June 2004
7 F. Merrikh-Bayat, M. Babaie-Zadeh, and C. Jutten. “A nonlinear blind source separation solution for removing the show-through effect in the scanned documents”. In Proceedings of the 16th European Signal Processing Conference (EUSIPCO). Lausanne, Suisse. Sep 2008
8 J. Wang, M. S. Brown, and C. Lim Tan. “Accurate alignment of double-sided manuscripts for bleed-through removal”. In Proceedings of the 8th IAPR International Workshop on Document Analysis Systems (DAS), Pages 69-75. Nara, Japan. Sep 2008
9 Y. Huang, M. S. Brown, and D. Xu. “A framework for reducing ink-bleed in old document”. In Proceedings of the IEEE Computer Society Conference Vision and Pattern Recognition (CVPR 2008). Anchorage, Alaska, USA. June 2008
10 J. Banerjee. “Document enhancement using text specific prior”. Thesis of Master of Science Degree. Hyderabad, India. Dec 2008
11 C. Wolf. “Document ink bleed-through removal with two hidden Markov random fields and a single observation field”. In Proceedings of the IEEE Transactions on Pattern Analysis and Machine Intelligence Volume 32, no 3, Pages 431-447. Jan 2010
12 A. Tonazzini, I. Gerace, and F. Cricco. “Joint blind separation and restoration of mixed degraded images for document analysis”. In Proceedings of the International Conference on Image Processing (ICIP'04). Singapore. Oct 2004
13 M. Cheriet and R. Farrahi Moghaddam. “Processing of low quality document images: issues and directions”. In Proceedings of the 16th European Signal Processing Conference (EUSIPCO). Lausanne, Switzerland. August 2008
14 A. Tonazzini, E. Salerno, M. Mochi, and L. Bedini. “Blind Source Separation Techniques for Detecting Hidden Texts and Textures in document Images”. In Proceedings of the International Conference on Image Analysis and Recognition (ICIAR), Part II. Porto, Portugal. Sep-Oct 2004
15 A. Tonazzini, L. Bedini, and E. Salerno. “Independent component analysis for document restoration”. International Journal on Document Analysis and Recognition, Volume 7, Number 1, Pages 17-27. March 2004
16 A. Cichocki, and S. Amari. “Adaptive blind signal and image processing”. 2002.
Mr. Hossein Ghanbarloo
- Iran
Dr. Farbod Razzazi
- Iran
Dr. Shahpur Alirezaei
- Iran