Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

(679.8KB)
This is an Open Access publication published under CSC-OpenAccess Policy.
Publications from CSC-OpenAccess Library are being accessed from over 74 countries worldwide.
A Novel Method for De-warping in Persian document images captured by cameras
hadi dehbovid, farbod razzazi, shapor alirezaee
Pages - 390 - 400     |    Revised - 30-08-2010     |    Published - 30-10-2010
Volume - 4   Issue - 4    |    Publication Date - October 2010  Table of Contents
MORE INFORMATION
KEYWORDS
Geometric Distortion, OCR, camera based OCR, Image Archives
ABSTRACT
In this Paper, We proposed a novel algorithm for de-warping of Persian document images captured by the cameras. The aim of de-warping is to remove page distortions and to straighten document images captured by the cameras, so that the documents are readable to the OCR system. Recently, the industrial implementation of the images captured by digital cameras has significantly expanded. Most of the studies carries out so far in this regard have focused on the documents written in Latin and few researches have been conducted regarding Persian documents. The original idea of the proposed algorithm is based on the segmentation of the components of texts. In this algorithm, an effective technique is offered for detection of the upper and lower baselines, which is used in estimation of the slope of the words. Moreover, vertical shift of the warped words is done through fitting a quadratic curve fitted to the centers of the words in a line in relation to the horizontal line. The suggested algorithm is examined by qualitative and quantitative measures and the results of its implementation on various documents indicate a 92% accuracy of the proposed technique in correction of the location and angle of the words.
CITED BY (3)  
1 Shayegan, M. A. (2015). Dataset size and dimensionality reduction approaches for handwritten farsi digits and characters recognition (Doctoral dissertation, University of Malaya).
2 Camera, S. C. B. Electric Institute funded master's degree thesis master's program.
3 Guo Wende. (2011). Identification and automatic music playing system to retrieve the camera.
1 Google Scholar 
2 CiteSeerX 
3 iSEEK 
4 Socol@r  
5 Scribd 
6 SlideShare 
7 PDFCAST 
8 PdfSR 
1 J. Liang, D. Doermann, H. Li. "Camera-based analysis of text and documents: a survey". Int. Jour. Of Document Analysis and Recognition, 7(2-3): 84104, 2005
2 A. Ulges, C. Lampert, and T. M. Breuel. "Document capture using stereo vision". In Proceedings of the ACM Symposium on Document Engineering, Milwaukee, Wisconsin, USA, 2004
3 A. Yamashita, A. Kawarago, T. Kaneko and K.T.Miura. "Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system". In Proceedings of 17th International Conference on Pattern Recognition (ICPR) Cambridge UK, 2004
4 M.S. Brown and W.B. Seales. "Document restoration using 3d shape: A general deskewing algorithm for arbitrarily warped documents". In International Conference on Computer Vision (ICCV), Vancouver, B.C., Canada, 2001
5 M. Pilu. "Deskewing perspectively distorted documents: An approach based on perceptual organization". In HP Technical Reports, 2001
6 L. Zhang and C.L. Tan. "Warped image restoration with applications to digital libraries". In Proc. Eighth Int. Conf. on Document Analysis and Recognition, Washington, DC, USA, 2005
7 A. Ulges, C.H. Lampert and T.M. Breuel. "Document image dewarping using robust estimation of curled text lines". In Proc. Eighth Int. Conf. on Document Analysis and Recognition, Washington, DC, USA, 2005
8 J. Liang, D.F. DeMenthon, and D. Doermann. "Flattening curved documents in images". In Proc. Computer Vision and Pattern Recognition,San Diego, 2005
9 A. Masalovitch and L. Mestetskiy. "Usage of continuous skeletal image representation for document images de-warping". In 2nd Int. Workshop on Camera- Based Document Analysis and Recognition, Curitiba, Brazil, 2007
10 B.Gatos, I. Pratikakis, and K. Ntirogiannis. "Segmentation based recovery of arbitrarily warped document images". In Proc. Int. Conf. on Document Analysis and Recognition, Curitiba, Brazil, 2007
11 B. Fu, M.Wu, R. Li,W. Li, and Z. Xu. "A model-based book de-warping method using text line detection". In 2nd Int. Workshop on Camera-Based Document Analysis and Recognition, Curitiba, Brazil, 2007
12 U.V. Marti, H. Bunke. "Using a statistical language model to improve the performance of an HMMbased cursive handwriting recognition system". Int. Jour. of Pattern Recognition and Artifical Intelligence, 15(1): 6590, 2001
13 F. Shafait and T. M. Breuel. "Document Image Dewarping Contest". In proc CBDR, 2007
Mr. hadi dehbovid
Islamic Azad University, Nour Branch - Iran
hadi.dehbovid@gmail.com
Dr. farbod razzazi
paya soft - Iran
Dr. shapor alirezaee
- Iran