6533b834fe1ef96bd129d6a3
RESEARCH PRODUCT
Using skeleton and Hough transform variant to correct skew in historical documents
Dominique MichelucciOmar BoudraaWalid Khaled Hidoucisubject
General Computer ScienceHorizontal and verticalMorphological skeletonComputer scienceSkew estimationComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONDocument image analysis010103 numerical & computational mathematics02 engineering and technologySkeleton (category theory)01 natural sciencesMeasure (mathematics)Theoretical Computer ScienceHough transformlaw.inventionImage (mathematics)lawMorphological skeleton0202 electrical engineering electronic engineering information engineering[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]0101 mathematicsNumerical AnalysisPixelbusiness.industryApplied MathematicsProgressive probabilistic Hough transformSkew[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognitionSkew correction[INFO.INFO-TT]Computer Science [cs]/Document and Text ProcessingModeling and Simulation020201 artificial intelligence & image processingArtificial intelligencebusinessdescription
International audience; As a main part of several document analysis systems, Skew estimation represents one of the major research challenges, particularly in case of historical documents exploration. In this paper, we propose an original skew angle detection and correction technique. Morphological Skeleton is introduced to considerably diminish the amount of data by eliminating the redundant pixels and preserving only the central curves of the image components. Next, the proposed method uses Progressive Probabilistic Hough Transform (PPHT) to find image lines. At the end, a specific procedure is applied in order to measure the global skew angle of the document image from these identified lines. Experimental results demonstrate the accuracy and the effectiveness of our approach on skew angle detection upon three popular datasets covering many types of documents of diverse linguistic writings (Chinese, Greek and English) and different styles (horizontal or vertical orientations, including figures and tables, multi-columns page layouts).
year | journal | country | edition | language |
---|---|---|---|---|
2020-01-01 | Mathematics and Computers in Simulation |