A robust segmentation of scanned documents
In: SPIE Proceedings, 2015-02-08
Reprints of documents scanned at high resolution may not satisfy human viewers, who expect at least the same image quality as the original document. Moiré artifacts left by inadequate descreening, text blurred by a poor scanner modulation transfer function (MTF), and color distortion resulting from misclassification between color and gray can all degrade reprint quality. To remedy these shortcomings, the document should be classified by attributes such as image or text, edge or non-edge, continuous-tone or halftone, color or gray, and so on. Reprint quality can then be improved by applying the enhancement appropriate to each attribute. In this paper, we introduce a robust and effective approach that classifies a scanned document into attributes for each pixel. The proposed document segmentation algorithm uses simple features such as the variance-to-mean ratio (VMR) and the gradient, computed over various combinations of sizes and positions of a processing kernel. We also exploit each gradient direction at multiple positions within the same kernel to detect text as small as 4 points. Experimental results show that the proposed algorithm performs well on various types of scanned documents, including documents printed at a low lines-per-inch (LPI) resolution.
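To make the two features named in the abstract concrete, the following is a minimal sketch of a per-pixel variance-to-mean ratio and directional gradient computed over a small kernel. It is not the authors' algorithm: the function names, the 3x3 kernel, the two thresholds, and the three output labels are all assumptions chosen for illustration, and the abstract does not specify how the features are actually combined.

```python
from statistics import mean, pvariance


def window(img, r, c, k):
    """Pixel values of the k x k window centred on (r, c), clipped at the borders."""
    h = k // 2
    rows = range(max(0, r - h), min(len(img), r + h + 1))
    cols = range(max(0, c - h), min(len(img[0]), c + h + 1))
    return [img[y][x] for y in rows for x in cols]


def vmr(img, r, c, k=3):
    """Variance-to-mean ratio of the window; 0.0 when the window mean is 0."""
    vals = window(img, r, c, k)
    m = mean(vals)
    return pvariance(vals) / m if m else 0.0


def gradients(img, r, c):
    """Central-difference gradients (horizontal, vertical) at an interior pixel."""
    gx = (img[r][c + 1] - img[r][c - 1]) / 2
    gy = (img[r + 1][c] - img[r - 1][c]) / 2
    return gx, gy


def classify(img, r, c, k=3, vmr_thresh=20.0, grad_thresh=40.0):
    """Toy decision rule; thresholds and labels are illustrative assumptions."""
    gx, gy = gradients(img, r, c)
    strong_edge = max(abs(gx), abs(gy)) >= grad_thresh
    busy = vmr(img, r, c, k) >= vmr_thresh
    if strong_edge and busy:
        return "edge"
    if busy:
        return "textured"
    return "flat"


# Three hypothetical 5x5 grayscale patches.
flat = [[100] * 5 for _ in range(5)]
step = [[20, 20, 20, 220, 220] for _ in range(5)]
checker = [[50 if (y + x) % 2 == 0 else 150 for x in range(5)] for y in range(5)]

for name, patch in (("flat", flat), ("step", step), ("checker", checker)):
    print(name, classify(patch, 2, 2))
```

In this sketch a fine checkerboard has a high VMR but a zero central-difference gradient at its centre, while a step edge scores high on both, which is one plausible reason to combine the two features rather than rely on either alone.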
Title: | A robust segmentation of scanned documents |
---|---|
Author / Contributor: | Ji Young Yi; Hyungjun Park |
Journal: | SPIE Proceedings, 2015-02-08 |
Publication: | SPIE, 2015 |
Media type: | unknown |
ISSN: | 0277-786X (print) |
DOI: | 10.1117/12.2076907 |