Open Access Open Access  Restricted Access Subscription or Fee Access

Multiscale Segmentation for Mixed Raster Content Applicable to Document Coding

S. Amutha, V. Ponraj

Abstract


Compound document images contain graphic or textual content along with pictures. They are found in magazines, brochures, web-sites, etc in a document format. The goal is to compress an image containing the mixed raster content (MRC) using multi-layer approach. The proposed methodology segments the image into regions such as text, pictures and background. The key to MRC compression is the separation of the document into foreground and background layers, represented as a binary mask. The compression quality depends on the segmentation algorithm used to compute the binary mask. The proposed multi-scale segmentation algorithm models the complex aspects of both local and global contextual behavior. The proposed algorithm finds the block-wise segmentation of the raster image in a global cost optimization framework. Then the initial segmentation is refined by classifying feature vectors of connected components using a Markov random field (MRF) model. Then hybrid procedures of the previous steps are then incorporated into a multi-scale framework in order to improve the segmentation accuracy of text with varying size. It is shown that the proposed methodology achieves greater accuracy of text detection but with a lower false detection rate of non-text features. This segmentation algorithm can improve the quality of decoded documents while simultaneously lowering the bit rate. It is also shown that execution time can be greatly reduced by the use of features that are not computationally intensive.

Keywords


Muliscale Image Analysis, Mixed Raster Content, Document Image Segmentation, MRC Compression, Markov Random Fields, Document Coding.

Full Text:

PDF

References


International Telecommunication Union, ITU-T recommendation T.44 Mixed raster content (MRC), April 1999.

R. L. de Queiroz, R. Buckley, M. Xu, “Mixed raster content (MRC) model for compound image compression”, Proc. SPIE, Visual Commu-nications and Image Processing, Vol. 3653, pp. 1106-1117, Jan 1999.

R. L. de Queiroz, “Compressing Compound Documents”, in The Doc-ument and Image Compression Handbook, edited by M. Barni, Marcel-Dekker, 2005.

A. Said and A. Drukarev, “Simplified segmentation for com-pound image compression,” in Proc. of IEEE Int’l Confer-ence on Image Proc., Kobe, Japan, October 1999, vol. 1, pp. 229–233.

L. Bottou et al, “High quality document image commpression using djvu,” Journal of Electronic Image, vol. 7, pp. 410–425, 1998.

J. Huang, Y. Wang, and E. K. Wong, “Check image compression using a layered coding method,” Journal of Electronic Imaging, vol. 7, no. 3, July 1998.

K. Etemad, D. Doermann, and R. Chellappa, “Multiscale segmentation of unstructured document pages using soft decision integration,” IEEE Trans. Pattern Anal. Machine Intell., vol. 19, pp. 92–96, Jan. 1997.

M. Unser, “Texture classification and segmentation using wavelet frames,” IEEE Trans. Image Processing, vol. 4, pp. 1549–1560, Nov. 1995.

E.Haneda and A.Bouman, “ Text Segmentation for MRC Document Compression”, IEEE Trans. On Image Processing, Vol. 20, pp. 1611-1626 , June 2011.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.