Textual Information Localization and Retrieval in Document Images Based on Quadtree Decomposition (original) (raw)
Abstract
Textual information extraction is a challenging issue in Information Retrieval. Two main approaches are commonly distinguished: texture-based and region-based. In this paper, we propose a method guided by the quadtree decomposition. The principle of the method is to recursively decompose regions of a document image is four equal regions, starting from the image of the whole document. At each step of the decomposition process an OCR engine is used for retrieving a given textual information from the obtained regions. Experiments on real invoice data provide promising results.
Similar content being viewed by others
References
- Dagher, I., & Taleb, C. (2014). Image denoising using fourth order wiener filter with wavelet quadtree decomposition. Journal of Electrical and Computer Engineering, 2014, 9.
Article Google Scholar - Emmanouilidis, C., Batsalas, C., & Papamarkos, N. (2009). Development and evaluation of text localization techniques based on structural texture features and neural classifiers. In 10th International Conference on Document Analysis and Recognition (pp. 1270–1274).
Google Scholar - Finkel, R. A., & Bentley, J. L. (1974). Quad trees: A data structure for retrieval on composite keys. Acta Informatica, 4, 11–9.
Article MATH Google Scholar - Gatos, B. G. (2014). Imaging techniques in document analysis processes. In Handbook of document image processing and recognition (Vol. 1, pp. 73–131). London: Springer-Verlag
Google Scholar - Jacobs, P. S. (2014). Text-based intelligent systems: Current research and practice in information extraction and retrieval. New York: Psychology Press.
Google Scholar - Minaee, S., Yu, H., & Wang, Y. (2014). A robust regression approach for background/Foreground segmentation. arXiv preprint arXiv: 1412.5126.
Google Scholar - Piskorski, J., & Yangarber, R. (2013). Information extraction: past, present and future. In Multi-source, multilingual information extraction and summarization (pp. 23–49). Berlin Heidelberg: Springer-Verlag
Chapter Google Scholar - Ramanathan, V., Mishra, S., & Mitra, P. (2011). Quadtree decomposition based extended vector space model for image retrieval. 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 139–144.
Google Scholar - Sumathi, C. P., Santhanam, T., & Gayathri, D. (2012a). A survey on various approaches of text extraction in images. International Journal of Computer Science & Engineering Survey, 3(4), 27–42.
Google Scholar - Sumathi, C. P., Santhanam, T., Priya, N. (2012b). Techniques and challenges of automatic text extraction in complex images: a survey. Journal of Theoretical and Applied Information Technology, 35(2), 225–235.
Google Scholar - Wei, L., Lefebvre, S., Kwatra, V., Turk, G., (2009). State of the art in example-based texture synthesis. Eurographics 2009, State of the Art Report, EG-STAR (pp. 93–117).
Google Scholar - Ying, L., Dengsheng, Z., & Guojun, L. (2008). Region based image retrieval with high-level semantics using decision tree learning. Pattern Recognition, 41, 2554–2570.
Article MATH Google Scholar - Ying, L., Zhang, G., & Wei-Ying, M. (2006). Study on texture feature extraction in region-based image retrieval system. In International Multimedia Modelling Conference (pp. 264–271).
Google Scholar - Yuan, Y., Kim, I. K., Zheng, X., Liu, L., Cao, X., Lee, S., & Park, J. H. (2012). Quadtree based nonsquare block structure for inter frame coding in high efficiency video coding. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1707–1719.
Article Google Scholar
Author information
Authors and Affiliations
- EA2525-LIM, University of Reunion Island, Ile de La Reunion, Saint Denis, France
Cynthia Pitou & Jean Diatta
Authors
- Cynthia Pitou
- Jean Diatta
Corresponding author
Correspondence toCynthia Pitou .
Editor information
Editors and Affiliations
- Jacobs University Bremen , Bremen, Germany
Adalbert F.X. Wilhelm - Universität Ulm, Institute of Medical Systems Biology Universität Ulm, Ulm, Baden-Württemberg, Germany
Hans A. Kestler
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Pitou, C., Diatta, J. (2016). Textual Information Localization and Retrieval in Document Images Based on Quadtree Decomposition. In: Wilhelm, A., Kestler, H. (eds) Analysis of Large and Complex Data. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-25226-1\_6
Download citation
- .RIS
- .ENW
- .BIB
- DOI: https://doi.org/10.1007/978-3-319-25226-1\_6
- Published: 04 August 2016
- Publisher Name: Springer, Cham
- Print ISBN: 978-3-319-25224-7
- Online ISBN: 978-3-319-25226-1
- eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)Springer Nature Proceedings excluding Computer Science
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.