Please use this identifier to cite or link to this item: https://doi.org/10.1145/1284420.1284427
Title: A system for understanding imaged infographics and its applications
Authors: Huang, W. 
Tan, C.L. 
Keywords: Applications
Association of text and graphics
Document image understanding
Infographics
Issue Date: 2007
Citation: Huang, W.,Tan, C.L. (2007). A system for understanding imaged infographics and its applications. DocEng'07: Proceedings of the 2007 ACM Symposium on Document Engineering : 9-18. ScholarBank@NUS Repository. https://doi.org/10.1145/1284420.1284427
Abstract: Information graphics, or infographics, are visual representations of information, data or knowledge. Understanding of infographics in documents is a relatively new research problem, which becomes more challenging when infographics appear as raster images. This paper describes technical details and practical applications of the system we built for recognizing and understanding imaged infographics located in document pages. To recognize infographics in raster form, both graphical symbol extraction and text recognition need to be performed. The two kinds of information are then auto-associated to capture and store the semantic information carried by the infographics. Two practical applications of the system are introduced in this paper, including supplement to traditional optical character recognition (OCR) system and providing enriched information for question answering (QA). To test the performance of our system, we conducted experiments using a collection of downloaded and scanned infographic images. Another set of scanned document pages from the University of Washington document image database were used to demonstrate how the system output can be used by other applications. The results obtained confirm the practical value of the system. Copyright 2007 ACM.
Source Title: DocEng'07: Proceedings of the 2007 ACM Symposium on Document Engineering
URI: http://scholarbank.nus.edu.sg/handle/10635/41055
ISBN: 9781595937766
DOI: 10.1145/1284420.1284427
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.