|Title:||A system for understanding imaged infographics and its applications|
|Authors:||Huang, W.; Tan, C.L.|
|Keywords:||Association of text and graphics; Document image understanding|
|Citation:||Huang, W., Tan, C.L. (2007). A system for understanding imaged infographics and its applications. DocEng'07: Proceedings of the 2007 ACM Symposium on Document Engineering: 9-18. ScholarBank@NUS Repository. https://doi.org/10.1145/1284420.1284427|
|Abstract:||Information graphics, or infographics, are visual representations of information, data or knowledge. Understanding of infographics in documents is a relatively new research problem, which becomes more challenging when infographics appear as raster images. This paper describes technical details and practical applications of the system we built for recognizing and understanding imaged infographics located in document pages. To recognize infographics in raster form, both graphical symbol extraction and text recognition need to be performed. The two kinds of information are then auto-associated to capture and store the semantic information carried by the infographics. Two practical applications of the system are introduced in this paper: supplementing traditional optical character recognition (OCR) systems and providing enriched information for question answering (QA). To test the performance of our system, we conducted experiments using a collection of downloaded and scanned infographic images. Another set of scanned document pages from the University of Washington document image database was used to demonstrate how the system output can be used by other applications. The results obtained confirm the practical value of the system. Copyright 2007 ACM.|
|Source Title:||DocEng'07: Proceedings of the 2007 ACM Symposium on Document Engineering|
|Appears in Collections:||Staff Publications|