Please use this identifier to cite or link to this item:
https://doi.org/10.1145/2467696.2467741
Title: | Multimodal alignment of scholarly documents and their presentations | Authors: | Bahrani, B. Kan, M.-Y. |
Keywords: | Digital library Fine-grained document alignment Slide image classification Slide presentation |
Issue Date: | 2013 | Citation: | Bahrani, B.,Kan, M.-Y. (2013). Multimodal alignment of scholarly documents and their presentations. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries : 281-284. ScholarBank@NUS Repository. https://doi.org/10.1145/2467696.2467741 | Abstract: | We present a multimodal system for aligning scholarly docu- ments to corresponding presentations in a fine-grained manner (i.e., per presentation slide and per paper section). Our method improves upon a state-of-the-art baseline that employs only textual similarity. Based on an analysis of base- line errors, we propose a three-pronged alignment system that combines textual, image, and ordering information to establish alignment. Our results show a statistically significant improvement of 25%, confirming the importance of visual content in improving alignment accuracy. Copyright © 2013 by the Association for Computing Machinery, Inc. (ACM). | Source Title: | Proceedings of the ACM/IEEE Joint Conference on Digital Libraries | URI: | http://scholarbank.nus.edu.sg/handle/10635/78246 | ISBN: | 9781450320764 | ISSN: | 15525996 | DOI: | 10.1145/2467696.2467741 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.