Please use this identifier to cite or link to this item: https://doi.org/10.1145/2467696.2467741
Title: Multimodal alignment of scholarly documents and their presentations
Authors: Bahrani, B.
Kan, M.-Y. 
Keywords: Digital library
Fine-grained document alignment
Slide image classification
Slide presentation
Issue Date: 2013
Citation: Bahrani, B., Kan, M.-Y. (2013). Multimodal alignment of scholarly documents and their presentations. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries: 281-284. ScholarBank@NUS Repository. https://doi.org/10.1145/2467696.2467741
Abstract: We present a multimodal system for aligning scholarly documents to corresponding presentations in a fine-grained manner (i.e., per presentation slide and per paper section). Our method improves upon a state-of-the-art baseline that employs only textual similarity. Based on an analysis of baseline errors, we propose a three-pronged alignment system that combines textual, image, and ordering information to establish alignment. Our results show a statistically significant improvement of 25%, confirming the importance of visual content in improving alignment accuracy. Copyright © 2013 by the Association for Computing Machinery, Inc. (ACM).
Source Title: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
URI: http://scholarbank.nus.edu.sg/handle/10635/78246
ISBN: 9781450320764
ISSN: 15525996
DOI: 10.1145/2467696.2467741
Appears in Collections: Staff Publications
