Title
Multimodal alignment of scholarly documents and their presentations
Abstract
We present a multimodal system for aligning scholarly documents to corresponding presentations in a fine-grained manner (i.e., per presentation slide and per paper section). Our method improves upon a state-of-the-art baseline that employs only textual similarity. Based on an analysis of baseline errors, we propose a three-pronged alignment system that combines textual, image, and ordering information to establish alignment. Our results show a statistically significant improvement of 25%, confirming the importance of visual content in improving alignment accuracy.
Year
2013
DOI
10.1145/2467696.2467741
Venue
JCDL
Keywords
scholarly document, textual similarity, corresponding presentation, alignment accuracy, multimodal system, significant improvement, fine-grained manner, paper section, multimodal alignment, baseline error, three-pronged alignment system, presentation slide, digital library
Field
Information retrieval, Computer science, Digital library, Multimedia
DocType
Conference
ISSN
2575-7865
Citations
2
PageRank
0.41
References
23
Authors
2
Name             Order  Citations  PageRank
Bamdad Bahrani   1      7          0.83
Min-yen Kan      2      2786       162.35