Abstract | ||
---|---|---|
We present a multimodal system for aligning scholarly documents to corresponding presentations in a fine-grained manner (i.e., per presentation slide and per paper section). Our method improves upon a state-of-the-art baseline that employs only textual similarity. Based on an analysis of baseline errors, we propose a three-pronged alignment system that combines textual, image, and ordering information to establish alignment. Our results show a statistically significant improvement of 25%, confirming the importance of visual content in improving alignment accuracy. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1145/2467696.2467741 | JCDL |
Keywords | Field | DocType |
scholarly document,textual similarity,corresponding presentation,alignment accuracy,multimodal system,significant improvement,fine-grained manner,paper section,multimodal alignment,baseline error,three-pronged alignment system,presentation slide,digital library | Information retrieval,Computer science,Digital library,Multimedia | Conference |
ISSN | Citations | PageRank |
2575-7865 | 2 | 0.41 |
References | Authors | |
23 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bamdad Bahrani | 1 | 7 | 0.83 |
Min-yen Kan | 2 | 2786 | 162.35 |