Multimodal Alignment of Scholarly Documents and Their Presentations
Research Area: Digital Libraries
Year: 2013
Type of Publication: Master's thesis
  • Bamdad Bahrani
We present a multimodal system for aligning scholarly documents with their corresponding presentations in a fine-grained manner (i.e., per presentation slide and per paper section). Our method improves upon a state-of-the-art baseline that employs only textual similarity. Based on an analysis of the errors made by the baseline, we propose a three-pronged alignment system that combines textual, image, and ordering information to establish alignment. Our results show a statistically significant improvement of 25% over the baseline. This result confirms the importance of visual content for improving document alignment accuracy.
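As a rough illustration of how textual, image, and ordering information might be combined, the sketch below scores every slide/section pair with a weighted sum of two similarity matrices and then picks one section per slide with a dynamic programme that penalises jumping backwards in the paper. This is not the method from the thesis: the function name `align_slides`, the weights, the backward-jump penalty, and the assumption that similarity scores are already computed are all hypothetical choices made for the example.

```python
from typing import List


def align_slides(text_sim: List[List[float]],
                 image_sim: List[List[float]],
                 w_text: float = 0.6,
                 w_image: float = 0.4,
                 jump_penalty: float = 0.1) -> List[int]:
    """Return, for each slide, the index of the section it is aligned to.

    text_sim[i][j] and image_sim[i][j] are assumed, precomputed similarity
    scores between slide i and section j. The weights and the penalty are
    illustrative values, not taken from the thesis.
    """
    n_slides = len(text_sim)
    n_sections = len(text_sim[0])

    # Content evidence: weighted sum of the textual and image signals.
    score = [[w_text * text_sim[i][j] + w_image * image_sim[i][j]
              for j in range(n_sections)]
             for i in range(n_slides)]

    # best[i][j]: best total score for slides 0..i with slide i on section j.
    best = [score[0][:]]
    back: List[List[int]] = []
    for i in range(1, n_slides):
        row, ptr = [], []
        for j in range(n_sections):
            # Ordering evidence: moving from section k back to an earlier
            # section j costs jump_penalty per section skipped backwards.
            cands = [best[i - 1][k] - (jump_penalty * (k - j) if k > j else 0.0)
                     for k in range(n_sections)]
            k_best = max(range(n_sections), key=cands.__getitem__)
            row.append(cands[k_best] + score[i][j])
            ptr.append(k_best)
        best.append(row)
        back.append(ptr)

    # Trace the best path back from the last slide to the first.
    j = max(range(n_sections), key=best[-1].__getitem__)
    path = [j]
    for ptr in reversed(back):
        j = ptr[j]
        path.append(j)
    path.reverse()
    return path


if __name__ == "__main__":
    # Toy example: three slides, two sections, no image evidence.
    text = [[1.0, 0.0], [0.0, 1.0], [0.55, 0.5]]
    image = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
    print(align_slides(text, image, w_text=1.0, w_image=0.0))
```

In the toy input, the third slide is slightly more similar to the first section on text alone, so with `jump_penalty=0.0` it would be sent back to that section; the ordering penalty keeps it on the second section instead.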