Title
"Hey, vitrivr!" - A Multimodal UI for Video Retrieval.
Abstract
In this paper, we present a multimodal web-based user interface for the vitrivr system. vitrivr is a modern, open-source video retrieval system for searching large video collections using a wide variety of query modes, including query-by-sketch, query-by-example and query-by-motion. With the multimodal user interface, users can interact naturally with the vitrivr system through spoken commands and through multimodal commands that combine spoken instructions with manual pointing. While the main strength of the UI is the seamless combination of speech-based and sketch-based interaction for multimedia similarity search, the speech modality has proven to be very effective for retrieval on its own. In particular, it helps overcome accessibility barriers and offers retrieval functionality to users with disabilities. Finally, for a holistic, natural experience with the vitrivr system, we have integrated a speech synthesis engine that returns spoken answers to the user.
Year
2017
DOI
10.1007/978-3-319-56608-5_75
Venue
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017
Field
Speech synthesis, Video retrieval, Information retrieval, Computer science, User interface, Nearest neighbor search, Sketch
DocType
Conference
Volume
10193
ISSN
0302-9743
Citations
0
PageRank
0.34
References
2
Authors
5
Name              Order  Citations  PageRank
Prateek Goel      1      0          0.34
Ivan Giangreco    2      93         11.64
Luca Rossetto     3      92         21.00
Claudiu Tanase    4      31         6.05
H. Schuldt        5      98         20.60