Title
A Multiple-Instance Learning Approach to Sentence Selection for Question Ranking.
Abstract
In example-based retrieval a system is queried with a document aiming to retrieve other similar or relevant documents. We address an instance of this problem: question retrieval in community Question Answering (cQA) forums. In this scenario, both the document collection and the queries are relatively short multi-sentence documents subject to noise and redundancy, which makes it harder for learning-to-rank algorithms to build upon the proper text representation. In order to only exploit the relevant fragments of the query and collection documents, we treat them as a sequence of sentences, in a multipleinstance learning fashion. By automatically pre-selecting the best sentences for our tree-kernel-based learning model, we improve over using full text performance on the dataset of the 2016 SemEval cQA challenge in terms of accuracy and speed, reaching the state of the art.
Year
DOI
Venue
2017
10.1007/978-3-319-56608-5_34
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017
Field
DocType
Volume
Parse tree,Question answering,Information retrieval,Ranking,Computer science,Redundancy (engineering),Artificial intelligence,Natural language processing,Sentence
Conference
10193
ISSN
Citations 
PageRank 
0302-9743
0
0.34
References 
Authors
24
4
Name
Order
Citations
PageRank
Salvatore Romeo1275.32
Giovanni Da San Martino223627.08
Alberto Barrón-Cedeño334629.35
Alessandro Moschitti43262177.68