Title
Enabling multi-level relevance feedback on PubMed by integrating rank learning into DBMS
Abstract
BACKGROUND: Finding relevant articles from PubMed is challenging because it is hard to express the user's specific intention in the given query interface, and a keyword query typically retrieves a large number of results. Researchers have applied machine learning techniques to find relevant articles by ranking the articles according to the learned relevance function. However, the process of learning and ranking is usually done offline without integrated with the keyword queries, and the users have to provide a large amount of training documents to get a reasonable learning accuracy. This paper proposes a novel multi-level relevance feedback system for PubMed, called RefMed, which supports both ad-hoc keyword queries and a multi-level relevance feedback in real time on PubMed. RESULTS: RefMed supports a multi-level relevance feedback by using the RankSVM as the learning method, and thus it achieves higher accuracy with less feedback. RefMed "tightly" integrates the RankSVM into RDBMS to support both keyword queries and the multi-level relevance feedback in real time; the tight coupling of the RankSVM and DBMS substantially improves the processing time. An efficient parameter selection method for the RankSVM is also proposed, which tunes the RankSVM parameter without performing validation. Thereby, RefMed achieves a high learning accuracy in real time without performing a validation process. RefMed is accessible at http://dm.postech.ac.kr/refmed. CONCLUSIONS: RefMed is the first multi-level relevance feedback system for PubMed, which achieves a high accuracy with less feedback. It effectively learns an accurate relevance function from the user’s feedback and efficiently processes the function to return relevant articles in real time.
Year
DOI
Venue
2010
10.1186/1471-2105-11-S2-S6
BMC Bioinformatics
Keywords
Field
DocType
tight coupling,artificial intelligence,algorithms,microarrays,database management systems,computational biology,bioinformatics,machine learning,real time,feedback
Relevance feedback,Information retrieval,Ranking,Computer science,Ranking (information retrieval),Bioinformatics
Journal
Volume
Issue
ISSN
11
Suppl 2
1471-2105
Citations 
PageRank 
References 
12
0.64
18
Authors
6
Name
Order
Citations
PageRank
Hwanjo Yu11715114.02
Tae-Hoon Kim245953.02
Jinoh Oh330315.32
Ilhwan Ko4251.94
Sungchul Kim510810.36
Wook-Shin Han680557.85