Title
Fusion of Knowledge-Intensive and Statistical Approaches for Retrieving and Annotating Textual Genomics Documents
Abstract
This paper represents a continuation of research into the retrieval and annotation of textual genomics documents (both MEDLINE® citations and full text articles) for the purpose of satisfying biologists' real information needs. The overall approach taken here for both the ad hoc retrieval and categorization tasks within the TREC genomics track in 2005 was one combining the results of several NLP, statistical and ML methods, using a fusion method for ad hoc retrieval and ensemble methods for categorization. The results show that fusion approaches can improve the final outcome for the ad hoc and the categorization tasks, but that care must be taken in order to take advantage of the strengths of the constituent methods.
Year
Venue
Keywords
2005
TREC
thematic analysis.,medline/pubmed,information retrieval,machine learning,statistical natural language processing,vector space models,genomics,mesh,information need,vector space model,satisfiability
Field
DocType
Citations 
Data mining,Categorization,Information needs,Annotation,Information retrieval,Computer science,Genomics,Natural language processing,Artificial intelligence,MEDLINE,TREC Genomics,Ensemble learning
Conference
27
PageRank 
References 
Authors
2.03
18
10
Name
Order
Citations
PageRank
Alan R. Aronson12551260.67
Dina Demner Fushman21717147.70
Susanne M. Humphrey356163.27
Jimmy Lin44800376.93
patrick ruch511722.37
Miguel E. Ruiz662647.00
Lawrence H. Smith719614.48
Lorraine Tanabe838329.80
W. John Wilbur942443.91
Hongfang Liu101479160.66