Title
Comparison of full-text searching to metadata searching for genes in two biomedical literature cohorts
Abstract
Researchers have traditionally used bibliographic databases to find information. Today, the full text of resources is increasingly available for searching, and more researchers are performing full-text searches. This study compares the number of articles discovered by metadata and full-text searches of the same literature cohort when searching for gene names in two biomedical literature domains. Three reviewers additionally ranked 100 articles in each domain. Significantly more articles were discovered via full-text searching; however, the precision of full-text searching is also significantly lower than that of metadata searching. Certain features of articles correlated with higher relevance ratings. One significant feature was the number of matches of the search term in the full text of the article: articles with more matches received significantly higher usefulness (i.e., relevance) ratings. By using the number of hits of the search term in the full text to rank the importance of the article, the performance of full-text searching was improved so that both recall and precision were as good as or better than those of metadata searching. This suggests that full-text searching alone may be sufficient, and that metadata searching as a surrogate is not necessary. © 2007 Wiley Periodicals, Inc.
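The ranking idea in the abstract (ordering articles by how often the search term occurs in the full text, then measuring precision and recall) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the whole-word case-insensitive matching rule, the `min_hits` cutoff, and the toy documents with the gene name BRCA1 are all assumptions made for illustration.

```python
import re
from typing import Dict, List, Set, Tuple


def count_hits(text: str, term: str) -> int:
    """Count whole-word, case-insensitive matches of term in text (assumed matching rule)."""
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
    return len(pattern.findall(text))


def rank_by_hits(docs: Dict[str, str], term: str, min_hits: int = 1) -> List[str]:
    """Return ids of documents with at least min_hits matches, most hits first."""
    counts = {doc_id: count_hits(text, term) for doc_id, text in docs.items()}
    kept = [doc_id for doc_id, n in counts.items() if n >= min_hits]
    # Sort by descending hit count; break ties by id so the order is deterministic.
    return sorted(kept, key=lambda doc_id: (-counts[doc_id], doc_id))


def precision_recall(retrieved: List[str], relevant: Set[str]) -> Tuple[float, float]:
    """Precision = relevant retrieved / retrieved; recall = relevant retrieved / relevant."""
    found = sum(1 for doc_id in retrieved if doc_id in relevant)
    precision = found / len(retrieved) if retrieved else 0.0
    recall = found / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical toy cohort: only article "a" is judged relevant.
docs = {
    "a": "BRCA1 variants were sequenced. BRCA1 expression rose; BRCA1 was knocked out.",
    "b": "We mention BRCA1 once in passing.",
    "c": "This article discusses no gene at all.",
}
relevant = {"a"}

any_hit = rank_by_hits(docs, "BRCA1")               # every article with at least one match
several = rank_by_hits(docs, "BRCA1", min_hits=2)   # keep only articles with repeated matches
print(any_hit, precision_recall(any_hit, relevant))
print(several, precision_recall(several, relevant))
```

In this toy cohort, raising the hit cutoff removes the article that mentions the term only in passing, which raises precision without lowering recall. Whether a cutoff behaves this way on real literature depends on the cohort and the relevance judgments.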
Year: 2007
DOI: 10.1002/asi.v58:14
Venue: JASIST
Keywords: metadata, recall, medical informatics, precision
Field: Data mining, Metadata, Text searching, Information retrieval, Bibliographic database, Ranking, Computer science, Precision and recall, Health informatics, Gene nomenclature, Recall
DocType: Journal
Volume: 58
Issue: 14
ISSN: 1532-2882
Citations: 14
PageRank: 0.63
References: 15
Authors: 4
Name                  Order  Citations  PageRank
Bradley M. Hemminger  1      456        38.24
Billy Saelim          2      20         1.21
Patrick F. Sullivan   3      165        22.87
Todd J. Vision        4      168        14.98