Title
A High Recall Classifier for Selecting Articles for MEDLINE Indexing.
Abstract
MEDLINE is the National Library of Medicine's premier bibliographic database for biomedical literature. A highly valuable feature of the database is that each record is manually indexed with a controlled vocabulary called MeSH. Most MEDLINE journals are indexed cover-to-cover, but there are about 200 selectively indexed journals for which only articles related to biomedicine and life sciences are indexed. In recent years, the selection process has become an increasing burden for indexing staff, and this paper presents a machine learning based system that offers very significant time savings by semi-automating the task. At the core of the system is a high recall classifier for the identification of journal articles that are in-scope for MEDLINE. The system is shown to reduce the number of articles requiring manual review by 54%, equivalent to approximately 40,000 articles per year.
Year: 2019
Venue: AMIA
DocType: Conference
Volume: 2019
ISSN: 1942-597X
Citations: 0
PageRank: 0.34
References: 0
Authors: 4
Name
Order
Citations
PageRank
Alastair R. Rae102.70
Max E. Savery200.34
James G. Mork364765.22
Dina Demner Fushman41717147.70