Title
Extracting Characteristics of the Study Subjects from Full-Text Articles.
Abstract
Characteristics of the subjects of biomedical research are important in determining whether a publication describing the research is relevant to a search. To facilitate finding relevant publications, MEDLINE citations provide Medical Subject Headings that describe the subjects' characteristics, such as their species, gender, and age. We seek to improve the recommendation of these headings by the Medical Text Indexer (MTI) that supports manual indexing of MEDLINE. To that end, we explore the potential of the full text of the publications. Using simple, recall-oriented, rule-based methods, we determined that adding sentences extracted from the methods sections and captions to the abstracts prior to MTI processing significantly improved recall and F1 score with only a slight drop in precision. Improvements were also achieved in directly assigning several headings extracted from the full text. These results indicate the need for further development of automated methods capable of leveraging the full text for indexing.
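The pipeline described in the abstract (select cue-bearing sentences from the methods section and captions, then append them to the abstract before indexing) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' rules: the cue patterns in `CUES`, the `sections` dictionary layout, and all function names are hypothetical, and MTI itself is not invoked.

```python
import re

# Hypothetical cue patterns for a few subject-characteristic headings.
# The actual rules used in the paper are not given in the abstract.
CUES = {
    "Humans": re.compile(r"\b(patients?|participants?|volunteers?|children)\b", re.I),
    "Mice": re.compile(r"\b(mice|mouse|murine)\b", re.I),
    "Female": re.compile(r"\b(females?|women|woman|girls?)\b", re.I),
    "Male": re.compile(r"\b(males?|men|man|boys?)\b", re.I),
}


def split_sentences(text):
    """Naive splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def select_sentences(sections):
    """Keep sentences from the methods section and captions that match any cue.

    `sections` is an assumed layout: a dict mapping section name to its text.
    """
    picked = []
    for name in ("methods", "captions"):
        for sentence in split_sentences(sections.get(name, "")):
            if any(pattern.search(sentence) for pattern in CUES.values()):
                picked.append(sentence)
    return picked


def augment_abstract(abstract, sections):
    """Append the selected sentences to the abstract before indexing."""
    return " ".join([abstract] + select_sentences(sections))


def candidate_headings(sections):
    """Directly propose headings whose cues occur in the selected sentences."""
    text = " ".join(select_sentences(sections))
    return sorted(h for h, pattern in CUES.items() if pattern.search(text))


sections = {
    "methods": "We enrolled 40 female patients. Samples were stored at 4 C.",
    "captions": "Figure 1. Tumor growth in mice.",
    "results": "Male controls differed.",
}
augmented = augment_abstract("Abstract text.", sections)
headings = candidate_headings(sections)
```

Matching any cue rather than requiring several favors recall over precision, which is in the spirit of the recall-oriented approach the abstract describes. Sections other than the methods and captions (here, `results`) are deliberately ignored.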
Year
2015
Venue
AMIA
Field
F1 score, Text mining, Information retrieval, Computer science, Indexer, Search engine indexing, MEDLINE, Recall
DocType
Conference
Volume
2015
Citations
0
PageRank
0.34
References
6
Authors
2
Name
Order
Citations
PageRank
Dina Demner Fushman11717147.70
James G. Mork264765.22