MEDLINE MeSH indexing: lessons learned from machine learning and future directions - Citegraph

Paper Info

Title
MEDLINE MeSH indexing: lessons learned from machine learning and future directions

Abstract
Due to the large yearly growth of MEDLINE, MeSH indexing is becoming a more difficult task for a relatively small group of highly qualified indexing staff at the US National Library of Medicine (NLM). The Medical Text Indexer (MTI) is a support tool for assisting indexers; this tool relies on MetaMap and a k-NN approach called PubMed Related Citations (PRC). Our motivation is to improve the quality of MTI based on machine learning. Typical machine learning approaches fit this indexing task into text categorization. In this work, we have studied some Medical Subject Headings (MeSH) recommended by MTI and analyzed the issues when using standard machine learning algorithms. We show that in some cases machine learning can improve the annotations already recommended by MTI, that machine learning based on low variance methods achieves better performance and that each MeSH heading presents a different behavior. In addition, there are several factors which make this task difficult (e.g. limited access to the full-text of the citations) which provide direction for future work.

Year	DOI	Venue
2012	10.1145/2110363.2110450	IHI
Keywords	Field	DocType
mesh indexing,typical machine,cases machine learning,medline mesh indexing,qualified indexing staff,machine learning,medical text indexer,standard machine,future direction,medical subject headings,indexing task,difficult task,indexation,subject headings	Information retrieval,Computer science,Indexer,Search engine indexing,Artificial intelligence,Natural language processing,Text categorization,MEDLINE,Machine learning	Conference
Citations	PageRank	References
12	0.61	12
Authors
5

Authors (5 rows)

Cited by (12 rows)

References (12 rows)

Name	Order	Citations	PageRank
Antonio Jimeno Yepes	1	40	7.59
James G. Mork	2	647	65.22
BartBomiej Wilkowski	3	12	0.61
Dina Demner Fushman	4	1717	147.70
Alan R. Aronson	5	2551	260.67

1