Title
Automated Assignment of Medical Subject Headings.
Abstract
citations published in 1997, with abstracts in English, were selected at random. The following methods ,of finding and ranking suitable MeSH descriptors have been investigated using this test collection: The Inquery Algorithm. This algorithm depends ,on parsing text into noun phrases, then using the Inquery search engine, develops ,an ordered ,list of UMLS Metathesaurus concepts for each citation, based on the noun phrases extracted from that text. A ranked list of concepts is developed for each phrase. Trigram Algorithm. A ,phrase ,is broken ,into overlapping,trigrams ,(three letters occurring ,in succession) for analysis. Candidate ,phrases ,are obtained from the title and abstract by examining,all maximal,contiguous sets of words ,that contain no punctuation or stop words (from a list of 310 common stop words). The trigrams are used to match,phrases inthe UMLS, with the maximal overlap of sets of trigrams resulting in the suggested UMLS concept. Restricting to MeSH. Once ,a UMLS concept has been identified, using the MetaMap method or the Trigram algorithm, the task becomes one of navigating within the UMLS to find the appropriate
Year
Venue
Keywords
1999
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION
noun phrase,bioinformatics,search engine,biomedical research,subject headings
Field
DocType
Issue
Noun phrase,Information retrieval,Ranking,Computer science,Trigram,Phrase,Natural language processing,Artificial intelligence,Parsing,Cluster analysis,Unified Medical Language System,Stop words
Conference
SUPnan
ISSN
Citations 
PageRank 
1067-5027
1
0.35
References 
Authors
4
8
Name
Order
Citations
PageRank
Stuart J. Nelson130640.05
Alan R. Aronson22551260.67
Tamas E. Doszkocs33110.44
W. John Wilbur442443.91
Olivier Bodenreider52715226.05
Florence Chang619324.51
James G. Mork764765.22
Alexa T. McCray829941.13