Abstract | ||
---|---|---|
citations published in 1997, with abstracts in English, were selected at random. The following methods ,of finding and ranking suitable MeSH descriptors have been investigated using this test collection: The Inquery Algorithm. This algorithm depends ,on parsing text into noun phrases, then using the Inquery search engine, develops ,an ordered ,list of UMLS Metathesaurus concepts for each citation, based on the noun phrases extracted from that text. A ranked list of concepts is developed for each phrase. Trigram Algorithm. A ,phrase ,is broken ,into overlapping,trigrams ,(three letters occurring ,in succession) for analysis. Candidate ,phrases ,are obtained from the title and abstract by examining,all maximal,contiguous sets of words ,that contain no punctuation or stop words (from a list of 310 common stop words). The trigrams are used to match,phrases inthe UMLS, with the maximal overlap of sets of trigrams resulting in the suggested UMLS concept. Restricting to MeSH. Once ,a UMLS concept has been identified, using the MetaMap method or the Trigram algorithm, the task becomes one of navigating within the UMLS to find the appropriate |
Year | Venue | Keywords |
---|---|---|
1999 | JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION | noun phrase,bioinformatics,search engine,biomedical research,subject headings |
Field | DocType | Issue |
Noun phrase,Information retrieval,Ranking,Computer science,Trigram,Phrase,Natural language processing,Artificial intelligence,Parsing,Cluster analysis,Unified Medical Language System,Stop words | Conference | SUPnan |
ISSN | Citations | PageRank |
1067-5027 | 1 | 0.35 |
References | Authors | |
4 | 8 |
Name | Order | Citations | PageRank |
---|---|---|---|
Stuart J. Nelson | 1 | 306 | 40.05 |
Alan R. Aronson | 2 | 2551 | 260.67 |
Tamas E. Doszkocs | 3 | 31 | 10.44 |
W. John Wilbur | 4 | 424 | 43.91 |
Olivier Bodenreider | 5 | 2715 | 226.05 |
Florence Chang | 6 | 193 | 24.51 |
James G. Mork | 7 | 647 | 65.22 |
Alexa T. McCray | 8 | 299 | 41.13 |