Abstract | ||
---|---|---|
Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1186/s12859-015-0539-7 | BMC Bioinformatics |
Keywords | Field | DocType |
microarrays,bioinformatics,algorithms,biomedical research | Noun phrase,Categorization,Argumentative,Information retrieval,Computer science,Citation,Feature engineering,Natural language processing,Bigram,Artificial intelligence,Syntax,Semantics | Journal |
Volume | Issue | ISSN |
16 | 1 | 1471-2105 |
Citations | PageRank | References |
7 | 0.49 | 40 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Antonio Jimeno-Yepes | 1 | 540 | 33.38 |
Laura Plaza | 2 | 212 | 17.36 |
Jorge Carrillo de Albornoz | 3 | 11 | 2.28 |
James G. Mork | 4 | 647 | 65.22 |
Alan R. Aronson | 5 | 2551 | 260.67 |