Title
Feature engineering for MEDLINE citation categorization with MeSH.
Abstract
Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations.
Year
DOI
Venue
2015
10.1186/s12859-015-0539-7
BMC Bioinformatics
Keywords
Field
DocType
microarrays,bioinformatics,algorithms,biomedical research
Noun phrase,Categorization,Argumentative,Information retrieval,Computer science,Citation,Feature engineering,Natural language processing,Bigram,Artificial intelligence,Syntax,Semantics
Journal
Volume
Issue
ISSN
16
1
1471-2105
Citations 
PageRank 
References 
7
0.49
40
Authors
5
Name
Order
Citations
PageRank
Antonio Jimeno-Yepes154033.38
Laura Plaza221217.36
Jorge Carrillo de Albornoz3112.28
James G. Mork464765.22
Alan R. Aronson52551260.67