Title
Classification Methods For Finding Articles Describing Protein-Protein Interactions In Pubmed
Abstract
With the rapid expansion in the number of published papers in the biomedical field, finding relevant articles has become a demanding task for researchers. This has led to increasing interest in the use of text mining tools that help search the literature and identify the most relevant documents or information. One specific topic of interest is related to the identification of articles that might be used for extracting protein-protein interactions. Using the BioCreative III Article Classification Task dataset, composed of PubMed abstracts classified as relevant or non-relevant for describing protein-protein interactions, we compare different classification methods with different sets of features. The best results - area under the interpolated precision-recall curve of 0.654 - indicate that the proposed classification strategy could be incorporated in the database curation workflows in order to prioritize articles for extraction of protein-protein interactions. Furthermore, we also analysed the use of this method for ranking documents resulting from general PubMed queries, and propose that this approach could be useful for general researchers looking for publications describing protein-protein interactions within a particular topic of interest.
Year
DOI
Venue
2011
10.2390/biecoll-jib-2011-178
JOURNAL OF INTEGRATIVE BIOINFORMATICS
DocType
Volume
Issue
Journal
8
3
ISSN
Citations 
PageRank 
1613-4516
2
0.36
References 
Authors
5
2
Name
Order
Citations
PageRank
Sérgio Matos141529.51
José Luis Oliveira276084.03