Title
Extracting protein sub-cellular localizations from literature
Abstract
Protein Sub-cellular Localization (PSL) prediction is an important task for predicting protein functions. Because the sequence-based approach used in the most previous work has focused on prediction of locations for given proteins, it failed to provide useful information for the cases in which single proteins are localized, depending on their states in progress, in several different sub-cellular locations. While it is difficult for the sequence-based approach, it can be tackled by the text-based approach. The proposed approach extracts PSL from literature using Natural Language Processing techniques. We conducted experiments to see how our system performs in identification of evidence sentences and what linguistic features from sentences significantly contribute to the task. This article presents a text-based novel approach to extract PSL relations with their evidence sentences. Evidence sentences will provide indispensable pieces of information that the sequence-based approach cannot supply.
Year
DOI
Venue
2010
10.1007/978-3-642-15470-6_39
AMT
Keywords
Field
DocType
important task,extracting protein sub-cellular localization,sequence-based approach,psl relation,text-based novel approach,protein sub-cellular localization,natural language processing technique,text-based approach,useful information,evidence sentence,system performance,natural language processing
Data mining,Computer science,Natural language processing,Artificial intelligence,Syntactic category,Parsing
Conference
Volume
ISSN
ISBN
6335.0
0302-9743
3-642-15469-7
Citations 
PageRank 
References 
1
0.36
5
Authors
4
Name
Order
Citations
PageRank
Hong-woo Chun116812.86
Jin-Dong Kim2170592.21
Yunsoo Choi3237.64
Won-Kyung Sung414519.93