Extraction and Grounding of Protein Mutations via Semantic Integration of Text and Sequence Information - Citegraph

Paper Info

Title
Extraction and Grounding of Protein Mutations via Semantic Integration of Text and Sequence Information

Abstract
Rich information on mutations and their impacts is scattered across the scientific literature. Reusing mutation impact annotations requires, as a critical step, grounding mutations to the correct positions on sequences extracted from protein databases. This paper presents a generic method for grounding textual mentions of mutation entities to protein sequences, based on an OWL-DL ontology-driven workflow that integrates text and sequence information in a semantically consistent way. Mutation mentions mined from texts are iteratively mapped onto candidate proteins, and an ontology mining algorithm facilitates their correct grounding to a protein sequence. Using a gold standard corpus of full-text articles and corresponding protein sequences, we show that the proposed method is promising compared to existing approaches.
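The grounding step described in the abstract can be illustrated with a minimal sketch. This is not the paper's OWL-DL workflow or its ontology mining algorithm; it is a simplified stand-in that assumes mutations are written in wild-type/position/mutant form (for example "A2T") and that grounding means finding the candidate sequence, and numbering offset, under which the most wild-type residues agree with the sequence. The names MUTATION_RE, extract_mutations, grounding_score, and ground, and the max_offset parameter, are hypothetical and chosen only for illustration.

```python
import re

# Assumed notation: single-letter wild-type residue, 1-based position, mutant residue.
AMINO = "ACDEFGHIKLMNPQRSTVWY"
MUTATION_RE = re.compile(rf"\b([{AMINO}])(\d+)([{AMINO}])\b")


def extract_mutations(text):
    """Return unique (wild_type, position, mutant) tuples, sorted by position."""
    found = {(w, int(p), m) for w, p, m in MUTATION_RE.findall(text)}
    return sorted(found, key=lambda t: (t[1], t[0], t[2]))


def grounding_score(mutations, sequence, offset=0):
    """Count mutations whose wild-type residue matches the sequence
    at (position + offset), using 1-based positions."""
    hits = 0
    for wild, pos, _mutant in mutations:
        idx = pos + offset - 1
        if 0 <= idx < len(sequence) and sequence[idx] == wild:
            hits += 1
    return hits


def ground(mutations, candidates, max_offset=5):
    """Try every candidate protein and numbering offset; return the
    (protein_id, offset, score) with the highest number of matches.
    Ties keep the first combination encountered."""
    best = (None, 0, -1)
    for protein_id, sequence in candidates.items():
        for offset in range(-max_offset, max_offset + 1):
            score = grounding_score(mutations, sequence, offset)
            if score > best[2]:
                best = (protein_id, offset, score)
    return best


if __name__ == "__main__":
    mentions = extract_mutations("The mutants A2T and K5R reduced activity.")
    # Toy sequences, not real database entries.
    print(ground(mentions, {"P1": "MATGKL", "P2": "MGGGGG"}))
```

Scanning over offsets is one simple way to tolerate numbering differences between an article and a database sequence (for instance, a cleaved signal peptide); whether and how the paper handles such shifts is not stated in the abstract.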

Year	DOI	Venue
2011	10.1109/AINA.2011.112	Advanced Information Networking and Applications
Keywords	Field	DocType
biology computing,data mining,knowledge representation languages,text analysis,OWL-DL ontology,ontology mining algorithm,protein databases,protein mutations grounding,semantic integration,sequence information,text information,Mutation Extraction,Mutation Grounding,Ontologies,Sequence Analysis,Text Mining	Ontology (information science),Semantic integration,Ontology,Text mining,Information retrieval,Protein sequencing,Computer science,Workflow,Protein Databases,Sequence analysis	Conference
ISSN	ISBN	Citations
1550-445X	978-0-7695-4337-6 (E-ISBN)	0
PageRank	References	Authors
0.34	13	2

Authors (2 rows)

Name	Order	Citations	PageRank
Christopher J. O. Baker	1	329	30.96
K Rajaraman	2	380	31.94
