Improving the inter-corpora compatibility for protein annotations. - Citegraph

Paper Info

Title
Improving the inter-corpora compatibility for protein annotations.

Abstract
Although there are several corpora with protein annotation, incompatibility between the annotations in different corpora remains a problem that hinders the progress of automatic recognition of protein names in biomedical literature. Here, we report on our efforts to find a solution to the incompatibility issue, and to improve the compatibility between two representative protein-annotated corpora: the GENIA corpus and the GENETAG corpus. In a comparative study, we improve our insight into the two corpora, and a series of experimental results show that most of the incompatibility can be removed.

Year	DOI	Venue
2010	10.1142/S0219720010004999	J. Bioinformatics and Computational Biology
Keywords	Field	DocType
corpus	Information retrieval,Compatibility (mechanics),Biology,Artificial intelligence,Natural language processing,Protein Annotation,Bioinformatics,Named-entity recognition	Journal
Volume	Issue	ISSN
8	5	1757-6334
Citations	PageRank	References
4	0.48	15
Authors
6

Authors (6 rows)

Cited by (4 rows)

References (15 rows)

Name	Order	Citations	PageRank
Yue Wang	1	4	0.48
Jin-Dong Kim	2	1705	92.21
Rune Sætre	3	560	28.49
Sampo Pyysalo	4	1941	100.14
Tomoko Ohta	5	1794	93.54
Jun-ichi Tsujii	6	1973	219.85

1