Title
GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes.
Abstract
Background: The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-specific probability (P-score) measures of confidence. Term-specific probabilities are a novel feature of GOtcha and allow the identification of conflicts or uncertainty in annotation. Results: The GOtcha method was applied to the recently sequenced genome for Plasmodium falciparum and six other genomes. GOtcha was compared quantitatively for retrieval of assigned GO terms against direct transitive assignment from the highest scoring annotated BLAST search hit (TOPBLAST). GOtcha exploits information deep into the 'twilight zone' of similarity search matches, making use of much information that is otherwise discarded by more simplistic approaches. At a P-score cutoff of 50%, GOtcha provided 60% better recovery of annotation terms and 20% higher selectivity than annotation with TOPBLAST at an E-value cutoff of 10-4. Conclusions: The GOtcha method is a useful tool for genome annotators. It has identified both errors and omissions in the original Plasmodium falciparum annotation and is being adopted by many other genome sequencing projects.
Year
DOI
Venue
2004
10.1186/1471-2105-5-178
BMC Bioinformatics
Keywords
Field
DocType
similarity search,genome,microarrays,bioinformatics,algorithms,predictive value of tests,genome sequence,genome annotation,production function,proteins
Genome,Annotation,Biology,Gene ontology,Directed acyclic graph,Critical Assessment of Function Annotation,Protein function,Bioinformatics,Genetics,DNA microarray,Transitive relation
Journal
Volume
Issue
ISSN
5
1
1471-2105
Citations 
PageRank 
References 
91
4.66
16
Authors
3
Name
Order
Citations
PageRank
David M A Martin137826.55
Matthew Berriman241634.14
Geoffrey J. Barton3103984.08