Proteinortho: detection of (co-)orthologs in large-scale analysis. - Citegraph

Paper Info

Title
Proteinortho: detection of (co-)orthologs in large-scale analysis.

Abstract
Orthology analysis is an important part of data analysis in many areas of bioinformatics such as comparative genomics and molecular phylogenetics. The ever-increasing flood of sequence data, and hence the rapidly increasing number of genomes that can be compared simultaneously, calls for efficient software tools as brute-force approaches with quadratic memory requirements become infeasible in practise. The rapid pace at which new data become available, furthermore, makes it desirable to compute genome-wide orthology relations for a given dataset rather than relying on relations listed in databases.The program Proteinortho described here is a stand-alone tool that is geared towards large datasets and makes use of distributed computing techniques when run on multi-core hardware. It implements an extended version of the reciprocal best alignment heuristic. We apply Proteinortho to compute orthologous proteins in the complete set of all 717 eubacterial genomes available at NCBI at the beginning of 2009. We identified thirty proteins present in 99% of all bacterial proteomes.Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.

Year	DOI	Venue
2011	10.1186/1471-2105-12-124	BMC Bioinformatics
Keywords	Field	DocType
phylogeny,genomics,data analysis,distributed computing,bioinformatics,comparative genomics,algorithms,microarrays,sequence alignment,molecular phylogenetics	Scale analysis (statistics),Data mining,Heuristic,Biology,Genomics,Comparative genomics,Algebraic connectivity,Software,Data sequences,Bioinformatics,Multi-core processor	Journal
Volume	Issue	ISSN
12	1	1471-2105
Citations	PageRank	References
30	0.99	12
Authors
6

Authors (6 rows)

Cited by (30 rows)

References (12 rows)

Name	Order	Citations	PageRank
Marcus Lechner	1	43	2.31
Sven Findeiß	2	77	4.67
Lydia Steiner	3	30	0.99
Manja Marz	4	55	8.41
Peter F. Stadler	5	1839	152.96
Sonja J. Prohaska	6	145	11.56

1