Title
Compressing proteomes: the relevance of medium range correlations.
Abstract
We study the nonrandomness of proteome sequences by analysing the correlations that arise between amino acids at a short and medium range, more specifically, between amino acids located 10 or 100 residues apart; respectively. We show that statistical models that consider these two types of correlation are more likely to seize the information contained in protein sequences and thus achieve good compression rates. Finally, we propose that the cause for this redundancy is related to the evolutionary origin of proteomes and protein sequences.
Year
DOI
Venue
2007
10.1155/2007/60723
EURASIP J. Bioinformatics and Systems Biology
Keywords
DocType
Volume
evolutionary origin,medium range,compressing proteomes,medium range correlation,good compression rate,amino acid,protein sequence,proteome sequence,statistical model
Journal
2007,
Issue
ISSN
Citations 
1
1687-4145
3
PageRank 
References 
Authors
0.38
9
3
Name
Order
Citations
PageRank
Dario Benedetto1101.24
Emanuele Caglioti2889.18
Claudia Chica327311.83