Title
Is there codon usage bias for poly-Q stretches in the human proteome?
Abstract
We have analyzed codon usage for poly-Q stretches of different lengths for the human proteome. First, we have obtained that all long poly-Q stretches in Protein Data Bank (PDB) belong to the disordered regions. Second, we have found the bias for codon usage for glutamine homorepeats in the human proteome. In the cases when the same codon is used for poly-Q stretches only CAG triplets are found. Similar results are obtained for human proteins with glutamine homo-repeats associated with diseases. Moreover, for proteins associated with diseases (from the HraDis database), the fraction of proteins for which the same codon is used for glutamine homorepeats is less (22%) than for proteins from the human proteome (26%). We have demonstrated for poly-Q stretches in the human proteome that in some cases (28) the splicing sites correspond to the homo-repeats and in 11 cases, these sites appear at the C-terminal part of the homorepeats with statistical significance 10(-8).
Year
DOI
Venue
2019
10.1142/S0219720019500100
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
Homo-repeat,codon usage,proteome,splicing site,disease
Human proteome project,Biology,Proteome,Bioinformatics,Computational biology,Protein Data Bank,Protein Data Bank (RCSB PDB),Codon usage bias
Journal
Volume
Issue
ISSN
17
SP1
0219-7200
Citations 
PageRank 
References 
0
0.34
1
Authors
4
Name
Order
Citations
PageRank
Oxana V. Galzitskaya112520.15
Georgii S Novikov200.34
Nikita V. Dovidchenko322.17
Mikhail Yu. Lobanov400.68