Title
Distribution of N-glycosylation sequons in proteins: How apart are they?
Abstract
N-glycosylation is a common protein modification process, which affects a number of properties of proteins. Little is known about the distribution of N-glycosylation sequons, for example, the distance between glycosylated sites and their position in the protein primary sequence. Using a large set of experimentally confirmed eukaryotic N-glycoproteins we analyzed the relative position and distribution of sequons. N-Glycosylation probability was found to be lower in the termini of protein sequences compared to the mid region. N-glycosylated sequons were found much farther from C terminus compared to the N-terminus of the protein sequence and this effect was more pronounced for NXS sequons. The distribution of sequons, modeled based on balls-in-boxes classical occupancy, showed a near-maximum probability. Considerable proportion of sequons was found within a distance of ten amino acids, indicating that the steric hindrance was not a key factor in protein N-glycosylation. Interestingly, the distribution of all sequons present in N-glycoproteins showed a pattern very similar to that of glycosylated sequons. The results indicate that protein N-glycosylation chiefly follows a random design.
Year
DOI
Venue
2011
10.1016/j.compbiolchem.2011.01.002
Computational Biology and Chemistry
Keywords
Field
DocType
Distribution,N-glycoprotein,Probability,Randomness,Sequon
Protein sequencing,Biology,Amino acid,Biochemistry,Protein modification process,Steric effects,N-linked glycosylation,Bioinformatics,Genetics,C-terminus,Sequon
Journal
Volume
Issue
ISSN
35
2
1476-9271
Citations 
PageRank 
References 
0
0.34
1
Authors
3
Name
Order
Citations
PageRank
R. Shyama Prasad Rao111.04
Ole Thomsen Buus211.04
Bernd Wollenweber3201.08