Title
Viral proteins acquired from a host converge to simplified domain architectures.
Abstract
The infection cycle of viruses creates many opportunities for the exchange of genetic material with the host. Many viruses integrate their sequences into the genome of their host for replication. These processes may lead to the virus acquisition of host sequences. Such sequences are prone to accumulation of mutations and deletions. However, in rare instances, sequences acquired from a host become beneficial for the virus. We searched for unexpected sequence similarity among the 900,000 viral proteins and all proteins from cellular organisms. Here, we focus on viruses that infect metazoa. The high-conservation analysis yielded 187 instances of highly similar viral-host sequences. Only a small number of them represent viruses that hijacked host sequences. The low-conservation sequence analysis utilizes the Pfam family collection. About 5% of the 12,000 statistical models archived in Pfam are composed of viral-metazoan proteins. In about half of Pfam families, we provide indirect support for the directionality from the host to the virus. The other families are either wrongly annotated or reflect an extensive sequence exchange between the viruses and their hosts. In about 75% of cross-taxa Pfam families, the viral proteins are significantly shorter than their metazoan counterparts. The tendency for shorter viral proteins relative to their related host proteins accounts for the acquisition of only a fragment of the host gene, the elimination of an internal domain and shortening of the linkers between domains. We conclude that, along viral evolution, the host-originated sequences accommodate simplified domain compositions. We postulate that the trimmed proteins act by interfering with the fundamental function of the host including intracellular signaling, post-translational modification, protein-protein interaction networks and cellular trafficking. We compiled a collection of hijacked protein sequences. These sequences are attractive targets for manipulation of viral infection.
Year
DOI
Venue
2012
10.1371/journal.pcbi.1002364
PLOS COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
conserved sequence,amino acid sequence,protein sequence,post translational modification,statistical model,sequence alignment,cluster analysis,genetics,sequence analysis
Sequence alignment,Genome,Conserved sequence,Protein domain,Gene,Viral evolution,Biology,Genome evolution,Bioinformatics,Genetics,Sequence analysis
Journal
Volume
Issue
ISSN
8
2
1553-7358
Citations 
PageRank 
References 
1
0.35
7
Authors
2
Name
Order
Citations
PageRank
Nadav Rappoport1213.08
Michal Linial21502149.92