Title
Learning virulent proteins from integrated query networks.
Abstract
Methods of weakening and attenuating pathogens' abilities to infect and propagate in a host, thus allowing the natural immune system to more easily decimate invaders, have gained attention as alternatives to broad-spectrum targeting approaches. The following work describes a technique to identifying proteins involved in virulence by relying on latent information computationally gathered across biological repositories, applicable to both generic and specific virulence categories.A lightweight method for data integration is used, which links information regarding a protein via a path-based query graph. A method of weighting is then applied to query graphs that can serve as input to various statistical classification methods for discrimination, and the combined usage of both data integration and learning methods are tested against the problem of both generalized and specific virulence function prediction.This approach improves coverage of functional data over a protein. Moreover, while depending largely on noisy and potentially non-curated data from public sources, we find it outperforms other techniques to identification of general virulence factors and baseline remote homology detection methods for specific virulence categories.
Year
DOI
Venue
2012
10.1186/1471-2105-13-321
BMC Bioinformatics
Keywords
Field
DocType
bioinformatics,microarrays,algorithms
Biology,Support vector machine,Natural immune system,Bioinformatics,Genetics,Virulence,DNA microarray
Journal
Volume
Issue
ISSN
13
321
1471-2105
Citations 
PageRank 
References 
4
0.34
33
Authors
3
Name
Order
Citations
PageRank
Eithon Cadag123513.60
P Tarczy-Hornoch255353.80
Peter J. Myler3584.78