Title
ATHENA: A knowledge-based hybrid backpropagation-grammatical evolution neural network algorithm for discovering epistasis among quantitative trait Loci.
Abstract
BACKGROUND: Growing interest and burgeoning technology for discovering genetic mechanisms that influence disease processes have ushered in a flood of genetic association studies over the last decade, yet little heritability in highly studied complex traits has been explained by genetic variation. Non-additive gene-gene interactions, which are not often explored, are thought to be one source of this "missing" heritability. METHODS: Stochastic methods employing evolutionary algorithms have demonstrated promise in being able to detect and model gene-gene and gene-environment interactions that influence human traits. Here we demonstrate modifications to a neural network algorithm in ATHENA (the Analysis Tool for Heritable and Environmental Network Associations) resulting in clear performance improvements for discovering gene-gene interactions that influence human traits. We employed an alternative tree-based crossover, backpropagation for locally fitting neural network weights, and incorporation of domain knowledge obtainable from publicly accessible biological databases for initializing the search for gene-gene interactions. We tested these modifications in silico using simulated datasets. RESULTS: We show that the alternative tree-based crossover modification resulted in a modest increase in the sensitivity of the ATHENA algorithm for discovering gene-gene interactions. The performance increase was highly statistically significant when backpropagation was used to locally fit NN weights. We also demonstrate that using domain knowledge to initialize the search for gene-gene interactions results in a large performance increase, especially when the search space is larger than the search coverage. CONCLUSIONS: We show that a hybrid optimization procedure, alternative crossover strategies, and incorporation of domain knowledge from publicly available biological databases can result in marked increases in sensitivity and performance of the ATHENA algorithm for detecting and modelling gene-gene interactions that influence a complex human trait.
Year
DOI
Venue
2010
10.1186/1756-0381-3-5
BioData mining
Keywords
Field
DocType
backpropagation,search space,biomedical research,quantitative trait loci,bioinformatics,biological database,statistical significance,evolutionary algorithm,genetic association,grammatical evolution,gene environment interaction,domain knowledge,genetic variation,neural network,genetics,knowledge base
Heritability,Data mining,Quantitative trait locus,Biology,Domain knowledge,Epistasis,Genetic variation,Genetic association,Bioinformatics,Backpropagation,Grammatical evolution
Journal
Volume
Issue
ISSN
3
1
1756-0381
Citations 
PageRank 
References 
11
0.92
18
Authors
3
Name
Order
Citations
PageRank
Stephen D. Turner1556.84
Scott M. Dudek220626.27
Marylyn D. Ritchie369286.79