Title
Association Rule Discovery Has the Ability to Model Complex Genetic Effects
Abstract
Dramatic advances in genotyping technology have established a need for fast, flexible analysis methods for genetic association studies. Common complex diseases, such as Parkinson's disease or multiple sclerosis, are thought to involve an interplay of multiple genes working either independently or together to influence disease risk. Also, multiple underlying traits, each its own genetic basis may be defined together as a single disease. These effects - trait heterogeneity, locus heterogeneity, and gene-gene interactions (epistasis) - contribute to the complex architecture of common genetic diseases. Association Rule Discovery (ARD) searches for frequent itemsets to identify rule-based patterns in large scale data. In this study, we apply Apriori (an ARD algorithm) to simulated genetic data with varying degrees of complexity. Apriori using information difference to prior as a rule measure shows good power to detect functional effects in simulated cases of simple trait heterogeneity, trait heterogeneity and epistasis, and moderate power in cases of trait heterogeneity and locus heterogeneity. Also, we illustrate that bootstrapping the rule induction process does not considerably improve the power to detect these effects. These results show that ARD is a framework with sufficient flexibility to characterize complex genetic effects.
Year
DOI
Venue
2007
10.1109/CIDM.2007.368934
CIDM
Field
DocType
Volume
Data mining,Epistasis,Computer science,Trait,Bootstrapping,Genetic association,Artificial intelligence,Computational biology,Disease,Genotyping,Rule induction,Machine learning,Locus heterogeneity
Conference
2007
Citations 
PageRank 
References 
1
0.35
8
Authors
3
Name
Order
Citations
PageRank
William S. Bush116118.45
Tricia A. Thornton-wells27011.17
Marylyn D. Ritchie369286.79