Title
Complexity control in a mixture model by the Hardy-Weinberg equilibrium
Abstract
A method of complexity control in multinomial mixture modeling of multiple-marker genotype data, imposing the Hardy-Weinberg equilibrium (HWE) between the genotype values, is studied. This is a very natural restriction, and known to hold at population level under modest assumptions. The hypothesis under study is that imposing this restriction will prevent overfitting and lead to a better model. This is shown to indeed be case. Experimental results on chromosomes 1 and 17 of the HapMap data demonstrate that the restricted model generalizes better to unseen data, and also finds clusters that correspond better to the ethnic groups of the HapMap, when compared with a model without the HWE restriction.
Year
DOI
Venue
2009
10.1016/j.csda.2008.07.023
Computational Statistics & Data Analysis
Keywords
Field
DocType
hardy-weinberg equilibrium,natural restriction,hapmap data,complexity control,genotype value,restricted model,multiple-marker genotype data,mixture model,unseen data,better model,hwe restriction,hardy weinberg equilibrium,ethnic group
Econometrics,Population,Hardy–Weinberg principle,Mixture modeling,International HapMap Project,Multinomial distribution,Overfitting,Statistics,Mixture model,Mathematics
Journal
Volume
Issue
ISSN
53
5
Computational Statistics and Data Analysis
Citations 
PageRank 
References 
0
0.34
11
Authors
2
Name
Order
Citations
PageRank
Ella Bingham191758.70
Heikki Mannila265951495.69