Title
Statistical analysis for aggregated count data in genetic association studies
Abstract
AbstractIn smoking behaviour studies, Cigarette Counts Per Day CPD are aggregated such as 0, one pack, two packs, etc. Analysis of such count data is a challenge, owing to its reporting bias and difficulty in estimating its appropriate distribution. In this study, we set forth to identify genetic variants, such as Single Nucleotide Polymorphisms SNPs, that correlate with aggregated count data, such as CPD. We first reviewed the existing approaches, in which the aggregated count data is a dependent variable and the SNP is an ordinal independent variable. We then considered a calibration model in which the SNP is the ordinal dependent variable and the aggregated count data is the independent variable. This calibration modelling approach becomes robust to accommodate distributional assumptions of count data. We applied our robust calibration modelling approach to CPD data from the Korean Association Resource project data of 4183 male samples. Through simulation studies, we investigated the performance of the proposed method for comparison to other competing approaches.
Year
DOI
Venue
2016
10.1504/IJDMB.2016.079802
Periodicals
Keywords
Field
DocType
self-reported studies, association studies, CPD, SNP, calibration model
Computer science,Ordinal number,Genetic association,Count data,Single-nucleotide polymorphism,Variables,Bioinformatics,Statistics,SNP,Reporting bias,Statistical analysis
Journal
Volume
Issue
ISSN
16
1
1748-5673
Citations 
PageRank 
References 
0
0.34
0
Authors
3
Name
Order
Citations
PageRank
Haewon Choi100.68
Hye-Young Jung2133.96
Taesung Park349064.41