Statistical analysis for aggregated count data in genetic association studies - Citegraph

Paper Info

Title
Statistical analysis for aggregated count data in genetic association studies

Abstract
AbstractIn smoking behaviour studies, Cigarette Counts Per Day CPD are aggregated such as 0, one pack, two packs, etc. Analysis of such count data is a challenge, owing to its reporting bias and difficulty in estimating its appropriate distribution. In this study, we set forth to identify genetic variants, such as Single Nucleotide Polymorphisms SNPs, that correlate with aggregated count data, such as CPD. We first reviewed the existing approaches, in which the aggregated count data is a dependent variable and the SNP is an ordinal independent variable. We then considered a calibration model in which the SNP is the ordinal dependent variable and the aggregated count data is the independent variable. This calibration modelling approach becomes robust to accommodate distributional assumptions of count data. We applied our robust calibration modelling approach to CPD data from the Korean Association Resource project data of 4183 male samples. Through simulation studies, we investigated the performance of the proposed method for comparison to other competing approaches.

Year	DOI	Venue
2016	10.1504/IJDMB.2016.079802	Periodicals
Keywords	Field	DocType
self-reported studies, association studies, CPD, SNP, calibration model	Computer science,Ordinal number,Genetic association,Count data,Single-nucleotide polymorphism,Variables,Bioinformatics,Statistics,SNP,Reporting bias,Statistical analysis	Journal
Volume	Issue	ISSN
16	1	1748-5673
Citations	PageRank	References
0	0.34	0
Authors
3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Haewon Choi	1	0	0.68
Hye-Young Jung	2	13	3.96
Taesung Park	3	490	64.41

1