Title
A novel approach to DNA copy number data segmentation.
Abstract
DNA copy number (DCN) is the number of copies of DNA at a region of a genome. The alterations of DCN are highly associated with the development of different tumors. Recently, microarray technologies are being employed to detect DCN changes at many loci at the same time in tumor samples. The resulting DCN data are often very noisy, and the tumor sample is often contaminated by normal cells. The goal of computational analysis of array-based DCN data is to infer the underlying DCNs from raw DCN data. Previous methods for this task do not model the tumor/normal cell mixture ratio explicitly and they cannot output segments with DCN annotations. We developed a novel model-based method using the minimum description length (MDL) principle for DCN data segmentation. Our new method can output underlying DCN for each chromosomal segment, and at the same time, infer the underlying tumor proportion in the test samples. Empirical results show that our method achieves better accuracies on average as compared to three previous methods, namely Circular Binary Segmentation, Hidden Markov Model and Ultrasome.
Year
DOI
Venue
2011
10.1142/S0219720011005343
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY
Keywords
Field
DocType
MDL,model-based,DCN data,segmentation
Binary segmentation,Data segment,Computer science,Segmentation,Minimum description length,Markov chain,DNA,Bioinformatics,Hidden Markov model,Computational analysis
Journal
Volume
Issue
ISSN
9
1
0219-7200
Citations 
PageRank 
References 
0
0.34
13
Authors
4
Name
Order
Citations
PageRank
Siling Wang1142.83
Yuhang Wang215916.49
Yang Xie300.34
Guanghua Xiao4569.63