Title
Examining the sublineage structure of Mycobacterium tuberculosis complex strains with multiple-biomarker tensors
Abstract
Strains of the Mycobacterium tuberculosis complex (MTBC) can be classified into coherent lineages with similar traits based on their genotype. We present a tensor clustering framework to group MTBC strains into sublineages of the known major lineages based on two biomarkers: spacer oligonucleotide type (spoligotype) and mycobacterial interspersed repetitive units (MIRU). Using multiple-biomarker tensors, we represent the genotype information of MTBC strains in a high-dimensional array that captures spoligotype, MIRU, and their coexistence. We use multiway models to transform this multidimensional data about the MTBC strains into two-dimensional arrays, and we use the resulting score vectors in a stable partitive clustering algorithm to classify MTBC strains into sublineages. We validate the clusterings using cluster stability and accuracy measures, and determine the stability of each cluster. Based on the validated clustering results, we present a sublineage structure of MTBC strains and compare it to the sublineage structures of SpolDB4 and MIRU-VNTRplus.
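The pipeline summarized in the abstract (build a multiple-biomarker tensor, reduce it to two-dimensional score vectors, cluster the scores) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the authors' implementation: the data are synthetic, the tensor is formed as a per-strain outer product of a binary spoligotype vector and a MIRU repeat-count vector, a mode-1 unfolding followed by a truncated SVD stands in for the multiway models, and a plain k-means with farthest-point initialization stands in for the stable partitive clustering algorithm. The function names `build_tensor`, `unfold_scores`, and `kmeans` are hypothetical.

```python
import numpy as np

def build_tensor(spoligo, miru):
    """Strains x spacers x loci tensor: entry (i, j, k) = spoligo[i, j] * miru[i, k].
    Assumed encoding of spoligotype/MIRU coexistence, for illustration only."""
    return np.einsum("ij,ik->ijk", spoligo, miru)

def unfold_scores(tensor, n_components):
    """Unfold along the strain mode, center the columns, and return the leading
    SVD score vectors (a stand-in for multiway model scores)."""
    n_strains = tensor.shape[0]
    X = tensor.reshape(n_strains, -1).astype(float)
    X -= X.mean(axis=0)
    U, s, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :n_components] * s[:n_components]

def kmeans(scores, k, n_iter=50):
    """Plain k-means with deterministic farthest-point initialization."""
    centers = [scores[0]]
    for _ in range(1, k):
        # Squared distance of every point to its nearest chosen center.
        d = np.min([((scores - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(scores[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        dist = ((scores[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        new_centers = np.array([
            scores[labels == c].mean(axis=0) if np.any(labels == c) else centers[c]
            for c in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

# Synthetic data: 6 strains, 43 spoligotype spacers, 12 MIRU loci.
spoligo = np.zeros((6, 43), dtype=int)
spoligo[:3, :20] = 1          # group A keeps the first 20 spacers
spoligo[3:, 20:] = 1          # group B keeps the last 23 spacers
spoligo[1, 5] = 0             # small within-group variation
miru = np.full((6, 12), 2)    # group A: 2 repeats at every locus
miru[3:] = 5                  # group B: 5 repeats at every locus
miru[4, 0] = 4                # small within-group variation

tensor = build_tensor(spoligo, miru)
scores = unfold_scores(tensor, n_components=2)
labels = kmeans(scores, k=2)
print(labels)
```

On this toy input the first three and the last three strains fall into separate clusters. A faithful reproduction would replace the SVD step with a genuine multiway decomposition and add the stability and accuracy validation the abstract describes.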
Year
2010
DOI
10.1109/BIBM.2010.5706625
Venue
Bioinformatics and Biomedicine
Keywords
bioinformatics, cellular biophysics, data analysis, genetics, genomics, information analysis, microorganisms, molecular biophysics, genotype information, multidimensional data, multiple-biomarker tensors, mycobacterial interspersed repetitive units, mycobacterium tuberculosis complex strains, spacer oligonucleotide, spoligotype, sublineage structure, tensor clustering framework, Mycobacterium tuberculosis complex, Tuberculosis, cluster validation, clustering, multiway models
Field
Genotype, Cellular biophysics, Biology, Genomics, Mycobacterium tuberculosis complex, Bioinformatics, Cluster analysis
DocType
Conference
ISSN
2156-1125
ISBN
978-1-4244-8307-5
Citations
0
PageRank
0.34
References
6
Authors
4
Name              Order   Citations   PageRank
Ozcaglar, C.      1       0           0.34
Shabbeer, A.      2       0           0.34
Vandenberg, S.    3       0           0.34
Bülent Yener      4       1075        94.51