Title
Nonlinear Dimensionality Reduction for Cluster Identification in Metagenomic Samples
Abstract
We investigate the potential of modern nonlinear dimensionality reduction techniques for an interactive cluster detection in bioinformatics applications. We demonstrate that recent non-parametric techniques such as t-distributed stochastic neighbor embedding (t-SNE) allow a cluster identification which is superior to direct clustering of the original data or cluster detection based on classical parametric dimensionality reduction approaches. Non-parametric approaches, however, display quadratic complexity which makes them unsuitable in interactive devices. As speedup, we propose kernel-t-SNE as a fast parametric counterpart based on t-SNE.
Year
DOI
Venue
2013
10.1109/IV.2013.22
Information Visualisation
Keywords
Field
DocType
bioinformatics,computational complexity,data analysis,genomics,nonparametric statistics,pattern clustering,stochastic processes,bioinformatics,cluster identification,data analysis,direct clustering,interactive cluster detection,kernel-t-SNE,metagenomic samples,next generation sequencing,nonlinear dimensionality reduction,nonparametric techniques,quadratic complexity,t-distributed stochastic neighbor embedding,Metagenomics,NGS data,clustering,kernel mapping,nonlinear dimensionality reduction,t-SNE
Data mining,Dimensionality reduction,Embedding,Nonparametric statistics,Parametric statistics,Cluster analysis,Nonlinear dimensionality reduction,Mathematics,Computational complexity theory,Speedup
Conference
ISSN
Citations 
PageRank 
1550-6037
6
0.47
References 
Authors
4
4
Name
Order
Citations
PageRank
Andrej Gisbrecht119515.60
Barbara Hammer22383181.34
Bassam Mokbel318914.73
Alexander Sczyrba460.47