Abstract | ||
---|---|---|
We investigate the potential of modern nonlinear dimensionality reduction techniques for an interactive cluster detection in bioinformatics applications. We demonstrate that recent non-parametric techniques such as t-distributed stochastic neighbor embedding (t-SNE) allow a cluster identification which is superior to direct clustering of the original data or cluster detection based on classical parametric dimensionality reduction approaches. Non-parametric approaches, however, display quadratic complexity which makes them unsuitable in interactive devices. As speedup, we propose kernel-t-SNE as a fast parametric counterpart based on t-SNE. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/IV.2013.22 | Information Visualisation |
Keywords | Field | DocType |
bioinformatics,computational complexity,data analysis,genomics,nonparametric statistics,pattern clustering,stochastic processes,bioinformatics,cluster identification,data analysis,direct clustering,interactive cluster detection,kernel-t-SNE,metagenomic samples,next generation sequencing,nonlinear dimensionality reduction,nonparametric techniques,quadratic complexity,t-distributed stochastic neighbor embedding,Metagenomics,NGS data,clustering,kernel mapping,nonlinear dimensionality reduction,t-SNE | Data mining,Dimensionality reduction,Embedding,Nonparametric statistics,Parametric statistics,Cluster analysis,Nonlinear dimensionality reduction,Mathematics,Computational complexity theory,Speedup | Conference |
ISSN | Citations | PageRank |
1550-6037 | 6 | 0.47 |
References | Authors | |
4 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Andrej Gisbrecht | 1 | 195 | 15.60 |
Barbara Hammer | 2 | 2383 | 181.34 |
Bassam Mokbel | 3 | 189 | 14.73 |
Alexander Sczyrba | 4 | 6 | 0.47 |