Abstract | ||
---|---|---|
Mass cytometry is a new high-throughput technology that is becoming a cornerstone in immunology and cell biology research. With technological advancement, the number of cellular characteristics cytometry can simultaneously quantify grows, making analysis increasingly computationally onerous. In this paper, we investigate the potential of dimensionality reduction techniques to ease computational burden in clustering cytometry data whilst minimally diminishing clustering performance. We explore 3 such techniques: Principal Component Analysis (PCA), Autoencoders (AE) and Uniform Manifold Approximation and Projection (UMAP). Thereafter we employ a recent clustering algorithm, ChronoClust, which clusters data at each time-point into cell populations and explicitly tracks them over time. We evaluate this approach through a 14-dimensional cytometry dataset describing the immune response to West Nile Virus over 8 days in mice. To obtain a broad sample of clustering performance, each of the four datasets (unreduced, PCA-, AE- and UMAP-reduced) is independently clustered 400 times, using 400 unique ChronoClust parameter value sets. We find that PCA and AE can reduce the computational expense whilst incurring a minimal degradation in clustering and cluster tracking performance. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1007/978-3-030-30490-4_50 | ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV |
Keywords | DocType | Volume |
Autoencoder, Clustering, Cytometry, Dimensionality reduction, PCA, UMAP | Conference | 11730 |
ISSN | Citations | PageRank |
0302-9743 | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Givanna H. Putri | 1 | 1 | 1.03 |
Mark Read | 2 | 12 | 3.93 |
Irena Koprinska | 3 | 10 | 3.93 |
Thomas M. Ashhurst | 4 | 1 | 0.70 |
Nicholas J. C. King | 5 | 1 | 0.70 |