Title
Enabling Factor Analysis On Thousand-Subject Neuroimaging Datasets
Abstract
The scale of functional magnetic resonance imaging (fMRI) data is rapidly increasing as large multi-subject datasets become widely available and high-resolution scanners are adopted. The inherent low dimensionality of the information in this data has led neuroscientists to consider factor analysis methods to extract and analyze the underlying brain activity. In this work, we consider two recent multi-subject factor analysis methods: the Shared Response Model and Hierarchical Topographic Factor Analysis. We perform analytical, algorithmic, and code optimizations to enable multi-node parallel implementations to scale. Single-node improvements result in 99x and 2062x speedups on the two methods, and enable the processing of larger datasets. Our distributed implementations show strong scaling of 3.3x and 5.5x respectively with 20 nodes on real datasets. We demonstrate weak scaling on a synthetic dataset with 1024 subjects, equivalent in size to the biggest fMRI dataset collected to date, on up to 1024 nodes and 32,768 cores.
Year
2016
Venue
2016 IEEE International Conference on Big Data (Big Data)
Keywords
functional Magnetic Resonance Imaging, Multi-subject Analysis, Scaling, Factor Analysis
DocType
Conference
Citations
3
PageRank
0.56
References
9
Authors
10
Name                 Order  Citations  PageRank
Michael Anderson     1      15         5.33
Mihai Capotă         2      46         2.77
Javier Turek         3      37         5.94
Xia Zhu              4      36         8.26
Theodore L. Willke   5      458        29.71
Yida Wang            6      10         5.40
Po-Hsuan Chen        7      25         4.52
Jeremy R. Manning    8      6          1.97
Peter J. Ramadge     9      323        34.67
Kenneth A. Norman    10     89         13.20