Title
COPA: Constrained PARAFAC2 for Sparse & Large Datasets.
Abstract
PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is modeling treatments across a set of patients with the varying number of medical encounters over time. Despite recent improvements on unconstrained PARAFAC2, its model factors are usually dense and sensitive to noise which limits their interpretability. As a result, the following open challenges remain: a) various modeling constraints, such as temporal smoothness, sparsity and non-negativity, are needed to be imposed for interpretable temporal modeling and b) a scalable approach is required to support those constraints efficiently for large datasets. To tackle these challenges, we propose a COnstrained PARAFAC2 (COPA) method, which carefully incorporates optimization constraints such as temporal smoothness, sparsity, and non-negativity in the resulting factors. To efficiently support all those constraints, COPA adopts a hybrid optimization framework using alternating optimization and alternating direction method of multiplier (AO-ADMM). As evaluated on large electronic health record (EHR) datasets with hundreds of thousands of patients, COPA achieves significant speedups (up to 36 times faster) over prior PARAFAC2 approaches that only attempt to handle a subset of the constraints that COPA enables. Overall, our method outperforms all the baselines attempting to handle a subset of the constraints in terms of speed, while achieving the same level of accuracy. Through a case study on temporal phenotyping of medically complex children, we demonstrate how the constraints imposed by COPA reveal concise phenotypes and meaningful temporal profiles of patients. The clinical interpretation of both the phenotypes and the temporal profiles was confirmed by a medical expert.
Year
DOI
Venue
2018
10.1145/3269206.3271775
CIKM
Keywords
DocType
Volume
Computational Phenotyping,Tensor Factorization,Unsupervised Learning
Conference
abs/1803.04572
ISSN
ISBN
Citations 
2155-0751
978-1-4503-6014-2
4
PageRank 
References 
Authors
0.43
11
6
Name
Order
Citations
PageRank
Ardavan Afshar192.20
Ioakeim Perros2625.13
Evangelos Papalexakis387859.71
Elizabeth Searles4783.75
Joyce C. Ho531523.24
Jimeng Sun64729240.91