Title
On the importance of normative data in speech-based assessment.
Abstract
Data sets for identifying Alzheimeru0027s disease (AD) are often relatively sparse, which limits their ability to train generalizable models. Here, we augment such a data set, DementiaBank, with each of two normative data sets, the Wisconsin Longitudinal Study and Talk2Me, each of which employs a speech-based picture-description assessment. Through minority class oversampling with ADASYN, we outperform state-of-the-art results in binary classification of people with and without AD in DementiaBank. This work highlights the effectiveness of combining sparse and difficult-to-acquire patient data with relatively large and easily accessible normative datasets.
Year
Venue
Field
2017
arXiv: Computation and Language
Data set,Longitudinal study,Binary classification,Oversampling,Normative,Computer science,Artificial intelligence,Natural language processing,Machine learning
DocType
Volume
Citations 
Journal
abs/1712.00069
3
PageRank 
References 
Authors
0.49
3
3
Name
Order
Citations
PageRank
Zeinab Noorian130.49
Chloé Pou-Prom231.17
Frank Rudzicz323144.82