Title
Cell-Coupled Long Short-Term Memory With L-Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features.
Abstract
In early stages, patients with bipolar disorder are often diagnosed as having unipolar depression in mood disorder diagnosis. Because the long-term monitoring is limited by the delayed detection of mood disorder, an accurate and one-time diagnosis is desirable to avoid delay in appropriate treatment due to misdiagnosis. In this paper, an elicitation-based approach is proposed for realizing a one-time diagnosis by using responses elicited from patients by having them watch six emotion-eliciting videos. After watching each video clip, the conversations, including patient facial expressions and speech responses, between the participant and the clinician conducting the interview were recorded. Next, the hierarchical spectral clustering algorithm was employed to adapt the facial expression and speech response features by using the extended Cohn–Kanade and eNTERFACE databases. A denoizing autoencoder was further applied to extract the bottleneck features of the adapted data. Then, the facial and speech bottleneck features were input into support vector machines to obtain speech emotion profiles (EPs) and the modulation spectrum (MS) of the facial action unit sequence for each elicited response. Finally, a cell-coupled long short-term memory (LSTM) network with an <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$L$ </tex-math></inline-formula> -skip fusion mechanism was proposed to model the temporal information of all elicited responses and to loosely fuse the EPs and the MS for conducting mood disorder detection. The experimental results revealed that the cell-coupled LSTM with the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$L$ </tex-math></inline-formula> -skip fusion mechanism has promising advantages and efficacy for mood disorder detection.
Year
DOI
Venue
2020
10.1109/TNNLS.2019.2899884
IEEE transactions on neural networks and learning systems
Keywords
Field
DocType
Mood,Videos,Databases,Feature extraction,Medical services,Monitoring,Delays
Mood,Fusion mechanism,Bottleneck,Bipolar disorder,Autoencoder,Pattern recognition,Computer science,Support vector machine,Long short term memory,Facial expression,Artificial intelligence
Journal
Volume
Issue
ISSN
31
1
2162-237X
Citations 
PageRank 
References 
1
0.35
13
Authors
4
Name
Order
Citations
PageRank
Ming-Hsiang Su165.63
Chung-Hsien Wu21099116.79
Kun-Yi Huang3145.00
Tsung-Hsien Yang4454.71