Title
Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning
Abstract
In interpersonal spoken interactions, individuals tend to adapt to their conversation partner's vocal characteristics to become similar, a phenomenon known as entrainment. Most previous computational approaches are knowledge-driven and linear, and fail to capture the inherent nonlinearity of entrainment. In this article, we present an unsupervised deep learning framework to derive a representation from speech features containing information relevant for vocal entrainment. We investigate both an encoding-based approach and a more robust triplet-network-based approach within the proposed framework. We also propose a number of distance measures in the representation space and use them to quantify entrainment. We first validate the proposed distances by using them to distinguish real conversations from fake ones. We then demonstrate their application to modeling several entrainment-relevant behaviors in observational psychotherapy, namely agreement, blame and emotional bond.
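The abstract does not give the loss function or the distance measures, so the following is only a minimal sketch of how a triplet objective and a distance in the representation space are commonly set up. It assumes a standard triplet margin loss and Euclidean distance; the function names (`euclidean`, `triplet_loss`, `mean_turn_distance`), the margin value, and the toy embeddings are all hypothetical and not taken from the article.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two embedding vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss (an assumption, not the article's stated loss).

    The loss is zero once the anchor is closer to the positive than to the
    negative by at least `margin`.
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)


def mean_turn_distance(turn_embeddings):
    """Average distance between consecutive turn embeddings.

    A hypothetical entrainment proxy: smaller values mean adjacent turns
    sit closer together in the representation space.
    """
    pairs = list(zip(turn_embeddings, turn_embeddings[1:]))
    return sum(euclidean(a, b) for a, b in pairs) / len(pairs)


# Toy usage: compare a "real" turn sequence with a reordered "fake" one.
real = [[0.0, 0.0], [0.1, 0.0], [0.2, 0.1]]
fake = [real[0], real[2], real[1]]
print(mean_turn_distance(real), mean_turn_distance(fake))
```

Under these assumptions, a validation in the spirit of the abstract would check whether the distance computed on real conversations tends to be smaller than on artificially constructed ones.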
Year
2022
DOI
10.1109/TAFFC.2020.3024972
Venue
IEEE Transactions on Affective Computing
Keywords
Entrainment, deep learning, unsupervised, triplet networks, behavioral signal processing, conversations, interaction
DocType
Journal
Volume
13
Issue
3
ISSN
1949-3045
Citations
1
PageRank
0.63
References
15
Authors
5
Name                  Order  Citations  PageRank
M. D. Nasir           1      58         6.14
Brian R. Baucom       2      152        16.36
Craig Bryan           3      10         2.32
Narayanan Shrikanth   4      5558       439.23
Georgiou Panayiotis   5      428        55.79