Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning - Citegraph

Paper Info

Title
Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning

Abstract
In interpersonal spoken interactions, individuals tend to adapt to their conversation partner's vocal characteristics and become more similar to them, a phenomenon known as entrainment. Most previous computational approaches are knowledge-driven and linear, and they fail to capture the inherent nonlinearity of entrainment. In this article, we present an unsupervised deep learning framework that derives, from speech features, a representation containing information relevant to vocal entrainment. We investigate both an encoding-based approach and a more robust triplet-network-based approach within the proposed framework. We also propose several distance measures in the representation space and use them to quantify entrainment. We first validate the proposed distances by using them to distinguish real conversations from fake ones. We then demonstrate their application to modeling several entrainment-relevant behaviors in observational psychotherapy, namely agreement, blame, and emotional bond.

Field	Value
Year	2022
DOI	10.1109/TAFFC.2020.3024972
Venue	IEEE Transactions on Affective Computing
Keywords	Entrainment, deep learning, unsupervised, triplet networks, behavioral signal processing, conversations, interaction
DocType	Journal
Volume	13
Issue	3
ISSN	1949-3045
Citations	1
PageRank	0.63
References	15
Authors	5

Authors (5 rows)

Name	Order	Citations	PageRank
M. D. Nasir	1	58	6.14
Brian R. Baucom	2	152	16.36
Craig Bryan	3	10	2.32
Shrikanth Narayanan	4	5558	439.23
Panayiotis Georgiou	5	428	55.79