Abstract | ||
---|---|---|
In interpersonal spoken interactions, individuals tend to adapt to their conversation partner's vocal characteristics to become similar, a phenomenon known as entrainment. A majority of the previous computational approaches are often knowledge driven and linear and fail to capture the inherent nonlinearity of entrainment. In this article, we present an unsupervised deep learning framework to derive a representation from speech features containing information relevant for vocal entrainment. We investigate both an encoding based approach and a more robust triplet network based approach within the proposed framework. We also propose a number of distance measures in the representation space and use them for quantification of entrainment. We first validate the proposed distances by using them to distinguish real conversations from fake ones. Then we also demonstrate their applications in relation to modeling several entrainment-relevant behaviors in observational psychotherapy, namely agreement, blame and emotional bond. |
Year | DOI | Venue |
---|---|---|
2022 | 10.1109/TAFFC.2020.3024972 | IEEE Transactions on Affective Computing |
Keywords | DocType | Volume |
Entrainment,deep learning,unsupervised,triplet networks,behavioral signal processing,conversations,interaction | Journal | 13 |
Issue | ISSN | Citations |
3 | 1949-3045 | 1 |
PageRank | References | Authors |
0.63 | 15 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
M. D. Nasir | 1 | 58 | 6.14 |
Brian R. Baucom | 2 | 152 | 16.36 |
craig bryan | 3 | 10 | 2.32 |
Narayanan Shrikanth | 4 | 5558 | 439.23 |
Georgiou Panayiotis | 5 | 428 | 55.79 |