Abstract | ||
---|---|---|
For cross-corpus speech emotion recognition (SER), how to obtain effective feature representation for the discrepancy elimination of feature distributions between source and target domains is a crucial issue. In this paper, we propose a Target-adapted Subspace Learning (TaSL) method for cross-corpus SER. The TaSL method trys to find a projection subspace, where the feature regress the label more accurately and the gap of feature distributions in target and source domains is bridged effectively. Then, in order to obtain more optimal projection matrix, l(1) norm and l(2,1) norm penalty terms are added to different regularization terms, respectively. Finally, we conduct extensive experiments on three public corpuses, EmoDB, eNTERFACE and AFEW 4.0. The experimental results show that our proposed method can achieve better performance compared with the state-of-the-art methods in the cross-corpus SER tasks. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1587/transinf.2019EDL8038 | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS |
Keywords | Field | DocType |
cross-corpus speech emotion recognition, transfer learning, domain adaptation, subspace learning | Subspace topology,Pattern recognition,Emotion recognition,Computer science,Speech recognition,Artificial intelligence | Journal |
Volume | Issue | ISSN |
E102D | 12 | 1745-1361 |
Citations | PageRank | References |
1 | 0.35 | 0 |
Authors | ||
6 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiuzhen Chen | 1 | 1 | 0.68 |
Xiaoyan Zhou | 2 | 1 | 0.35 |
Cheng Lu | 3 | 4 | 1.78 |
Yuan Zong | 4 | 162 | 17.39 |
Wenming Zheng | 5 | 1240 | 80.70 |
Chuangao Tang | 6 | 28 | 4.25 |