Abstract |
---|
We investigate a structured sparse spectral transform method for voice conversion (VC) to perform frequency warping and spectral shaping simultaneously on high-dimensional (D) STRAIGHT spectra. Learning a large transform matrix for high-D data often results in an overfit matrix with low sparsity, which leads to muffled speech in VC. We address this problem by using the frequency-warping characteri... |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/TASLP.2018.2860682 | IEEE/ACM Transactions on Audio, Speech, and Language Processing |
Keywords | Field | DocType |
Transforms, Matrices, Sparse matrices, Speech processing, Distortion measurement, Distortion, Training | Speech processing, Pattern recognition, Computer science, Matrix (mathematics), Mean opinion score, Speech recognition, Artificial intelligence, Non-negative matrix factorization, Overfitting, Transformation matrix, Sparse matrix, Covariance | Journal |
Volume | Issue | ISSN |
26 | 12 | 2329-9290 |
Citations | PageRank | References |
0 | 0.34 | 4 |
Authors |
---|
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yunxin Zhao | 1 | 807 | 121.74 |
Mili Kuruvilla-Dugdale | 2 | 0 | 1.01 |
Minguang Song | 3 | 0 | 2.37 |