Abstract | ||
---|---|---|
We propose a concatenative synthesis approach to the problem of foreign accent conversion. The approach consists of replacing the most accented portions of nonnative speech with alternative segments from a corpus of the speaker's own speech based on their similarity to those from a reference native speaker. We propose and compare two approaches for selecting units, one based on acoustic similarity [e.g., mel frequency cepstral coefficients (MFCCs)] and a second one based on articulatory similarity, as measured through electromagnetic articulography (EMA). Our hypothesis is that articulatory features provide a better metric for linguistic similarity across speakers than acoustic features. To test this hypothesis, we recorded an articulatory-acoustic corpus from a native and a nonnative speaker, and evaluated the two speech representations (acoustic versus articulatory) through a series of perceptual experiments. Formal listening tests indicate that the approach can achieve a 20% reduction in perceived accent, but also reveal a strong coupling between accent and speaker identity. To address this issue, we disguised original and resynthesized utterances by altering their average pitch and normalizing vocal tract length. An additional listening experiment supports the hypothesis that articulatory features are less speaker dependent than acoustic features. |
Year | DOI | Venue |
---|---|---|
2012 | 10.1109/TASL.2012.2201474 | Audio, Speech, and Language Processing, IEEE Transactions |
Keywords | Field | DocType |
Accent conversion,speaker recognition,speech perception,speech synthesis | Mel-frequency cepstrum,Concatenative synthesis,Pragmatics,Computer science,Active listening,Speech recognition,Speaker recognition,Natural language processing,Artificial intelligence,Speaker diarisation,Hidden Markov model,Vocal tract | Journal |
Volume | Issue | ISSN |
20 | 8 | 1558-7916 |
Citations | PageRank | References |
10 | 0.71 | 15 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Daniel Felps | 1 | 10 | 0.71 |
Christian Geng | 2 | 10 | 0.71 |
Schiffman, S.S. | 3 | 13 | 1.62 |