Title
The Zero Resource Speech Challenge 2019: TTS without T
Abstract
We present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery dataset) and align them to the voice recordings in a way that works best for the purpose of synthesizing novel utterances from novel speakers, similar to the target speaker's voice. We describe the metrics used for evaluation, a baseline system consisting of unsupervised subword unit discovery plus a standard TTS system, and a topline TTS using gold phoneme transcriptions. We present an overview of the 19 submitted systems from 11 teams and discuss the main results.
Year
2019
DOI
10.21437/interspeech.2019-2904
Venue
Conference of the International Speech Communication Association
DocType
Journal
Citations
1
PageRank
0.34
References
0
Authors
13
Name                Order  Citations  PageRank
Ewan Dunbar         1      71         5.08
Robin Algayres      2      1          0.34
Julien Karadayi     3      5          1.71
Mathieu Bernard     4      10         2.48
Juan Benjumea       5      6          1.78
Xuan-Nga Cao        6      26         3.52
Lucie Miskic        7      3          0.69
Charlotte Dugrain   8      3          0.69
Lucas Ondel         9      35         7.16
Alan W. Black       10     4391       742.28
Laurent Besacier    11     696        102.67
Sakriani Sakti      12     257        65.02
Emmanuel Dupoux     13     238        37.33