Siri On-Device Deep Learning-Guided Unit Selection Text-To-Speech System - Citegraph

Paper Info

Title
Siri On-Device Deep Learning-Guided Unit Selection Text-To-Speech System

Abstract
This paper describes Apple's hybrid unit selection speech synthesis system, which provides the voices for Siri with the requirement of naturalness, personality and expressivity. It has been deployed into hundreds of millions of desktop and mobile devices (e.g. iPhone, iPad, Mac, etc.) via iOS and macOS in multiple languages. The system is following the classical unit selection framework with the advantage of using deep learning techniques to boost the performance. In particular, deep and recurrent mixture density networks are used to predict the target and concatenation reference distributions for respective costs during unit selection. In this paper, we present an overview of the run-time TTS engine and the voice building process. We also describe various techniques that enable on-device capability such as preselection optimization, caching for low latency, and unit pruning for low footprint, as well as techniques that improve the naturalness and expressivity of the voice such as the use of long units.
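
Illustrative sketch (not from the paper): the abstract says that predicted distributions supply the target and concatenation costs during unit selection. The Python below shows one way such a search could look, as a Viterbi pass over candidate units. Everything concrete here is an assumption made for illustration: the names `gaussian_nll` and `select_units`, the scalar features, the single shared join distribution, and the use of one Gaussian in place of a mixture density network output.

```python
import math


def gaussian_nll(x, mean, var):
    """Negative log-likelihood of scalar x under a Gaussian N(mean, var)."""
    return 0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def select_units(candidates, target_dists, concat_dist, w_target=1.0, w_concat=1.0):
    """Pick one unit per position by minimising target + concatenation cost.

    candidates[t]   : list of (unit_id, target_feature, start_feature, end_feature)
    target_dists[t] : (mean, var) assumed to be predicted for position t
    concat_dist     : (mean, var) for the feature jump across a join
                      (hypothetical simplification: one distribution for all joins)
    """
    # best[t][j] = (cumulative cost of the best path ending in candidate j, backpointer)
    best = [[(w_target * gaussian_nll(c[1], *target_dists[0]), None)
             for c in candidates[0]]]
    for t in range(1, len(candidates)):
        row = []
        for c in candidates[t]:
            target_cost = w_target * gaussian_nll(c[1], *target_dists[t])
            # Cost of reaching this candidate from each candidate at t - 1.
            costs = [
                best[t - 1][i][0]
                + w_concat * gaussian_nll(c[2] - prev[3], *concat_dist)
                for i, prev in enumerate(candidates[t - 1])
            ]
            i_best = min(range(len(costs)), key=costs.__getitem__)
            row.append((costs[i_best] + target_cost, i_best))
        best.append(row)

    # Backtrack from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path = []
    for t in range(len(candidates) - 1, -1, -1):
        path.append(candidates[t][j][0])
        j = best[t][j][1]
    path.reverse()
    return path


# Toy data: "a1" matches its target slightly worse than "a0" but joins smoothly to "b1".
toy_candidates = [
    [("a0", 1.0, 0.0, 1.0), ("a1", 1.5, 0.0, 9.0)],
    [("b1", 2.0, 9.0, 2.0)],
]
toy_targets = [(1.0, 1.0), (2.0, 1.0)]
toy_path = select_units(toy_candidates, toy_targets, concat_dist=(0.0, 1.0))
```

In this toy case the search prefers `a1` over `a0`, because the large feature jump at the join outweighs the slightly better target match. A real system would work with multidimensional acoustic features and mixture distributions rather than scalars.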

Year	DOI	Venue
2017	10.21437/Interspeech.2017-1798	18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords	Field	DocType
Speech synthesis, unit selection, hybrid, recurrent mixture density network, on-device	Speech synthesis, Computer science, Speech recognition, Artificial intelligence, Deep learning	Conference
ISSN	Citations	PageRank
2308-457X	3	0.52
References	Authors
7	18

Authors (18 rows)

Name	Order	Citations	PageRank
Tim Capes	1	3	0.52
Paul Coles	2	3	0.52
Alistair Conkie	3	264	38.03
Ladan Golipour	4	23	3.17
Abie Hadjitarkhani	5	3	0.52
Qiong Hu	6	3	1.54
Nancy Huddleston	7	3	0.52
Melvyn Hunt	8	3	0.86
Jiangchuan Li	9	3	0.52
Matthias Neeracher	10	3	0.52
Kishore Prahallad	11	239	19.55
Tuomo Raitio	12	149	12.86
Ramya Rasipuram	13	57	6.90
Greg Townsend	14	3	0.52
Becci Williamson	15	3	0.52
David Winarsky	16	3	0.52
Zhizheng Wu	17	565	35.23
Hepeng Zhang	18	3	0.52
