Abstract |
---|
The AT&T VOICEBUILDER provides a new tool for researchers and practitioners who want to have their voices synthesized by a high-quality, commercial-grade text-to-speech system without needing to install, configure, or manage speech processing software and equipment. It is implemented as a web service on the AT&T Speech Mashup Portal. The system records and validates users' utterances, processes them to build a synthetic voice, and provides a web service API that makes the voice available to real-time applications through a scalable cloud-based processing platform. All procedures are automated, so no human intervention is required. We present experimental comparisons of voices built using the system. |
Attribute | Value |
---|---|
Year | 2012 |
Venue | LREC 2012 - Eighth International Conference on Language Resources and Evaluation |
Keywords | Text-to-Speech, Voice Building, Cloud Computing |
Field | Mashup, Speech processing, Speech synthesis, Computer science, Software, Artificial intelligence, Natural language processing, Web service, Multimedia, Scalability, Cloud computing |
DocType | Conference |
Citations | 0 |
PageRank | 0.34 |
References | 3 |
Authors | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alistair Conkie | 1 | 264 | 38.03 |
Thomas Okken | 2 | 19 | 2.09 |
Yeon-Jun Kim | 3 | 52 | 9.52 |
Giuseppe Di Fabbrizio | 4 | 330 | 44.45 |