Title
Dynamic Unit Selection for Very Low Bit Rate Coding at 500 bits/sec
Abstract
This paper presents a new unit selection process for Very Low Bit Rate speech encoding around 500 bits/sec. The encoding is based on speech recognition and speech synthesis technologies. The aim of this approach is to use at best the speech corpus of the speaker. The proposed solution uses HMM modelling for the recognition of elementary speech units. The HMM are first trained in an unsupervised phase and then are used to build the synthesis unit corpus. The coding process relies on the synthesis unit selection. The speech is decoded by concatenating the selected units through HNM-like decomposition of speech. The new unit selection aims at finding the unit that best match the prosody constraints to model its evolution. It enables the size of the synthesis unit corpus to be independant of the targeted bit rate. A complete quantisation scheme of the overall set of encoded parameters is given.
Year
DOI
Venue
2004
10.1007/978-3-540-30120-2_53
Lecture Notes in Artificial Intelligence
Keywords
Field
DocType
speech recognition,speech synthesis
Speech corpus,Speech processing,Speech synthesis,Speech coding,Voice activity detection,Markov model,Computer science,Speech recognition,Hidden Markov model,Encoding (memory)
Conference
Volume
ISSN
Citations 
3206
0302-9743
0
PageRank 
References 
Authors
0.34
10
3
Name
Order
Citations
PageRank
Marc Padellini130.80
François Capman2163.07
Geneviève Baudoin313819.08