A New Model-Based Mandarin-Speech Coding System - Citegraph

Paper Info

Title
A New Model-Based Mandarin-Speech Coding System

Abstract
In this paper, a new model-based Mandarin-speech coding system is proposed. It employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate from the input speech enriched transcriptions, including linguistic features, prosodic tags and spectral parameters in the encoder. By sending these features to the decoder, we can first reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration by HPM using the linguistic features and prosodic tags; and then combined with spectral parameters to reconstruct the input speech signal by an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.

Year	Venue	Keywords
2011	12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5	model-based speech coding, prosody-enriched ASR, enriched transcriptions, hierarchical prosodic model
Field	DocType	Citations
Modified Huffman coding,Tunstall coding,Speech coding,Context-adaptive variable-length coding,Computer science,Speech recognition,Shannon–Fano coding,Mandarin Chinese,Context-adaptive binary arithmetic coding,Variable-length code	Conference	1
PageRank	References	Authors
0.35	1	6

Authors (6 rows)

Cited by (1 rows)

References (1 rows)

Name	Order	Citations	PageRank
Chen-Yu Chiang	1	31	11.55
Jyh-Her Yang	2	9	1.68
Ming-Chieh Liu	3	7	1.28
Yih-Ru Wang	4	237	34.68
Yuan-Fu Liao	5	73	20.38
Sin-Horng Chen	6	273	39.86

1