Title
Data embedding in speech signals using perceptual masking
Abstract
In this paper, a data embedding technique for speech signals, exploiting the masking property of the human auditory system, is presented. The signal in the frequency domain is partitioned into subbands. The data embedding parameters of each subband are computed from the auditory masking threshold function and a channel noise estimate. Data embedding is performed by modifying the Discrete Hartley Transform (DHT) coefficients according to the principles of the Scalar Costa Scheme (SCS). A maximum likelihood detector is employed in the decoder for embedded-data presence detection and data-embedding quantization-step estimation. We demonstrate the proposed data embedding technique by simulation of data embedding in a speech signal transmitted over a telephone line. The demonstrated system achieves transparent data-embedding at the rate of 300 information bits/second with a bit-error-rate of approximately 10-4. The proposed technique outperforms spread spectrum (SS) based data-embedding techniques for speech signals.
Year
Venue
Keywords
2004
EUSIPCO
discrete hartley transforms,maximum likelihood estimation,quantisation (signal),speech intelligibility,speech processing,dht,scs,auditory masking threshold function,bit-error-rate,channel noise estimate,data embedding technique,data-embedding quantization-step estimation,discrete hartley transform coefficients,embedded-data presence detection,frequency domain,human auditory system,masking property,maximum likelihood detector,perceptual masking,scalar costa scheme,speech signals,telephone line
Field
DocType
ISBN
Frequency domain,Auditory masking,Speech processing,Embedding,Masking (art),Computer science,Speech recognition,Discrete Hartley transform,Spread spectrum,Perceptual Masking
Conference
978-320-0001-65-7
Citations 
PageRank 
References 
3
0.52
6
Authors
2
Name
Order
Citations
PageRank
Ariel Sagi1232.70
David Malah221960.95