Title
Pattern extraction in sparse representations with application to audio coding
Abstract
This article deals with the extraction of frequency-domain auditory objects in sparse representations. To do so, we first generate sparse audio representations we call spikegrams, based on neural spikes using gammatone/gammachirp kernels and matching pursuit. We then propose a method to extract frequent auditory objects (patterns) in the afore-mentioned sparse representations. The extracted frequency-domain patterns help us address spikes (atoms or auditory events) collectively rather than individually. When audio compression is needed, the different patterns are stored in a small codebook that can be used to efficiently encode audio materials in a lossless way. The approach is applied to different audio signals and results are discussed and compared. Our experiments show that substantial coding gain is obtained when our technique based on pattern extraction is used as opposed to the case where spikes (atoms) are coded individually. This work is a first step towards the design of a high-quality “object-based” audio coder.
Year
Venue
Keywords
2009
Glasgow
audio coding,feature extraction,iterative methods,audio coding,audio compression,frequency-domain auditory object extraction,gammachirp kernel,gammatone kernel,high quality object based audio coder,matching pursuit,neural spikes,pattern extraction,sparse audio representations,sparse representation,spikegram
Field
DocType
ISBN
Matching pursuit,Audio signal,Speech coding,Pattern recognition,Audio mining,Computer science,Speech recognition,Sub-band coding,Artificial intelligence,Dynamic range compression,Lossless compression,Codebook
Conference
978-161-7388-76-7
Citations 
PageRank 
References 
1
0.35
7
Authors
3
Name
Order
Citations
PageRank
Ramin Pichevar1569.92
Hossein Najaf-Zadeh2224.37
Najaf-Zadeh, H.310.35