Pattern extraction in sparse representations with application to audio coding - Citegraph

Paper Info

Title
Pattern extraction in sparse representations with application to audio coding

Abstract
This article deals with the extraction of frequency-domain auditory objects in sparse representations. To do so, we first generate sparse audio representations we call spikegrams, based on neural spikes using gammatone/gammachirp kernels and matching pursuit. We then propose a method to extract frequent auditory objects (patterns) in the afore-mentioned sparse representations. The extracted frequency-domain patterns help us address spikes (atoms or auditory events) collectively rather than individually. When audio compression is needed, the different patterns are stored in a small codebook that can be used to efficiently encode audio materials in a lossless way. The approach is applied to different audio signals and results are discussed and compared. Our experiments show that substantial coding gain is obtained when our technique based on pattern extraction is used as opposed to the case where spikes (atoms) are coded individually. This work is a first step towards the design of a high-quality “object-based” audio coder.

Year	Venue	Keywords
2009	Glasgow	audio coding,feature extraction,iterative methods,audio coding,audio compression,frequency-domain auditory object extraction,gammachirp kernel,gammatone kernel,high quality object based audio coder,matching pursuit,neural spikes,pattern extraction,sparse audio representations,sparse representation,spikegram
Field	DocType	ISBN
Matching pursuit,Audio signal,Speech coding,Pattern recognition,Audio mining,Computer science,Speech recognition,Sub-band coding,Artificial intelligence,Dynamic range compression,Lossless compression,Codebook	Conference	978-161-7388-76-7
Citations	PageRank	References
1	0.35	7
Authors
3

Authors (3 rows)

Cited by (1 rows)

References (7 rows)

Name	Order	Citations	PageRank
Ramin Pichevar	1	56	9.92
Hossein Najaf-Zadeh	2	22	4.37
Najaf-Zadeh, H.	3	1	0.35

1