Abstract | ||
---|---|---|
This article deals with the extraction of frequency-domain auditory objects in sparse representations. To do so, we first generate sparse audio representations we call spikegrams, based on neural spikes using gammatone/gammachirp kernels and matching pursuit. We then propose a method to extract frequent auditory objects (patterns) in the afore-mentioned sparse representations. The extracted frequency-domain patterns help us address spikes (atoms or auditory events) collectively rather than individually. When audio compression is needed, the different patterns are stored in a small codebook that can be used to efficiently encode audio materials in a lossless way. The approach is applied to different audio signals and results are discussed and compared. Our experiments show that substantial coding gain is obtained when our technique based on pattern extraction is used as opposed to the case where spikes (atoms) are coded individually. This work is a first step towards the design of a high-quality “object-based” audio coder. |
Year | Venue | Keywords |
---|---|---|
2009 | Glasgow | audio coding,feature extraction,iterative methods,audio coding,audio compression,frequency-domain auditory object extraction,gammachirp kernel,gammatone kernel,high quality object based audio coder,matching pursuit,neural spikes,pattern extraction,sparse audio representations,sparse representation,spikegram |
Field | DocType | ISBN |
Matching pursuit,Audio signal,Speech coding,Pattern recognition,Audio mining,Computer science,Speech recognition,Sub-band coding,Artificial intelligence,Dynamic range compression,Lossless compression,Codebook | Conference | 978-161-7388-76-7 |
Citations | PageRank | References |
1 | 0.35 | 7 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Ramin Pichevar | 1 | 56 | 9.92 |
Hossein Najaf-Zadeh | 2 | 22 | 4.37 |
Najaf-Zadeh, H. | 3 | 1 | 0.35 |