Title
Image Representation Of The Subband Power Distribution For Robust Sound Classification
Abstract
This paper proposes a robust sound event classification method, based on a selective image feature derived from the novel subband power distribution (SPD), which represents the distribution of power over frequency components. This method is an extension of our previous work, which was motivated by the visual perception of the spectrogram to produce a robust feature for sound classification. Unlike the conventional spectrogram, the proposed SPD representation is invariant to time-shifting and is therefore suitable for real scenarios where the detected sound clips are not always balanced. Furthermore, we develop a missing feature classification method, which automatically selects the sparse, representative areas of the signal from the noisy SPD image of the sound clip. The method is tested on a large database containing 50 sound classes, under four different noise environments, ranging from clean to severe noise conditions. A significant improvement in performance was obtained in mismatched conditions, producing an average classification accuracy of 87.5% in the 0 dB noise condition.
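The time-shift invariance claimed in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the SPD can be approximated by a per-subband histogram of log-power taken over all time frames, and the frame length, hop size, bin count, and dB range are hypothetical values chosen only for the example. The function names are likewise illustrative.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Log-power spectrogram with shape (n_frames, frame_len // 2 + 1)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([
        signal[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return 10.0 * np.log10(power + 1e-10)

def subband_power_distribution(log_spec, n_bins=50, lo=-100.0, hi=50.0):
    """Histogram each subband's log-power over time.

    Returns an image of shape (n_subbands, n_bins); each row sums to 1.
    The bin count and dB range are assumptions, not values from the paper.
    """
    edges = np.linspace(lo, hi, n_bins + 1)
    clipped = np.clip(log_spec, lo, hi)  # keep every frame inside the bin range
    counts = np.stack([
        np.histogram(clipped[:, k], bins=edges)[0]
        for k in range(log_spec.shape[1])
    ])
    return counts / log_spec.shape[0]

# Demo: moving the content to a different position in the clip (modelled
# here as a circular shift of the spectrogram frames) leaves the image unchanged.
rng = np.random.default_rng(0)
spec = spectrogram(rng.standard_normal(4096))
spd = subband_power_distribution(spec)
spd_shifted = subband_power_distribution(np.roll(spec, 7, axis=0))
assert np.array_equal(spd, spd_shifted)
```

Because the histogram discards frame order, any reordering of frames yields the same image. A shift of the raw waveform changes the framing slightly, so in that case the invariance holds only approximately.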
Year
2011
Venue
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5
Keywords
Sound event classification, time-shifting, spectrogram, support vector machines
Field
Computer vision, Pattern recognition, Computer science, Image representation, Sound classification, Speech recognition, Artificial intelligence
DocType
Conference
Citations
1
PageRank
0.38
References
1
Authors
3
Name, Order, Citations, PageRank
Jonathan Dennis, 1, 100, 9.36
Tran Huy Dat, 2, 165, 25.31
Haizhou Li, 3, 3678, 334.61