Title | ||
---|---|---|
Image Representation Of The Subband Power Distribution For Robust Sound Classification |
Abstract | ||
---|---|---|
This paper proposes a robust sound event classification method, based on a selective image feature driven from the novel subband power distribution (SPD), which represents the distribution of power over frequency components. This method is an extension of our previous work, which was motivated by the visual perception of the spectrogram to produce a robust feature for sound classification. Unlike the conventional spectrogram, the proposed SPD representation is invariant to timeshifting and therefore suitable for real scenarios where the detected sound clips are not always balanced. Furthermore, we develop a missing feature classification method, which automatically selects the sparse, representative areas of the signal from the noisy SPD image of the sound clip. The method is tested on a large database containing 50 sound classes, under four different noise environments, varying from clean to severe noise conditions. A significant improvement in performance was obtained in mismatched conditions, producing an average classification accuracy of 87.5% in the OdB noise condition. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | Sound event classification, time-shifting, spectrogram, support vector machines |
Field | DocType | Citations |
Computer vision,Pattern recognition,Computer science,Image representation,Sound classification,Speech recognition,Artificial intelligence | Conference | 1 |
PageRank | References | Authors |
0.38 | 1 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jonathan Dennis | 1 | 100 | 9.36 |
Tran Huy Dat | 2 | 165 | 25.31 |
Haizhou Li | 3 | 3678 | 334.61 |