Acoustic Scene Classification Using Joint Time-Frequency Image-Based Feature Representations - Citegraph

Paper Info

Title
Acoustic Scene Classification Using Joint Time-Frequency Image-Based Feature Representations

Abstract
The classification of acoustic scenes is important in emerging applications such as automatic audio surveillance, machine listening and multimedia content analysis. In this paper, we present an approach for acoustic scene classification by using joint time-frequency image-based feature representations. In acoustic scene classification, joint time-frequency representation (TFR) is shown to better represent important information across a wide range of low and middle frequencies in the audio signal. The audio signal is converted to Constant-Q Transform (CQT) and Mel-spectrum TFRs and local binary patterns (LBP) are used to extract the features from these TFRs. To ensure localized spectral information is not lost, the TFRs are divided into a number of zones. Then, we perform score level fusion to further improve the classification performance accuracy. Our technique achieves a competitive performance with a classification accuracy of 83.4% on the DCASE 2016 development dataset compared to the existing current state of the art.

Year	DOI	Venue
2018	10.1109/AVSS.2018.8639164	2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)
Keywords	Field	DocType
Feature extraction,Time-frequency analysis,Streaming media,Surveillance,Mel frequency cepstral coefficient,Histograms	Histogram,Mel-frequency cepstrum,Audio signal,Pattern recognition,Computer science,Local binary patterns,Image based,Feature extraction,Time–frequency analysis,Artificial intelligence,Machine listening	Conference
ISBN	Citations	PageRank
978-1-5386-9294-3	0	0.34
References	Authors
0	3

Authors (3 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Shamsiah Abidin	1	0	1.01
Roberto Togneri	2	814	48.33
Ferdous Sohel	3	55	10.97

1