Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network. - Citegraph

Paper Info

Title
Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network.

Abstract
Audio scene classification, the problem of predicting class labels of audio scenes, has drawn lots of attention during the last several years. However, it remains challenging and falls short of accuracy and efficiency. Recently, Convolutional Neural Network (CNN)-based methods have achieved better performance with comparison to the traditional methods. Nevertheless, conventional single channel CNN may fail to consider the fact that additional cues may be embedded in the multi-channel recordings. In this paper, we explore the use of Multi-channel CNN for the classification task, which aims to extract features from different channels in an end-to-end manner. We conduct the evaluation compared with the conventional CNN and traditional Gaussian Mixture Model-based methods. Moreover, to improve the classification accuracy further, this paper explores the using of mixup method. In brief, mixup trains the neural network on linear combinations of pairs of the representation of audio scene examples and their labels. By employing the mixup approach for data augmentation, the novel model can provide higher prediction accuracy and robustness in contrast with previous models, while the generalization error can also be reduced on the evaluation data.

Year	DOI	Venue
2018	10.1007/978-3-030-00764-5_2	ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III
DocType	Volume	ISSN
Conference	11166	0302-9743
Citations	PageRank	References
3	0.45	15
Authors
8

Authors (8 rows)

Cited by (3 rows)

References (15 rows)

Name	Order	Citations	PageRank
Kele Xu	1	46	21.80
Dawei Feng	2	14	5.09
Haibo Mi	3	134	12.60
Boqing Zhu	4	6	1.60
Dezhi Wang	5	8	2.20
Lilun Zhang	6	5	1.25
Hengxing Cai	7	9	2.21
Shuwen Liu	8	3	0.45

1