Title | ||
---|---|---|
Music Auto-Tagging Using Scattering Transform And Convolutional Neural Network With Self-Attention |
Abstract | ||
---|---|---|
As a branch of machine learning, deep learning has been used for tackling with the music auto-tagging problem. Deep learning methods, especially those with convolutional neural network (CNN) architecture, have exhibited good performance on this multi-label classification task. However, the feature extracting part and preprocessing part of this architecture need to be improved. In this paper, we propose a deep-learning model based on CNN with scattering transform and self-attention mechanism for music automatic tagging. To get a balance between information integrity and feature extraction in the preprocessing phase, we employ the scattering transform. Then, a multi-layer CNN is used to extract higher-level features from the scattering coefficients. In order to select better receptive fields of the CNN, self-attention sub-network is appended at the last layer of CNN. Experimental results on the MagnaTagATune dataset and Million Song Dataset (MSD) show the proposed model is a good choice for music auto-tagging task, since the scores of the area under the receiver operating characteristic curve (ROC-AUC) and the area under the precision-recall curve (PR-AUC) obtained in this paper surpass the state-of-the-art models. Furthermore, we visualize the distributions of attention weights, activations of the CNN and ROC-AUC scores on each tag for better understanding of the model. (C) 2020 Elsevier B.V. All rights reserved. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1016/j.asoc.2020.106702 | APPLIED SOFT COMPUTING |
Keywords | DocType | Volume |
Music auto-tagging, Convolutional neural network, Attention mechanism, Scattering transform, Deep learning | Journal | 96 |
ISSN | Citations | PageRank |
1568-4946 | 0 | 0.34 |
References | Authors | |
0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Guangxiao Song | 1 | 6 | 2.10 |
Zhijie Wang | 2 | 89 | 11.14 |
Han, Fang | 3 | 4 | 3.51 |
Shenyi Ding | 4 | 0 | 0.34 |
Xiaochun Gu | 5 | 0 | 0.34 |