Title
"Multilingual" Deep Neural Network For Music Genre Classification
Abstract
Multilingual deep neural network (DNN) has been widely used in low-resource automatic speech recognition (ASR) in order to balance the rich-resource and low-resource speech recognition or to build the low-resource ASR system quickly. Inspired by the idea of using multilingual DNN for ASR, we use a "multilingual" DNN (Multi-DNN) for music genre classification. However, we do not have "multilingual" in music, so we use the similar resource instead. In order to obtain the similar resource corresponding to small target database, the nearest neighbor (NN) algorithm is used to re-label the large similar database. Then the re-labeled large similar database is used to train a Multi-DNN, and the small target database is used to further adapt the trained Multi-DNN. By using the Multi-DNN approach, the DNN can be well trained, and be transferred to the small target database quickly. The experiments are evaluated on the benchmark databases, ISMIR database and GTZAN database, which are used as the large similar database and small target database respectively. The experiment results show that the proposed method can achieve 93.4% (10-fold cross-validation) average classification accuracy on GTZAN database, which outperforms the state-of-the-art best performance on this database.
Year
Venue
Keywords
2015
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5
multilingual, music genre classification, DNN
Field
DocType
Citations 
Computer science,Speech recognition,Artificial neural network
Conference
1
PageRank 
References 
Authors
0.36
0
5
Name
Order
Citations
PageRank
Jia Dai121.74
Wenju Liu230.73
Chong-Jia Ni3204.84
Like Dong431.09
Hong Yang531.09