Title
Predicting the Geographical Origin of Music
Abstract
Traditional research into the arts has almost always been based around the subjective judgment of human critics. The use of data mining tools to understand art has great promise as it is objective and operational. We investigate the distribution of music from around the world: geographical ethnomusicology. We cast the problem as training a machine learning program to predict the geographical origin of pieces of music. This is a technically interesting problem as it has features of both classification and regression, and because of the spherical geometry of the surface of the Earth. Because of these characteristics of the representation of geographical positions, most standard classification/regression methods cannot be directly used. Two applicable methods are K-Nearest Neighbors and Random forest regression, which are robust to the non-standard structure of data. We also investigated improving performance through use of bagging. We collected 1,142 pieces of music from 73 countries/areas, and described them using 2 different sets of standard audio descriptors using MARSYAS. 10-fold cross validation was used in all experiments. The experimental results indicate that Random forest regression produces significantly better results than KNN, and the use of bagging improves the performance of KNN. The best performing algorithm achieved a mean great circle distance error of 3,113 km.
Year
DOI
Venue
2014
10.1109/ICDM.2014.73
ICDM
Keywords
Field
DocType
music geographical origin,learning (artificial intelligence),music,regression analysis,pattern classification,machine learning program,random forest regression,geography,k-nearest neighbor method,regression,knn,data mining,random forest regression method,geographical ethnomusicology,earth,art,training data,feature extraction
Training set,Data mining,Regression,Computer science,Feature extraction,Artificial intelligence,Great-circle distance,Random forest,The arts,Cross-validation,Machine learning
Conference
ISSN
Citations 
PageRank 
1550-4786
9
0.60
References 
Authors
8
3
Name
Order
Citations
PageRank
Fang Zhou1132.18
Claire Q290.60
Ross D. King31774194.85