Title
Towards Multi-Purpose Spectral Rhythm Features: An Application to Dance Style, Meter and Tempo Estimation.
Abstract
This paper addresses the extraction of multipurpose spectral rhythm features that simultaneously tackle a variety of rhythm analysis tasks, namely, dance style classification, meter estimation, and tempo estimation. The term spectral rhythm features emanates from the origin of the extracted features, which is the periodicity function PF, a spectral representation that encapsulates the salience of the rhythm frequencies. Two dimensionality reduction techniques applied on the PF to extract expressive and compact features are compared, namely, a linear transformation resulting from Principal Component Analysis and a nonlinear mapping derived from a Restricted Boltzmann Machine. Subsequently, the derived features were used as input to an SVM classifier for each task. Moreover, an additional method is proposed that reformulates the well-studied tempo estimation task as a combination of multiple binary classification sub-problems. Evaluation was performed on a large number of datasets demonstrating that the same set of features learned from the PF provide a robust rhythmic representation that achieved comparable results to the current state-of-the-art methods for the aforementioned tasks.
Year
DOI
Venue
2016
10.1109/TASLP.2016.2554283
IEEE/ACM Trans. Audio, Speech & Language Processing
Keywords
Field
DocType
Rhythm,Estimation,Feature extraction,Speech,Speech processing,Context
Restricted Boltzmann machine,Speech synthesis,Dimensionality reduction,Pattern recognition,Binary classification,Computer science,Speech recognition,Information extraction,Artificial intelligence,Linear map,Rhythm,Principal component analysis
Journal
Volume
Issue
ISSN
24
11
2329-9290
Citations 
PageRank 
References 
2
0.37
41
Authors
3
Name
Order
Citations
PageRank
Aggelos Gkiokas1354.64
Vassilios Katsouros27310.63
George Carayannis321538.14