Title
The Beat Spectrum: A New Approach To Rhythm Analysis
Abstract
We introduce the beat spectrum, a new method of automatically characterizing the rhythm and tempo of music and audio. The beat spectrum is a measure of acoustic self-similarity as a func- tion of time lag. Highly structured or repetitive music will have strong beat spectrum peaks at the repetition times. This reveals both tempo and the relative strength of particular beats, and there- fore can distinguish between different kinds of rhythms at the same tempo. We also introduce the beat spectrogram which graphically illustrates rhythm variation over time. Unlike previ- ous approaches to tempo analysis, the beat spectrum does not depend on particular attributes such as energy or frequency, and thus will work for any music or audio in any genre. We present tempo estimation results which are accurate to within 1% for a variety of musical genres. This approach has a variety of applica- tions, including music retrieval by similarity and automatically generating music videos. Anyone who has ever tapped a foot in time to music has per- formed rhythm analysis. Though simple for humans, this task is considerably more difficult to automate. We introduce a new mea- sure of tempo analysis called the beat spectrum. This is a measure of acoustic self-similarity versus lag time, computed from a repre- sentation of spectrally similarity. Peaks in the beat spectrum cor- respond to major rhythmic components of the source audio. The repetition time of each component can be determined by the lag time of the corresponding peak, while the relative amplitudes of different peaks reflects the strengths of their corresponding rhyth- mic components. We also present the beat spectrogram which graphically illustrates rhythmic variation over time. The beat spectrogram is an image formed from the beat spectrum over suc- cessive windows. Strong rhythmic components are visible as bright bars in the beat spectrogram, making changes in tempo or time signature visible. In addition, a measure of audio novelty can be computed that measures how novel the source audio is at any time (2). Instances when this measure is large correspond to sig- nificant audio changes. Periodic peaks correspond to rhythmic periodicity in the music. In the final section, we present various applications of the beat spectrum, including music retrieval by rhythmic similarity, an "automatic DJ" that can smoothly sequence music with similar tempos and automatic music video generation.
Year
DOI
Venue
2001
10.1109/ICME.2001.1237863
ICME
Field
DocType
Citations 
Mel-frequency cepstrum,Counting,Time signature,Pattern recognition,Computer science,Spectrogram,Transform coding,Speech recognition,Beat (music),Artificial intelligence,Relative strength,Rhythm
Conference
75
PageRank 
References 
Authors
11.26
3
2
Name
Order
Citations
PageRank
Jonathan Foote11625176.16
Shingo Uchihashi243141.70