Title
Continuous Birdsong Recognition Using Gaussian Mixture Modeling of Image Shape Features
Abstract
Traditional birdsong recognition approaches identify bird species using acoustic features derived from the acoustic model of speech production or the perceptual model of the human auditory system. In this paper, a new feature descriptor based on image shape features is proposed. It identifies bird species by recognizing fixed-duration birdsong segments whose spectrograms are treated as gray-level images. The MPEG-7 angular radial transform (ART) descriptor, which compactly and efficiently describes the gray-level variations within an image region in both the angular and radial directions, is employed to extract shape features from the spectrogram image. To capture both frequency and temporal variations within a birdsong segment using ART, a sector expansion algorithm is proposed that transforms the spectrogram image into a corresponding sector image, so that the frequency and temporal axes of the spectrogram align with the radial and angular directions of the ART basis functions, respectively. For the classification of 28 bird species using Gaussian mixture models (GMM), the proposed ART descriptor achieves best classification accuracies of 86.30% and 94.62% for 3-second and 5-second birdsong segments, respectively, outperforming traditional descriptors such as LPCC, MFCC, and TDMFCC.
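The sector mapping and ART feature extraction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 90-degree sector angle, the 32x32 image size, and the nearest-neighbour sampling are all assumptions made for illustration. The ART basis used here is the standard MPEG-7 form, V_nm(rho, theta) = (1/2pi) exp(j m theta) R_n(rho), with R_0 = 1 and R_n = 2 cos(pi n rho) for n > 0, and 3 radial by 12 angular orders.

```python
import cmath
import math


def polar_coords(row, col, size):
    """Polar coordinates of a pixel centre, with the image scaled to [-1, 1]^2."""
    x = (col + 0.5) / size * 2.0 - 1.0
    y = (row + 0.5) / size * 2.0 - 1.0
    return math.hypot(x, y), math.atan2(y, x) % (2.0 * math.pi)


def sector_expand(spec, size=32, sector_angle=math.pi / 2):
    """Map a spectrogram (list of frequency rows over time) onto a sector image.

    Assumed layout (hypothetical, not taken from the paper):
    radius <-> frequency, angle <-> time, sector spanning [0, sector_angle].
    """
    n_freq, n_time = len(spec), len(spec[0])
    image = [[0.0] * size for _ in range(size)]
    for row in range(size):
        for col in range(size):
            rho, theta = polar_coords(row, col, size)
            if rho <= 1.0 and theta <= sector_angle:
                # Nearest-neighbour lookup into the spectrogram.
                f = round(rho * (n_freq - 1))
                t = round(theta / sector_angle * (n_time - 1))
                image[row][col] = spec[f][t]
    return image


def art_features(image, n_radial=3, n_angular=12):
    """Magnitudes of the ART coefficients, normalised by |F_00|."""
    size = len(image)
    coeffs = [[0j] * n_angular for _ in range(n_radial)]
    for row in range(size):
        for col in range(size):
            rho, theta = polar_coords(row, col, size)
            value = image[row][col]
            if rho > 1.0 or value == 0.0:
                continue  # outside the unit disk, or no contribution
            for n in range(n_radial):
                radial = 1.0 if n == 0 else 2.0 * math.cos(math.pi * n * rho)
                for m in range(n_angular):
                    # Project onto the complex conjugate of the basis function.
                    basis_conj = radial * cmath.exp(-1j * m * theta) / (2.0 * math.pi)
                    coeffs[n][m] += value * basis_conj
    mags = [abs(c) for per_order in coeffs for c in per_order]
    return [v / mags[0] for v in mags[1:]]


# Synthetic 16-bin x 20-frame "spectrogram" used only to exercise the sketch.
spec = [[1.0 + 0.1 * f + 0.01 * t for t in range(20)] for f in range(16)]
features = art_features(sector_expand(spec))
```

Summing over Cartesian pixels approximates the polar integral because the pixel area element already accounts for the rho factor. The constant pixel area cancels in the normalisation by |F_00|, which leaves 35 features per segment under these assumptions.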
Year: 2013
DOI: 10.1109/TMM.2012.2229969
Venue: IEEE Transactions on Multimedia
Keywords: Subspace constraints, Birds, Feature extraction, Image recognition, Vectors, Spectrogram, Acoustics
Field: Computer vision, Mel-frequency cepstrum, Pattern recognition, Spectrogram, Computer science, Feature extraction, Gaussian, Artificial intelligence, Basis function, Gaussian process, Mixture model, Acoustic model
DocType: Journal
Volume: 15
Issue: 2
ISSN: 1520-9210
Citations: 9
PageRank: 0.60
References: 19
Authors (4)
Name              Order   Citations   PageRank
Chang-Hsing Lee   1       439         31.86
Sheng-Bin Hsu     2       9           0.60
Jau-Ling Shih     3       217         11.64
Chih-Hsun Chou    4       207         22.27