Title
Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification
Abstract
The present paper focuses on the investigation of various audio pattern classifiers in broadcast-audio semantic analysis, using radio-programme-adaptive classification strategies with supervised training. Multiple neural network topologies and training configurations are evaluated and compared in combination with feature-extraction, ranking and feature-selection procedures. Different pattern classification taxonomies are implemented, using programme-adapted multi-class definitions and hierarchical schemes. Hierarchical and hybrid classification taxonomies are deployed in speech analysis tasks, facilitating efficient speaker recognition/identification, speech/music discrimination, and generally speech/non-speech detection-segmentation. Exhaustive qualitative and quantitative evaluation is conducted, including indicative comparison with non-neural approaches. Hierarchical approaches offer classification-similarities for easy adaptation to generic radio-broadcast semantic analysis tasks. The proposed strategy exhibits increased efficiency in radio-programme content segmentation and classification, which is one of the most demanding audio semantics tasks. This strategy can be easily adapted in broader audio detection and classification problems, including additional real-world speech-communication demanding scenarios.
Year
2012
DOI
10.1016/j.specom.2012.01.004
Venue
Speech Communication
Keywords
generic radio-broadcast semantic analysis, hierarchical scheme, different pattern classification taxonomy, radio-programme-adaptive classification strategy, hierarchical approach, broader audio detection, hybrid classification taxonomy, classification problem, radio-programme-adaptive pattern classification, broadcast-audio semantic analysis, demanding audio semantics task, broadcast-audio semantic analysis scenario, neural networks, content management
Field
Computer science, Speaker recognition, Artificial intelligence, Content management, Artificial neural network, Broadcasting, Pattern recognition, Ranking, Segmentation, Network topology, Speech recognition, Semantics, Machine learning
DocType
Journal
Volume
54
Issue
6
ISSN
0167-6393
Citations
9
PageRank
0.62
References
34
Authors
3
Name | Order | Citations | PageRank
R. Kotsakis | 1 | 36 | 5.76
G. Kalliris | 2 | 277 | 14.72
Charalampos Dimoulas | 3 | 104 | 12.35