Title
Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection
Abstract
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour and regressor based dynamic threshold approaches are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrated the superior performance of the proposed approaches.
Year
DOI
Venue
2017
10.21437/Interspeech.2017-746
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
Field
DocType
acoustic event detection, multi-label classification, dynamic threshold
Pattern recognition,Event type,Computer science,Speech recognition,Acoustic event detection,Artificial intelligence,Polyphony
Conference
ISSN
Citations 
PageRank 
2308-457X
2
0.43
References 
Authors
0
4
Name
Order
Citations
PageRank
Xianjun Xia1123.02
Roberto Togneri281448.33
Ferdous Ahmed Sohel362331.78
David Huang441.12