Abstract | ||
---|---|---|
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour and regressor based dynamic threshold approaches are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrated the superior performance of the proposed approaches. |
Year | DOI | Venue |
---|---|---|
2017 | 10.21437/Interspeech.2017-746 | 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION |
Keywords | Field | DocType |
acoustic event detection, multi-label classification, dynamic threshold | Pattern recognition,Event type,Computer science,Speech recognition,Acoustic event detection,Artificial intelligence,Polyphony | Conference |
ISSN | Citations | PageRank |
2308-457X | 2 | 0.43 |
References | Authors | |
0 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xianjun Xia | 1 | 12 | 3.02 |
Roberto Togneri | 2 | 814 | 48.33 |
Ferdous Ahmed Sohel | 3 | 623 | 31.78 |
David Huang | 4 | 4 | 1.12 |