Learning to match transient sound events using attentional similarity for few-shot sound recognition. - Citegraph

Paper Info

Title
Learning to match transient sound events using attentional similarity for few-shot sound recognition.

Abstract
In this paper, we introduce a novel attentional similarity module for the problem of few-shot sound recognition. Given a few examples of an unseen sound event, a classifier must be quickly adapted to recognize the new sound event without much fine-tuning. The proposed attentional similarity module can be plugged into any metric-based learning method for few-shot learning, allowing the resulting model to especially match related short sound events. Extensive experiments on two datasets show that the proposed module consistently improves the performance of five different metric-based learning methods for few-shot sound recognition. The relative improvement ranges from +4.1% to +7.7% for 5-shot 5-way accuracy for the ESC-50 dataset, and from +2.1% to +6.5% for noiseESC-50. Qualitative results demonstrate that our method contributes in particular to the recognition of transient sound events.

Year	DOI	Venue
2018	10.1109/icassp.2019.8682558	ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords	Field	DocType
Training,Transient analysis,Task analysis,Feature extraction,Learning systems,Image color analysis,Noise measurement	Sound recognition,Computer science,Speech recognition,Classifier (linguistics)	Journal
Volume	ISSN	Citations
abs/1812.01269	1520-6149	4
PageRank	References	Authors
0.48	12	4

Authors (4 rows)

Cited by (4 rows)

References (12 rows)

Name	Order	Citations	PageRank
Szu-Yu Chou	1	49	6.82
Kai-Hsiang Cheng	2	4	0.81
Jyh-Shing Roger Jang	3	525	56.34
Yi-Hsuan Yang	4	1022	84.71

1