Abstract | ||
---|---|---|
Sound scene geotagging is a new topic of research which has evolved from acoustic scene classification. It is motivated by the idea of audio surveillance. Not content with only describing a scene in a recording, a machine which can locate where the recording was captured would be of use to many. In this paper we explore a series of common audio data augmentation methods to evaluate which best improves the accuracy of audio geotagging classifiers. Our work improves on the state-of-the-art city geotagging method by 23% in terms of classification accuracy. |
Year | DOI | Venue |
---|---|---|
2021 | 10.21437/Interspeech.2021-1837 | Interspeech |
DocType | Citations | PageRank |
Conference | 0 | 0.34 |
References | Authors | |
0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Helen L. Bear | 1 | 30 | 7.10 |
Veronica Morfi | 2 | 0 | 0.68 |
Emmanouil Benetos | 3 | 557 | 52.48 |