Abstract |
---|
To reveal the importance of temporal precision in ground truth audio event labels, we collected precise (∼0.1 sec resolution) "strong" labels for a portion of the AudioSet dataset. We devised a temporally-strong evaluation set (including explicit negatives of varying difficulty) and a small strong-labeled training subset of 67k clips (compared to the original dataset’s 1.8M clips labeled at 10 sec... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ICASSP39728.2021.9414579 | ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Keywords | DocType | ISBN |
---|---|---|
Training, Conferences, Signal processing, Acoustics, Speech processing | Conference | 978-1-7281-7605-5 |
Citations | PageRank | References |
---|---|---|
1 | 0.40 | 0 |
Authors |
---|
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shawn Hershey | 1 | 10 | 2.38 |
Daniel P. W. Ellis | 2 | 4198 | 356.08 |
Eduardo Fonseca | 3 | 23 | 5.42 |
Lorena Álvarez | 4 | 504 | 36.47 |
Caroline Liu | 5 | 1 | 0.40 |
R. Channing Moore | 6 | 5 | 1.33 |
Manoj Plakal | 7 | 161 | 13.41 |