Abstract | ||
---|---|---|
Automatic transcription of monophonic/polyphonic music is a challenging task due to the lack of availability of large amounts of transcribed data. In this paper, we propose a data augmentation method that converts natural speech to singing voice based on vocoder based speech synthesizer. This approach, called voice to singing (V2S), performs the voice style conversion by modulating the F0 contour ... |
Year | DOI | Venue |
---|---|---|
2021 | 10.1109/ICASSP39728.2021.9415096 | ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Keywords | DocType | ISBN |
Vocoders,Synthesizers,Conferences,Natural languages,Transfer learning,Buildings,Speech recognition | Conference | 978-1-7281-7605-5 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sakya Basak | 1 | 0 | 0.34 |
Shrutina Agarwal | 2 | 0 | 0.34 |
Sriram Ganapathy | 3 | 252 | 39.62 |
Naoya Takahashi | 4 | 34 | 9.44 |