Title
Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing
Abstract
Singing voice separation aims to separate music into vocals and accompaniment components. One of the major constraints for the task is the limited amount of training data with separated vocals. Data augmentation techniques such as random source mixing have been shown to make better use of existing data and mildly improve model performance. We propose a novel data augmentation technique, chromagram-based pitch-aware remixing, where music segments with high pitch alignment are mixed. By performing controlled experiments in both supervised and semi-supervised settings, we demonstrate that training models with pitch-aware remixing significantly improves the test signal-to-distortion ratio (SDR).
Year
DOI
Venue
2022
10.1109/ICASSP43922.2022.9747612
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Keywords
DocType
ISSN
Singing voice separation,augmentation,pitch-aware,chromagram,self-training
Conference
1520-6149
ISBN
Citations 
PageRank 
978-1-6654-0541-6
0
0.34
References 
Authors
6
7
Name
Order
Citations
PageRank
Siyuan Yuan100.34
Zhepei Wang230.78
Umut Isik3103.33
Ritwik Giri416412.93
J.-M. Valin574066.29
Michael M. Goodwin600.34
Arvindh Krishnaswamy7123.37