Title
Blind Source Separation with Optimal Transport Non-negative Matrix Factorization.
Abstract
Optimal transport as a loss for machine learning optimization problems has recently gained a lot of attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (STFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport, NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where frequencies represented in the input spectrograms may be different from one spectrogram to another.
Year
DOI
Venue
2018
10.1186/s13634-018-0576-2
EURASIP Journal on Advances in Signal Processing
Keywords
Field
DocType
NMF,Speech,BSS,Optimal transport
Spectrogram,Computer science,Matrix decomposition,Short-time Fourier transform,Algorithm,Fourier transform,Speech recognition,Non-negative matrix factorization,Audio signal processing,Optimization problem,Blind signal separation
Journal
Volume
Issue
ISSN
abs/1802.05429
1
1687-6180
Citations 
PageRank 
References 
1
0.36
15
Authors
4
Name
Order
Citations
PageRank
Antoine Rolet1272.54
Vivien Seguy2152.78
Mathieu Blondel34055174.33
Hiroshi Sawada410.70