Abstract | ||
---|---|---|
Generative models based on subband amplitude envelopes of natural sounds have resulted in convincing synthesis, showing subband amplitude modulation to be a crucial component of auditory perception. Probabilistic latent variable analysis can be particularly insightful, but existing approaches don't incorporate prior knowledge about the physical behaviour of amplitude envelopes, such as exponential decay or feedback. We use latent force modelling, a probabilistic learning paradigm that encodes physical knowledge into Gaussian process regression, to model correlation across spectral subband envelopes. We augment the standard latent force model approach by explicitly modelling dependencies across multiple time steps. Incorporating this prior knowledge strengthens the interpretation of the latent functions as the source that generated the signal. We examine this interpretation via an experiment showing that sounds generated by sampling from our probabilistic model are perceived to be more realistic than those generated by comparative models based on nonnegative matrix factorisation, even in cases where our model is outperformed from a reconstruction error perspective. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1007/978-3-319-93764-9_25 | Lecture Notes in Computer Science |
Keywords | DocType | Volume |
Latent force model,Gaussian processes,Natural sounds,Generative model | Conference | 10891 |
ISSN | Citations | PageRank |
0302-9743 | 0 | 0.34 |
References | Authors | |
7 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
William J. Wilkinson | 1 | 0 | 2.70 |
joshua d reiss | 2 | 119 | 22.72 |
Dan Stowell | 3 | 209 | 21.84 |