Abstract |
---|
While several self-supervised approaches for learning discrete speech representations have been proposed, it is unclear how these seemingly similar approaches relate to each other. In this paper, we consider a generative model with discrete latent variables that learns a discrete representation for speech. The objective of learning the generative model is formulated as information-theoretic co-training. Besides its wide generality, the objective can be optimized with several approaches, subsuming HuBERT-like training and vector quantization for learning discrete representations. Empirically, we find that the proposed approach learns discrete representations that are highly correlated with phonetic units, more so than representations learned with HuBERT-like training or vector quantization. |
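As background for one of the baselines named in the abstract, the following is a minimal sketch of vector quantization applied to frame-level speech features: each frame is assigned the index of its nearest codebook vector, yielding a discrete unit sequence. The function name, feature dimensionality, and codebook size are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def quantize_frames(features, codebook):
    """Assign each speech frame to its nearest codebook entry.

    features: (T, D) array of frame-level features (e.g. MFCCs or encoder outputs).
    codebook: (K, D) array of code vectors.
    Returns a (T,) array of discrete unit indices.
    """
    # Squared Euclidean distance between every frame and every code vector.
    dists = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    return dists.argmin(axis=1)

# Toy usage (hypothetical sizes): 100 frames of 39-dim features, 50 discrete units.
rng = np.random.default_rng(0)
frames = rng.normal(size=(100, 39))
codebook = rng.normal(size=(50, 39))
units = quantize_frames(frames, codebook)  # discrete representation, shape (100,)
```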
Year | DOI | Venue |
---|---|---|
2022 | 10.21437/Interspeech.2022-530 | Conference of the International Speech Communication Association (INTERSPEECH) |
DocType | Citations | PageRank
---|---|---|
Conference | 0 | 0.34
References | Authors
---|---|
0 | 2
Name | Order | Citations | PageRank |
---|---|---|---|
Sung-Lin Yeh | 1 | 0 | 0.68 |
Hao Tang | 2 | 15 | 7.02 |