MTL-SLT: Multi-Task Learning for Spoken Language Tasks - Citegraph

Paper Info

Title
MTL-SLT: Multi-Task Learning for Spoken Language Tasks

Abstract
Language understanding in speech-based systems has attracted extensive interest from both academic and industrial communities in recent years with the growing demand for voice-based applications. Prior works focus on independent research by the automatic speech recognition (ASR) and natural language processing (NLP) communities, or on jointly modeling the speech and NLP problems focusing on a single dataset or single NLP task. To facilitate the development of spoken language research, we introduce MTL-SLT, a multi-task learning framework for spoken language tasks. MTL-SLT takes speech as input, and outputs transcription, intent, named entities, summaries, and answers to text queries, supporting the tasks of spoken language understanding, spoken summarization and spoken question answering respectively. The proposed framework benefits from three key aspects: 1) pre-trained sub-networks of ASR model and language model; 2) multi-task learning objective to exploit shared knowledge from different tasks; 3) end-to-end training of ASR and downstream NLP task based on sequence loss. We obtain state-of-the-art results on spoken language understanding tasks such as SLURP and ATIS. Spoken summarization results are reported on a new dataset: Spoken-Gigaword.

Year	DOI	Venue
2022	10.18653/v1/2022.nlp4convai-1.11	PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI
DocType	Volume	Citations
Conference	Proceedings of the 4th Workshop on NLP for Conversational AI	0
PageRank	References	Authors
0.34	0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Zhiqi Huang	1	0	1.35
Milind Rao	2	0	0.34
Anirudh Raju	3	0	1.69
Zhe Zhang	4	0	0.34
Bach Bui	5	0	0.34
Chul Lee	6	0	0.34

1