Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation

Raffel, Matthew; Penney, Drew; Chen, Lizhong

arXiv:2307.01377 (cs)

[Submitted on 3 Jul 2023]

Abstract: Transformer models using segment-based processing have been an effective architecture for simultaneous speech translation. However, such models create a context mismatch between the training and inference environments, which limits translation accuracy. We address this issue with Shiftable Context, a simple yet effective scheme that keeps segment and context sizes consistent throughout training and inference, even in the presence of partially filled segments caused by the streaming nature of simultaneous translation. Shiftable Context is also broadly applicable to segment-based transformers for streaming tasks. Our experiments on the English-German, English-French, and English-Spanish language pairs from the MUST-C dataset show that, when applied to the Augmented Memory Transformer, a state-of-the-art model for simultaneous speech translation, the proposed scheme achieves average increases of 2.09, 1.83, and 1.95 BLEU across the wait-k values for the three language pairs, respectively, with minimal impact on computation-aware Average Lagging.
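
To illustrate the general idea of keeping the processed window at a constant size when a segment is only partially filled, here is a minimal sketch. It is not the authors' implementation: the function names `naive_window` and `shifted_window`, the split into left, center, and right context, and the frame-index arithmetic are all assumptions made for illustration, since the abstract does not specify these details.

```python
from typing import Tuple


def naive_window(available: int, seg_start: int,
                 left: int, center: int, right: int) -> Tuple[int, int]:
    """Clip the nominal window [seg_start - left, seg_start + center + right)
    to the frames received so far. The window shrinks when the stream has
    not yet filled the segment, so its size can differ from training."""
    start = max(0, seg_start - left)
    end = min(available, seg_start + center + right)
    return start, end


def shifted_window(available: int, seg_start: int,
                   left: int, center: int, right: int) -> Tuple[int, int]:
    """Hypothetical shifted variant: when the segment is partially filled,
    move the window start backward so the window keeps its full size
    (left + center + right) whenever enough past frames exist."""
    size = left + center + right
    end = min(available, seg_start + center + right)
    start = max(0, end - size)
    return start, end


if __name__ == "__main__":
    # 13 frames received so far; the segment nominally covers frames 4..13.
    print(naive_window(13, 8, left=4, center=4, right=2))
    print(shifted_window(13, 8, left=4, center=4, right=2))
```

With `left=4`, `center=4`, `right=2` and 13 frames received, the clipped window spans 9 frames while the shifted window spans the full 10, which is the kind of size consistency between training and inference that the abstract describes. Once the segment is completely filled, both functions return the same window.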

Comments:	Accepted at ICML 2023
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2307.01377 [cs.CL]
	(or arXiv:2307.01377v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2307.01377

Submission history

From: Lizhong Chen
[v1] Mon, 3 Jul 2023 22:11:51 UTC (387 KB)
