arXiv Analytics

Sign in

arXiv:2205.10068 [cs.CL]AbstractReferencesReviewsResources

Understanding and Mitigating the Uncertainty in Zero-Shot Translation

Wenxuan Wang, Wenxiang Jiao, Shuo Wang, Zhaopeng Tu, Michael R. Lyu

Published 2022-05-20Version 1

Zero-shot translation is a promising direction for building a comprehensive multilingual neural machine translation (MNMT) system. However, its quality is still not satisfactory due to off-target issues. In this paper, we aim to understand and alleviate the off-target issues from the perspective of uncertainty in zero-shot translation. By carefully examining the translation output and model confidence, we identify two uncertainties that are responsible for the off-target issues, namely, extrinsic data uncertainty and intrinsic model uncertainty. Based on the observations, we propose two light-weight and complementary approaches to denoise the training data for model training, and mask out the vocabulary of the off-target languages in inference. Extensive experiments on both balanced and unbalanced datasets show that our approaches significantly improve the performance of zero-shot translation over strong MNMT baselines. Qualitative analyses provide insights into where our approaches reduce off-target translations

Related articles: Most relevant | Search more
arXiv:2305.08706 [cs.CL] (Published 2023-05-15)
Understanding and Bridging the Modality Gap for Speech Translation
arXiv:2102.10437 [cs.CL] (Published 2021-02-20)
Understanding and Enhancing the Use of Context for Machine Translation
arXiv:2206.14576 [cs.CL] (Published 2022-06-21)
Using cognitive psychology to understand GPT-3