Back Translation: Translating Monolingual Target Data Back into the Source Language as Additional Training Data
- Back Translation
The overall training procedure is as follows:
First, an NMT model is trained on paired (parallel) training data.
Then, the trained NMT model performs back-translation, i.e. automatic translation of monolingual target-language text into the source language, which yields additional synthetic parallel data.
During further training, the synthetic parallel data is mixed with the original (human-translated) parallel data.
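The three steps above can be sketched in a few lines of Python. This is only a toy illustration, not the paper's implementation: `train_word_table` and `translate` are hypothetical stand-ins for NMT training and decoding (a word-for-word lookup table that assumes equal-length, monotonically aligned sentences), and the sketch assumes the model used for back-translation is trained on the same paired data in the target-to-source direction.

```python
from typing import Dict, List, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def train_word_table(pairs: List[Tuple[str, str]]) -> Dict[str, str]:
    """Toy stand-in for NMT training (hypothetical helper).

    Learns a word-for-word table from (input, output) sentence pairs,
    assuming both sides have the same length and a monotone alignment.
    """
    table: Dict[str, str] = {}
    for inp, out in pairs:
        for a, b in zip(inp.split(), out.split()):
            table[a] = b
    return table


def translate(table: Dict[str, str], sentence: str) -> str:
    """Toy stand-in for decoding: look up each word, copy unknown words."""
    return " ".join(table.get(word, word) for word in sentence.split())


def back_translate(parallel: List[Pair], target_mono: List[str]) -> List[Pair]:
    """Create synthetic (source, target) pairs from monolingual target text."""
    # Step 1: train on the paired data, here in the target -> source direction.
    reversed_pairs = [(tgt, src) for src, tgt in parallel]
    tgt_to_src = train_word_table(reversed_pairs)
    # Step 2: translate each monolingual target sentence into the source language.
    return [(translate(tgt_to_src, tgt), tgt) for tgt in target_mono]


def mix(parallel: List[Pair], synthetic: List[Pair]) -> List[Pair]:
    """Step 3: combine human-translated and synthetic pairs for further training."""
    return list(parallel) + list(synthetic)


if __name__ == "__main__":
    parallel = [("hallo welt", "hello world"), ("guten morgen", "good morning")]
    target_mono = ["hello morning", "good world"]

    synthetic = back_translate(parallel, target_mono)
    for src, tgt in mix(parallel, synthetic):
        print(f"{src} -> {tgt}")
```

Note that the target side of every synthetic pair is genuine text, while only the source side is machine-generated. The word-level lookup here is purely for illustration; in practice both steps would use a real NMT system.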
Reference
Improving Neural Machine Translation Models with Monolingual Data, Back Translation, by University of Edinburgh, 2016 ACL, Over 1900 Citations (Sik-Ho Tsang @ Medium)
Natural Language Processing, NLP, Neural Machine Translation, NMT