Paper
20 October 2022 Improving English to Chinese machine translation performance via adversarial training
Huaxin Pu, Donghui Guo
Author Affiliations +
Proceedings Volume 12451, 5th International Conference on Computer Information Science and Application Technology (CISAT 2022); 124513I (2022) https://doi.org/10.1117/12.2656585
Event: 5th International Conference on Computer Information Science and Application Technology (CISAT 2022), 2022, Chongqing, China
Abstract
The transformer has improved slow training, the most criticized disadvantage, of RNNs and achieves fast parallelism by self-attention mechanism. Also, the transformer can be enhanced to a great depth to fully exploit the characteristics of the deep neural network model and improve the model’s accuracy, which is widely used in machine translation. Deep models, however, have the inevitable disadvantage of being prone to overfitting. We consider introducing the adversarial training method in the translation model to improve the robustness of the model through regularization to alleviate the overfitting problem. We conducted analytical experiments on the English-Chinese translation dataset which shows that the adversarial training approach can improve translation quality.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Huaxin Pu and Donghui Guo "Improving English to Chinese machine translation performance via adversarial training", Proc. SPIE 12451, 5th International Conference on Computer Information Science and Application Technology (CISAT 2022), 124513I (20 October 2022); https://doi.org/10.1117/12.2656585
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Transformers

Neural networks

Computer programming

Statistical modeling

Data modeling

Translational research

RELATED CONTENT

Tax service volume forecasting based on informer
Proceedings of SPIE (December 02 2022)
Toxic detection based on RoBERTa and TF-IDF
Proceedings of SPIE (November 10 2022)
A novel text generation algorithm based on few-shot dataset
Proceedings of SPIE (November 30 2022)
Numerical reasoning based on transformer
Proceedings of SPIE (December 28 2022)
Trajectory-text retrieval model based on graph neural network
Proceedings of SPIE (February 03 2023)

Back to Top