Conference Paper (published)

Dropped Pronoun Generation for Dialogue Machine Translation

Details

Citation

Wang L, Zhang X, Tu Z, Li H & Liu Q (2016) Dropped Pronoun Generation for Dialogue Machine Translation. In: 2016 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings. 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20.03.2016-25.03.2016. Danvers, MA USA: IEEE, pp. 6110-6114. https://doi.org/10.1109/ICASSP.2016.7472851

Abstract
Dropped pronoun (DP) is a common problem in dialogue machine translation, in which pronouns are frequently dropped in the source sentence and thus are missing in its translation. In response to this problem, we propose a novel approach to improve the translation of DPs for dialogue machine translation. Firstly, we build a training data for DP generation, in which the DPs are automatically added according to the alignment information from a parallel corpus. Then we model the DP generation problem as a sequence labelling task, and develop a generation model based on recurrent neural networks and language models. Finally, we apply the DP generator to machine translation task by completing the source sentences with the missing pronouns. Experimental results show that our approach achieves a significant improvement of 1.7 BLEU points by recalling possible DPs in the source sentences.

Keywords
Machine Translation; Dialogue; Dropped Pronoun

StatusPublished
Publication date01/03/2016
Publication date online01/03/2016
URLhttp://hdl.handle.net/1893/23458
PublisherIEEE
Place of publicationDanvers, MA USA
ISBN978-1-4799-9988-0
Conference41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016)
Conference locationShanghai, China
Dates