Diversify and combine: Improving word alignment for machine translation on low-resource languages

Bing Xiang; Yonggang Deng; Bowen Zhou

ACL 2010

Conference paper

01 Dec 2010

Diversify and combine: Improving word alignment for machine translation on low-resource languages

Abstract

We present a novel method to improve word alignment quality and eventually the translation performance by producing and combining complementary word alignments for low-resource languages. Instead of focusing on the improvement of a single set of word alignments, we generate multiple sets of diversified alignments based on different motivations, such as linguistic knowledge, morphology and heuristics. We demonstrate this approach on an English-to-Pashto translation task by combining the alignments obtained from syntactic reordering, stemming, and partial words. The combined alignment outperforms the baseline alignment, with significantly higher F-scores and better translation performance. © 2010 Association for Computational Linguistics.

Conference paper