
Publication

Can Machine Learning Algorithms Improve Phrase Selection in Hybrid Machine Translation?

Christian Federmann
In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra) (ESIRMT-HyTra-12), located at the 13th Conference of the European Chapter of the Association for Computational Linguistics, April 23-24, Avignon, France, pages 113-118, European Chapter of the Association for Computational Linguistics (EACL), 4/2012.

Abstract

We describe a substitution-based, hybrid machine translation (MT) system that has been extended with a machine learning component controlling its phrase selection. Our approach builds on a rule-based MT (RBMT) system that creates template translations. Using the generation parse tree of the RBMT system and standard word alignment computation, we identify potential “translation snippets” from one or more translation engines that could be substituted into our translation templates. The substitution process is controlled by a binary classifier trained on feature vectors from the different MT engines. With a set of manually annotated training data, we observe improvements in BLEU score over a baseline version of the hybrid system.
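
As a rough illustration of the classifier-controlled substitution step described in the abstract, the following minimal sketch trains a binary classifier on feature vectors and uses its decision to accept or reject each candidate snippet. Everything in it is an assumption made for illustration: the abstract does not name the learning algorithm or the features, so a plain perceptron stands in for the classifier, and the names `Candidate`, `train_perceptron`, `predict`, and `apply_substitutions`, as well as the two-dimensional toy feature vectors, are hypothetical. Candidate spans are assumed not to overlap.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Candidate:
    """A hypothetical substitution candidate for one span of the template."""
    span: Tuple[int, int]   # token span [start, end) in the template translation
    snippet: List[str]      # replacement tokens proposed by another MT engine
    features: List[float]   # illustrative feature vector for this candidate


Model = Tuple[List[float], float]  # (weights, bias)


def train_perceptron(data: Sequence[Tuple[List[float], int]], epochs: int = 20) -> Model:
    """Train a simple perceptron; label 1 means 'substitute', 0 means 'keep template'."""
    weights = [0.0] * len(data[0][0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in data:
            score = sum(w * xi for w, xi in zip(weights, x)) + bias
            error = y - (1 if score > 0 else 0)
            if error:
                # Standard perceptron update on a misclassified example.
                weights = [w + error * xi for w, xi in zip(weights, x)]
                bias += error
    return weights, bias


def predict(model: Model, x: Sequence[float]) -> bool:
    """Return True if the classifier votes for substitution."""
    weights, bias = model
    return sum(w * xi for w, xi in zip(weights, x)) + bias > 0


def apply_substitutions(template: Sequence[str],
                        candidates: Sequence[Candidate],
                        model: Model) -> List[str]:
    """Replace template spans with snippets whenever the classifier accepts them."""
    output = list(template)
    # Process spans right to left so that earlier indices stay valid
    # even when a snippet has a different length than the span it replaces.
    for cand in sorted(candidates, key=lambda c: c.span[0], reverse=True):
        if predict(model, cand.features):
            start, end = cand.span
            output[start:end] = cand.snippet
    return output


# Toy annotated data (made up): each pair is (feature vector, label).
training_data = [([1.0, 0.0], 1), ([0.0, 1.0], 0)]
model = train_perceptron(training_data)

template = ["the", "house", "is", "green"]
candidates = [
    Candidate(span=(1, 2), snippet=["building"], features=[1.0, 0.0]),
    Candidate(span=(3, 4), snippet=["verdant"], features=[0.0, 1.0]),
]
print(apply_substitutions(template, candidates, model))
```

In this toy setup the first candidate is accepted and the second rejected, so the template keeps its original final token. A real system would derive the spans from the RBMT parse tree and word alignments, and the labels from manual annotation, as the abstract describes.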