Improving Machine Translation through Linguistic Treatment of Phrasal Verbs

Valia Kordoni

In this talk we will focus on a linguistically informed method we have developed for integrating phrasal verbs into statistical machine translation (SMT) systems. In a case study involving English-to-Bulgarian SMT, we show that our method not only improves translation quality but also outperforms similar methods previously applied to the same task. We attribute this to the fact that, in contrast to previous work on the subject, we employ detailed linguistic information. We will show that the features describing phrasal verbs as idiomatic or compositional contribute most to the improved translation quality achieved by our method.