Analysis of MultiWord Expression Translation Errors in Statistical Machine Translation

Natalia Klyueva, Jeevanthi Liyanapathirana

    Publication: Contribution to book/anthology/report; Conference contribution in proceedings; Research; Peer-reviewed

    Abstract

    In this paper, we analyse the usage of multiword expressions (MWEs) in Statistical Machine Translation (SMT). We use the Moses SMT toolkit to train models for the French-English and Czech-Russian language pairs. For each language pair, two models were built: a baseline model without additional MWE data and a model enhanced with information on MWEs. For the French-English pair, we tried three methods of introducing the MWE data. For the Czech-Russian pair, we used just one method: adding automatically extracted data as a parallel corpus.
    Original language: English
    Title: Workshop Proceedings. Multi-Word Units in Machine Translation and Translation Technologies. MUMTTT 2015
    Editors: Gloria Corpas Pastor, Johanna Monti, Violeta Seretan, Ruslan Mitkov
    Place of publication: Geneva
    Publisher: Tradulex
    Publication date: 2015
    Pages: 87-91
    ISBN (Print): 9782970073697
    Status: Published - 2015
    Event: The 2nd Workshop on Multi-word Units in Machine Translation and Translation Technology. MUMTTT 2015: Part of EUROPHRAS 2015 - Malaga, Spain
    Duration: 1 Jul 2015 to 2 Jul 2015
    Conference number: 2
    http://typo.uni-konstanz.de/parseme/index.php/events/118-mumttt-workshop-at-europhras-15

    Conference

    Conference: The 2nd Workshop on Multi-word Units in Machine Translation and Translation Technology. MUMTTT 2015
    Number: 2
    Country/Territory: Spain
    City: Malaga
    Period: 1 Jul 2015 to 2 Jul 2015