Predicting Post-Editor Profiles from the Translation Process

Karan Singla, David Orrego-Carmona, Ashleigh Rhea Gonzales, Michael Carl, Srinivas Bangalore

    Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

    43 Downloads (Pure)

    Abstract

    The purpose of the current investigation is to predict post-editor profiles based on user behaviour and demographics using machine learning techniques to gain a better understanding of post-editor styles. Our study extracts process unit features from the CasMaCat LS14 database from the CRITT Translation Process Research Database (TPR-DB). The analysis has two main research goals: We create n-gram models based on user activity and part-of-speech sequences
    to automatically cluster post-editors, and we use discriminative classifier models to characterize post-editors based on a diverse range of translation process features. The classification and clustering of participants resulting from our study suggest this type of exploration could be used as a tool to develop new translation tool features or customization possibilities.
    OriginalsprogEngelsk
    TitelProceedings of the Workshop on Interactive and Adaptive Machine Translation
    RedaktørerFrancisco Casacuberta, Marcello Federico, Philipp Koehn
    Antal sider10
    ForlagAssociation for Machine Translation in the Americas (AMTA)
    Publikationsdato2014
    Sider51-60
    StatusUdgivet - 2014
    BegivenhedThe 11th Conference of the Association for Machine Translation in the Americas 2014 - Vancouver, Canada
    Varighed: 22 okt. 201426 okt. 2014
    Konferencens nummer: 11
    http://amta2014.amtaweb.org/

    Konference

    KonferenceThe 11th Conference of the Association for Machine Translation in the Americas 2014
    Nummer11
    Land/OmrådeCanada
    ByVancouver
    Periode22/10/201426/10/2014
    Internetadresse

    Citationsformater