Estimating Grammar Correctness for a Priori Estimation of Machine Translation Post-Editing Effort

Nicholas H. Kirk, Guchun Zhang, Georg Groh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present a supervised learning pilot application for estimating Machine Translation (MT) output reusability, in view of supporting a human post-editor of MT content. We train our model on typed dependencies (labeled grammar relationships) extracted from human reference and raw MT data, to then predict grammar relationship correctness values that we aggregate to provide a binary segment-level evaluation. In view of scaling up to larger data, we provide implemented Naïve Bayes and Stochastic Gradient Descent with Support Vector Machine loss function approaches and their evaluation, and verify the correlation of predicted values with human judgement.

Original languageEnglish
Title of host publicationEACL 2014 - 14th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Workshop on Humans and Computer-assisted Translation, HaCaT 2014
PublisherAssociation for Computational Linguistics (ACL)
Pages16-21
Number of pages6
ISBN (Electronic)9781937284824
StatePublished - 2014
Event14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014 - Gothenburg, Sweden
Duration: 26 Apr 201430 Apr 2014

Publication series

NameEACL 2014 - 14th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Workshop on Humans and Computer-assisted Translation, HaCaT 2014

Conference

Conference14th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2014
Country/TerritorySweden
CityGothenburg
Period26/04/1430/04/14

Fingerprint

Dive into the research topics of 'Estimating Grammar Correctness for a Priori Estimation of Machine Translation Post-Editing Effort'. Together they form a unique fingerprint.

Cite this