WMT 2012 Table of Contents

WMT 2012

7th Workshop on

Statistical Machine Translation

Proceedings of the Workshop

June 7-8, 2012

Montréal, Canada

Conference program

Table of Contents

Putting Human Assessments of Machine Translation Systems in Order

Adam Lopez . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

Findings of the 2012 Workshop on Statistical Machine Translation

Chris Callison-Burch, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut and Lucia Specia …………10

Semantic Textual Similarity for MT evaluation

Julio Castillo and Paula Estrella . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

Improving AMBER, an MT Evaluation Metric

Boxing Chen, Roland Kuhn and George Foster . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59

TerrorCat: a Translation Error Categorization-based MT Quality Metric

Mark Fishel, Rico Sennrich, Maja Popovi´c and Ondˇrej Bojar. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64

Class error rates for evaluation of machine translation output

Maja Popovic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71

SPEDE: Probabilistic Edit Distance Metrics for MT Evaluation

Mengqiu Wang and Christopher Manning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

Quality estimation for Machine Translation output using linguistic analysis and decoding features

Eleftherios Avramidis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

Black Box Features for the WMT 2012 Quality Estimation Shared Task

Christian Buck . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

Linguistic Features for Quality Estimation

Mariano Felice and Lucia Specia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

PRHLT Submission to the WMT12 Quality Estimation Task

Jesus Gonzalez-Rubio, Alberto Sanchıs and Francisco Casacuberta . . . . . . . . . . . . . . . . . . . . . . . . 104

Tree Kernels for Machine Translation Quality Estimation

Christian Hardmeier, Joakim Nivre and J¨org Tiedemann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

LORIA System for the WMT12 Quality Estimation Shared Task

David Langlois, Sylvain Raybaud and Kamel Sma¨ıli . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114

Quality Estimation: an experimental study using unsupervised similarity measures

Erwan Moreau and Carl Vogel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

The UPC Submission to the WMT 2012 Shared Task on Quality Estimation

Daniele Pighin, Meritxell Gonzalez and Lluıs Marquez . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127

Morpheme- and POS-based IBM1 and language model scores for translation quality estimation

Maja Popovic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133

DCU-Symantec Submission for the WMT 2012 Quality Estimation Task

Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann Roturier, Rasul Samad Zadeh Kaljahi and Fred Hollowood . . . . . . 138

The SDL Language Weaver Systems in the WMT12 Quality Estimation Shared Task

Radu Soricut, Nguyen Bach and Ziyuan Wang. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .145

Regression with Phrase Indicators for Estimating MT Quality

Chunyang Wu and Hai Zhao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152

Non-Linear Models for Confidence Estimation

Yong Zhuang, Guillaume Wisniewski and François Yvon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

Combining Quality Prediction and System Selection for Improved Automatic Translation Output

Radu Soricut and Sushant Narsale . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163

Match without a Referee: Evaluating MT Adequacy without Reference Translations

Yashar Mehdad, Matteo Negri and Marcello Federico . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171

Comparing human perceptions of post-editing effort with post-editing operations

Maarit Koponen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181

Review of Hypothesis Alignment Algorithms for MT System Combination via Confusion Network Decoding

Antti-Veikko Rosti, Xiaodong He, Damianos Karakos, Gregor Leusch, Yuan Cao, Markus Freitag,

Spyros Matsoukas, Hermann Ney, Jason Smith and Bing Zhang. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .191

On Hierarchical Re-ordering and Permutation Parsing for Phrase-based Decoding

Colin Cherry, Robert C. Moore and Chris Quirk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200

CCG Syntactic Reordering Models for Phrase-based Machine Translation

Dennis Nolan Mehay and Christopher Hardie Brew . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210

Using Categorial Grammar to Label Translation Rules

Jonathan Weese, Chris Callison-Burch and Adam Lopez . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222

Using Syntactic Head Information in Hierarchical Phrase-Based Translation

Junhui Li, Zhaopeng Tu, Guodong Zhou and Josef van Genabith . . . . . . . . . . . . . . . . . . . . . . . . . . 232

Fully Automatic Semantic MT Evaluation

Chi-kiu Lo, Anand Karthik Tumuluru and Dekai Wu . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243

Probes in a Taxonomy of Factored Phrase-Based Models

Ondˇrej Bojar, Bushra Jawaid and Amir Kamran . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253

The CMU-Avenue French-English Translation System

Michael Denkowski, Greg Hanneman and Alon Lavie . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261

Formemes in English-Czech Deep Syntactic MT

Ondrej Dusek, Zdenek Zabokrtsky, Martin Popel, Martin Majlis, Michal Novak and David Marecek ……………..267

The TALP-UPC phrase-based translation systems for WMT12: Morphology simplification and domain adaptation

Lluis Formiga, Carlos A. Henrıquez Q., Adolfo Hernandez, Jose B. Marino, Enric Monte and Jose A. R. Fonollosa . . . . . 275

Joshua 4.0: Packing, PRO, and Paraphrases

Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post and Chris Callison-Burch . . . . . . . . . . 283

Syntax-aware Phrase-based Statistical Machine Translation: System Description

Ulrich Germann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 292

QCRI at WMT12: Experiments in Spanish-English and German-English Machine Translation of News Text

Francisco Guzman, Preslav Nakov, Ahmed Thabet and Stephan Vogel . . . . . . . . . . . . . . . . . . . . . 298

The RWTH Aachen Machine Translation System for WMT 2012

Matthias Huck, Stephan Peitz, Markus Freitag, Malte Nuhn and Hermann Ney . . . . . . . . . . . . . 304

Machine Learning for Hybrid Machine Translation

Sabine Hunsicker, Chen Yu and Christian Federmann . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 312

Towards Effective Use of Training Data in Statistical Machine Translation

Philipp Koehn and Barry Haddow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317

Joint WMT 2012 Submission for the QUAERO Project

Freitag Markus, Peitz Stephan, Huck Matthias, Ney Hermann, Niehues Jan, Herrmann Teresa,

Waibel Alex, Hai-son Le, Lavergne Thomas, Allauzen Alexandre, Buschbeck Bianka, Crego Joseph

Maria and Senellart Jean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 322

LIMSI @ WMT12

Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurelien

Max, Artem Sokolov, Guillaume Wisniewski and Francois Yvon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330

UPM system for WMT 2012

Veronica Lopez-Ludena, Ruben San-Segundo and Juan M. Montero . . . . . . . . . . . . . . . . . . . . . . . 338

PROMT DeepHybrid system for WMT12 shared translation task

Alexander Molchanov . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 345

The Karlsruhe Institute of Technology Translation Systems for the WMT 2012

Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho and Alex Waibel 349

Kriya - The SFU System for Translation Task at WMT-12

Majid Razmara, Baskaran Sankaran, Ann Clifton and Anoop Sarkar . . . . . . . . . . . . . . . . . . . . . . . 356

DEPFIX: A System for Automatic Correction of Czech MT Outputs

Rudolf Rosa, David Marecek and Ondrej Dusek . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 362

LIUM’s SMT Machine Translation Systems for WMT 2012

Christophe Servan, Patrik Lambert, Anthony Rousseau, Holger Schwenk and Loıc Barrault . . 369

Selecting Data for English-to-Czech Machine Translation

Ales Tamchyna, Petra Galuscakova, Amir Kamran, Milos Stanojevic and Ondrej Bojar . . . . . . 374

DFKI’s SMT System for WMT 2012

David Vilar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 382

GHKM Rule Extraction and Scope-3 Parsing in Moses

Philip Williams and Philipp Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 388

Data Issues of the Multilingual Translation Matrix

Daniel Zeman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395

Constructing Parallel Corpora for Six Indian Languages via Crowdsourcing

Matt Post, Chris Callison-Burch and Miles Osborne. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .401

Twitter Translation using Translation-Based Cross-Lingual Retrieval

Laura Jehl, Felix Hieber and Stefan Riezler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 410

Analysing the Effect of Out-of-Domain Data on SMT Systems

Barry Haddow and Philipp Koehn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 422

Evaluating the Learning Curve of Domain Adaptive Statistical Machine Translation Systems

Nicola Bertoldi, Mauro Cettolo, Marcello Federico and Christian Buck . . . . . . . . . . . . . . . . . . . . 433

The Trouble with SMT Consistency

Marine Carpuat and Michel Simard . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 442

Phrase Model Training for Statistical Machine Translation with Word Lattices of Preprocessing Alternatives

Joern Wuebker and Hermann Ney . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 450

Leave-One-Out Phrase Model Training for Large-Scale Deployment

Joern Wuebker, Mei-Yuh Hwang and Chris Quirk. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .460

Direct Error Rate Minimization for Statistical Machine Translation

Tagyoung Chung and Michel Galley. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .468

Optimization Strategies for Online Large-Margin Learning in Machine Translation

Vladimir Eidelman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 480