WMT 2012
7th Workshop on
Statistical Machine Translation
Proceedings of the
Workshop
June 7-8, 2012
Table of Contents
Putting
Human Assessments of Machine Translation Systems in Order
Adam Lopez . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . 1
Findings of the 2012 Workshop on Statistical Machine
Translation
Chris Callison-Burch, Philipp Koehn, Christof Monz, Matt
Post, Radu Soricut and Lucia Specia …………10
Semantic Textual Similarity for MT evaluation
Julio Castillo and Paula Estrella . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . 52
Improving
AMBER, an MT Evaluation Metric
Boxing Chen, Roland Kuhn and George Foster . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
TerrorCat:
a Translation Error Categorization-based MT Quality Metric
Mark Fishel, Rico Sennrich, Maja Popovi´c and Ondˇrej
Bojar. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .64
Class error rates for evaluation of machine
translation output
Maja Popovic . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 71
SPEDE:
Probabilistic Edit Distance Metrics for MT Evaluation
Mengqiu Wang and Christopher Manning . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
Quality estimation for Machine Translation output
using linguistic analysis and decoding features
Eleftherios Avramidis . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . 84
Black
Box Features for the WMT 2012 Quality Estimation Shared Task
Christian Buck . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 91
Linguistic
Features for Quality Estimation
Mariano Felice and Lucia Specia . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . 96
PRHLT Submission to the WMT12 Quality Estimation Task
Jesus Gonzalez-Rubio, Alberto Sanchıs and Francisco
Casacuberta . . . . . . . . . . . . . . . . . . . . . . . . 104
Tree Kernels for Machine Translation Quality
Estimation
Christian Hardmeier, Joakim Nivre and J¨org Tiedemann . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
LORIA System for the WMT12 Quality Estimation Shared Task
David Langlois, Sylvain Raybaud and Kamel Sma¨ıli . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Quality
Estimation: an experimental study using unsupervised similarity measures
Erwan Moreau and Carl Vogel . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. 120
The
UPC Submission to the WMT 2012 Shared Task on Quality Estimation
Daniele Pighin, Meritxell Gonzalez and Lluıs Marquez .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
Morpheme- and POS-based IBM1 and language model scores
for translation quality estimation
Maja Popovic . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . 133
DCU-Symantec
Submission for the WMT 2012 Quality Estimation Task
Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann
Roturier, Rasul Samad Zadeh Kaljahi and Fred Hollowood . . . . . . 138
The SDL Language Weaver Systems in the WMT12 Quality
Estimation Shared Task
Radu Soricut, Nguyen Bach and Ziyuan Wang. . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .145
Regression
with Phrase Indicators for Estimating MT Quality
Chunyang Wu and Hai Zhao . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . 152
Non-Linear
Models for Confidence Estimation
Yong Zhuang, Guillaume Wisniewski and François Yvon . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157
Combining Quality Prediction and System Selection for
Improved Automatic Translation Output
Radu Soricut and Sushant Narsale . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. 163
Match
without a Referee: Evaluating MT Adequacy without Reference Translations
Yashar Mehdad, Matteo Negri and Marcello Federico . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171
Comparing
human perceptions of post-editing effort with post-editing operations
Maarit Koponen . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . 181
Review
of Hypothesis Alignment Algorithms for MT System Combination via Confusion
Network Decoding
Antti-Veikko Rosti, Xiaodong He, Damianos Karakos, Gregor
Leusch, Yuan Cao, Markus Freitag,
Spyros Matsoukas, Hermann Ney, Jason Smith and Bing Zhang.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .191
On
Hierarchical Re-ordering and Permutation Parsing for Phrase-based Decoding
Colin Cherry, Robert C. Moore and Chris Quirk . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200
CCG
Syntactic Reordering Models for Phrase-based Machine Translation
Dennis Nolan Mehay and Christopher Hardie Brew . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210
Using
Categorial Grammar to Label Translation Rules
Jonathan Weese, Chris Callison-Burch and Adam Lopez . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222
Using
Syntactic Head Information in Hierarchical Phrase-Based Translation
Junhui Li, Zhaopeng Tu, Guodong Zhou and Josef van Genabith
. . . . . . . . . . . . . . . . . . . . . . . . . . 232
Fully
Automatic Semantic MT Evaluation
Chi-kiu Lo, Anand Karthik Tumuluru and Dekai Wu . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243
Probes
in a Taxonomy of Factored Phrase-Based Models
Ondˇrej Bojar, Bushra Jawaid and Amir Kamran . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253
The CMU-Avenue French-English Translation System
Michael Denkowski, Greg Hanneman and Alon Lavie . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261
Formemes
in English-Czech Deep Syntactic MT
Ondrej Dusek, Zdenek Zabokrtsky, Martin Popel, Martin
Majlis, Michal Novak and David Marecek ……………..267
Lluis Formiga, Carlos A. Henrıquez Q., Adolfo
Hernandez, Jose B. Marino, Enric Monte and Jose A. R. Fonollosa . . . . . 275
Joshua 4.0: Packing, PRO, and Paraphrases
Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post and
Chris Callison-Burch . . . . . . . . . . 283
Syntax-aware
Phrase-based Statistical Machine Translation: System Description
Ulrich Germann . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . 292
QCRI
at WMT12: Experiments in Spanish-English and German-English Machine Translation
of News Text
Francisco Guzman, Preslav Nakov, Ahmed Thabet and Stephan
Vogel . . . . . . . . . . . . . . . . . . . . . 298
The
RWTH Aachen Machine Translation System for WMT 2012
Matthias Huck, Stephan Peitz, Markus Freitag, Malte Nuhn
and Hermann Ney . . . . . . . . . . . . . 304
Machine Learning for Hybrid Machine Translation
Sabine Hunsicker, Chen Yu and Christian Federmann . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 312
Towards
Effective Use of Training Data in Statistical Machine Translation
Philipp Koehn and Barry Haddow. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
317
Joint
WMT 2012 Submission for the QUAERO Project
Freitag Markus, Peitz Stephan, Huck Matthias, Ney Hermann,
Niehues Jan, Herrmann Teresa,
Waibel Alex, Hai-son Le, Lavergne Thomas, Allauzen
Alexandre, Buschbeck Bianka, Crego Joseph
Maria and Senellart Jean . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 322
Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna
Apidianaki, Li Gong, Aurelien
Max, Artem Sokolov, Guillaume Wisniewski and Francois Yvon
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 330
Veronica Lopez-Ludena, Ruben San-Segundo and Juan M.
Montero . . . . . . . . . . . . . . . . . . . . . . . 338
PROMT DeepHybrid system for WMT12 shared translation
task
Alexander Molchanov . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . 345
The
Karlsruhe Institute of Technology Translation Systems for the WMT 2012
Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann,
Eunah Cho and Alex Waibel 349
Kriya
- The SFU System for Translation Task at WMT-12
Majid Razmara, Baskaran Sankaran, Ann Clifton and Anoop
Sarkar . . . . . . . . . . . . . . . . . . . . . . . 356
DEPFIX:
A System for Automatic Correction of Czech MT Outputs
Rudolf Rosa, David Marecek and Ondrej Dusek . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 362
LIUM’s
SMT Machine Translation Systems for WMT 2012
Christophe Servan, Patrik Lambert, Anthony Rousseau, Holger
Schwenk and Loıc Barrault . . 369
Selecting Data for English-to-Czech Machine Translation
Ales Tamchyna,
DFKI’s
SMT System for WMT 2012
David Vilar . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . 382
GHKM Rule Extraction and Scope-3 Parsing in Moses
Philip Williams and Philipp Koehn . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
388
Data
Issues of the Multilingual Translation Matrix
Daniel Zeman . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . 395
Constructing
Parallel Corpora for Six Indian Languages via Crowdsourcing
Matt Post, Chris Callison-Burch and Miles Osborne. . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .401
Twitter
Translation using Translation-Based Cross-Lingual Retrieval
Laura Jehl, Felix Hieber and Stefan Riezler . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 410
Analysing
the Effect of Out-of-Domain Data on SMT Systems
Barry Haddow and Philipp Koehn . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
422
Evaluating the Learning Curve of Domain Adaptive Statistical
Machine Translation Systems
Nicola Bertoldi, Mauro Cettolo, Marcello Federico and
Christian Buck . . . . . . . . . . . . . . . . . . . . 433
The
Trouble with SMT Consistency
Marine Carpuat and Michel Simard . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
442
Joern Wuebker and Hermann Ney . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
450
Leave-One-Out Phrase Model Training for Large-Scale
Deployment
Joern Wuebker, Mei-Yuh Hwang and Chris Quirk. . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .460
Direct
Error Rate Minimization for Statistical Machine Translation
Tagyoung Chung and Michel Galley. . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .468
Optimization Strategies for Online Large-Margin Learning in Machine
Translation
Vladimir Eidelman . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . 480