Machine Translation Summit XIV

2-6 September 2013, Nice, France


Main Conference Proceedings


Editors:  Andy Way

                Khalil Sima’an and Mikel L.Forcada (Research papers)

                Daniel Grasmick and Heidi Depaetere (User papers)




Research track - Invited talk

The Operation Sequence Model: Integrating Translation and Reordering Operations in a Single Left-to-Right Model

Hinrich Schütze …  1

Research track - Oral presentations

Terminology-driven Augmentation of Bilingual Terminologies

Koichi Sato, Koichi Takeuchi and Kyo Kageura…      3

Generative and Discriminative Methods for Online Adaptation in SMT

Katharina Wäschle, Patrick Simianer, Nicola Bertoldi, Stefan Riezler and Marcello Federico…      11

Inducing Romanization Systems

Keiko Taguchi, Andrew Finch, Seiichi Yamamoto and Eiichiro Sumita …      19

Project Adaptation for MT-Enhanced Computer Assisted Translation

Mauro Cettolo, Nicola Bertoldi and Marcello Federico…      27

Cache-based Online Adaptation for Machine Translation Enhanced Computer Assisted Translation

Nicola Bertoldi, Mauro Cettolo and Marcello Federico…      35

Semi-Supervised Word Sense Disambiguation for Mixed-Initiative Conversational Spoken Language Translation

Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Rohit Kumar, Enoch Kan, Rohit Prasad and Prem Natarajan …  43

Listwise Approach to Learning to Rank for Automatic Evaluation of Machine Translation

Maoxi Li, Aiwen Jiang and Mingwen Wang …      51

MWE Alignment in Phrase Based Statistical Machine Translation

Santanu Pal, Sudip Kumar Naskar and Sivaji Bandyopadhyay .............       61

Real-life Translation Quality Estimation for MT System Selection

Lluis Formiga, Lluis Marquez and Jaume Pujantell…      69

Yet Another Fast, Robust and Open Source Sentence Aligner. Time toReconsider Sentence Alignment?

Fethi Lamraoui and Philippe Langlais…       77

Development of a Simultaneous Interpretation System for Face-to-Face Services and Its Evaluation Experiment in Real Situation

Akiko Sakamoto, Nayuko Watanabe, Satoshi Kamatani and Kazuo Sumita…       85

Can Informal Genres be better Translated by Tuning on Automatic Semantic Metrics?

Chi-Kiu Lo and Dekai Wu …      93

Quality Estimation-guided Data Selection for Domain Adaptation of SMT

Pratyush Banerjee, Raphael Rubino, Johann Roturier and Josef van Genabith ….       101


Modeling Hip Hop Challenge-Response Lyrics as Machine Translation
Karteek Addanki, Markus Saers and Dekai Wu….      109

Design and Analysis of a Large Corpus of Post-Edited Translations: Quality Estimation, Failure Analysis and the Variability of Post-Edition
Guillaume Wisniewski, Anil Kumar Singh, Natalia Segal and François Yvon … 117

The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian - Standard Arabic Machine Translation

Ahmed Hamdi, Rahma Boujelbane, Nizar Habash and Alexis Nasr …   125

Meta-Evaluation of a Diagnostic Quality Metric for Machine Translation
Sudip Kumar Naskar, Antonio Toral, Federico Gaspari and Declan Groves      135

Towards a Generic Approach for Bilingual Lexicon Extraction from Comparable Corpora
Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum ........ ..      143

(Hidden) Conditional Random Fields Using Intermediate Classes for Statistical Machine Translation

Patrick Lehnen, Jan-Thorsten Peter, Jorn Wiibker, Stephan Peitz and Hermann Ney ….     151

Reverse Word Order Model

Markus Freitag, Minwei Feng, Matthias Huck, Stephan Peitz and Hermann Ney …   159

An Investigation on the Effectiveness of Features for Translation Quality Estimation

Kashif Shah, Trevor Conn and Lucia Specia ….        167

A Free/Open-source Kazakh-Tatar Machine Translation System

Ilnar Salimzyanov, Jonathan Washington and Francis Tyers.................       175

Simulating Discriminative Training for Linear Mixture Adaptation in Statistical Machine Translation
George Foster, Boxing Chen and Roland Kuhn ….      183

PEPr: Post-Edit Propagation Using Phrase-based Statistical Machine Translation

Michel Simard and George Foster ….      191

Research track - Poster presentations

Translating the FINREP Taxonomy using a Domain-specific Corpus

Mihael Arcan, Susan Marie Thomas, Derek De Brandt and Paul Buitelaar     199

Promoting Flexible Translations in Statistical Machine Translation

Rico Sennrich ….       207

Language-independent Model for Machine Translation Evaluation with Reinforced Factors

Aaron Li-Feng Han, Derek F. Wong, Lidia S. Chao, Liangye He, Yi Lu, Junwen Xing and Xiaodong Zeng ….     215

A CCG-based Quality Estimation Metric for Statistical Machine Translation
Hala Almaghout and Lucia Specia ….       223

Learning from Human Judgments of Machine Translation Output

Maja Popovic, Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Sven Schmeier, Cindy Tscherwinka and David Vilar ….      231

Towards the Supervised Machine Translation: Real Word Alignments and Translations in a Multi-task Active Learning process

Martha Alicia Rocha and Joan Andreu Sanchez ….      239

Comparing Forum Data Post-Editing Performance Using Translation Memory and Machine Translation Output: A Pilot Study

Lucia Morado Vazquez, Silvia Rodriguez Vazquez and Pierrette Bouillon  ….   249

Confidence-driven Rewriting for Improved Translation

Shachar Mirkin, Sriram Venkatapathy and Marc Dymetman ….      257

Statistical Machine Translation for Automobile Marketing Texts

Samuel Läubli, Mark Fishel, Manuela Weibel and Martin Volk ..........       265

MT Techniques in a Retrieval System of Semantically Enriched Patents

Meritxell Gonzalez, Maria Mateva, Ramona Enache, Cristina Espana-Bonet and Lluis Marquez, Borislav Popov and Aarne Ranta          273

Application of Online Terminology Services in Statistical Machine Translation

Raivis Skadins, Marcis Pinnis, Tatiana Gornostay and Andrejs Vasiljevs ….      281

Key Problems in Conversion from Simplified to Traditional Chinese Characters
Xiaodong Shi, Yidong Chen and Xiuping Huang ….      287

Topic Models for Translation Quality Estimation for Gisting Purposes

Raphael Rubino, Jose Guilherme Camargo de Souza, Jennifer Foster and Lucia Specia ….      295

User track - Invited talk

Demystifying Machine Translation: Learning from the Real World

Harald Elsen ….      303

User track - Oral presentations

Analyzing and Predicting MT Utility and Post-Editing Productivity in Enterprise-scale Translation Projects

Alon Lavie, Olga Beregovaya, Michael Denkowski, David Clarke......     305

A Technology-agnostic Approach to Machine Translation

Lori Thicke ….      309

COACH: Designing a new CAT tool with Translator Interaction

Laura Bota, Christoph Schneider, Andy Way ….      313

Hybrid Domain Adaptation for a Rule Based MT System

Petra Wolf, Ulrike Bernardi  ….    321

Implementing ITS 2.0 for Post-editing Purposes

Celia Rico, Pedro L. Diez Orzas, Felix Sasaki ….      329

ROI Analysis Model for Language Service Providers

Ekaterina Stambolieva ….      337


Large-scale Multiple Language Translation Accelerator at the United Nations

Bruno Pouliquen, Cecilia Elizalde, Marcin Junczys-Dowmunt, Christophe Mazenc, Jose Garcia-Verdugo ….  345

Let'sMT! as a Learning Platform for SMT

Hanne Fersøe, Dorte Haltrup Hansen, Lene Offersgaard, Susi Olsen, Claus Povlsen ….  353

User Evaluation of Advanced Interaction Features for a Computer-Assisted Translation Workbench

Vicente Alabau, Jesus Gonzalez-Rubio, Luis A. Leiva, Daniel Ortiz-Martínez, German Sanchis-Trilles, Francisco Casacuberta, Bartolomé Mesa-Lao, Ragnar Bonk, Michael Carl, Mercedes Garcia-Martinez ….      361

SMT Approaches for Commercial Translation of Subtitles

Thierry Etchegoyhen, Mark Fishel, Jie Jiang, Mirjam Sepesy Maucec ….     369

Connectivity, Adaptability, Productivity, Quality, Price...   What are the Necessary Ingredients to get the MT Recipe Right?

Laura Casanellas, Lena Marg ….      371

Productivity or Quality? Let's do both!

Joachim Van den Bogaert and Nathalie De Sutter ….      381

Streamlining Terminology Management in an RBMT Context

Horst Liebscher, Thomas Senf ….    391

Using MT as Enabler for Differentiated Quality Levels

Eric Blassin, Rafa Moral ….      397

IBM Germany Using RMT

Sigrun Gehrig ….      399

USE CASE: Customization and Collaboration to Enhance MT for a Knowledge Base Online Portal

Chris Wendt, Federico Garcea ….      401

European projects

Automated Community Content Editing PorTal (ACCEPT)

Pierrette Bouillon ….      407

Bologna Translation Service (BOLOGNA)

Joachim Van den Bogaert, Heidi Depraetere, Joeri Van de Walle....... ..      409


CASMACAT: Cognitive Analysis and Statistical Methods for Advanced Computer Aided Translation

Philipp Koehn, Michael Carl, Francisco Casacuberta, Eva Marcos ….      411

Machine Translation in CNGL II

Qun Liu …      413

Bridges Across the Language Divide — EU-BRIDGE      415

Excitement:  Exploring Customer Interactions through Textual EntailMENT

Ido Dagan, Bernardo Magnini, Guenter Neumann, Sebastian Pado ….             417

Pangeanic in the EXPERT Project: Exploiting Empirical appRoaches to Translation

Manuel Herranz, Alex Helle, Elia Yuste, Ruslan Mitkov, Lucia Specia ...     419

FAUST: Feedback Analysis for User Adaptive Statistical Translation

William Byrne, Lluis Marquez ….       421

LT-Web: Metadata for the Multilingual Web: Introducing Internationalization Tag Set (ITS) 2.0
Felix Sasaki ….     423

MATECAT: Machine Translation Enhanced Computer Assisted Translation
Marcello Federico, Philipp Koehn, Holger Schwenk, Marco Trombetti ….     425

META - Multilingual Europe Technology Alliance

Georg Rehm, Aljoscha Burchardt, Felix Sasaki ….      427

MOLTO - Multilingual On Line Translation

Aarne Ranta ….      429

MONNET: Multilingual Ontologies for Networked Knowledge

Mihael Arcan, Paul Buitelaar ….      431

MosesCore: Moses Open Source Evaluation and Support Co-ordination for OutReach and Exploitation  433

PANACEA: Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies

Nuria Bel, Marc Poch, Antonio Toral ….     435

A Review of the PRESEMT project

George Tambouratzis, Marina Vassiliou, Sokratis Sofianopoulos........ ..     437



Stephen Doherty, Declan Groves, Josef van Genabith, Arle Lommel, Aljoscha Burchardt, Hans Uszkoreit, Lucia Specia, Stelios Piperidis ….      439

SIGNSPEAK: Scientific Understanding and Vision-based Technological Development for Continuous Sign Language Recognition and Translation

Jens Forster, Christoph Schmidt, Hermann Ney ….      441

SUMAT: An Online Service for Subtitling by Machine Translation

P. Georgakopoulou, L. Bywood, Thierry Etchegoyen, Mark Fishel, Jie Jiang, G. van Loenhout, A. del Pozo, D. Spiliotopoulos, Mirjam Sepesy Maucec, A. Turner …. 443

TaaS: Terminology as a Service

Andrejs Vasiljevs, Tatiana Gornostay ….      445

transLectures: Transcription and Translation of Video Lectures...........       447

TTC: Terminology Extraction, Translation Tools and Comparable Corpora
Beatrice Daille ….      449

Cross-lingual Knowledge Extraction (XLike)….      451




September 2, 2013


5th Workshop on Patent Translation, ed. Shoichi Yokoyama


Workshop on Post-editing Technology and Practice (WPTP-2), eds. Sharon O’Brien, Michel Simard and Lucia Specia



September 3, 2013


Workshop on Multi-word Units in Machine Translation and Translation Technologies, eds. Johanna Monti, Ruslan Mitkov, Gloria Corpas Pastor, Violeta Seretan,