EAMT 2010

Proceedings of the

14th Annual Conference

of the European Association

for Machine Translation


27-28 May 2010

Saint-Raphaël Congrès, Saint-Raphaël, France


Edited by François Yvon and Viggo Hansen


Table of Contents



Andy Way

Message from the local organizers

Viggo Hansen and Svetlana Sheremetyeva

EAMT 2010 organizers


Keynote Speaker (Thursday 27 May):

Born Global - How to Communicate Effectively in an International Environment

Doris Albisser, CEO - CLS Communication, Switzerland [presentation]


Invited Presentations:

User report (Thursday 27 May): What’s driving MT and why should we care?

Lori Thicke, CEO, Lexcelera. Innovating Translation [presentation]


User report (Thursday 27 May): Let a Thousand MT Systems Bloom

Jaap van der Meer, TAUS


User report (Friday 28 May): The Evolution of the European Machine Translation Programme at the European Patent Office

Patrizia Biani, Country Coordinator, European Affairs, Member States Co-op. , European Patent Office [presentation]


Regular Papers: Thursday 27 May


Research paper: Can inversion transduction grammars generate hand alignments?

Anders Søgaard [presentation]


Research paper: A fully unsupervised approach for mining parallel data from comparable corpora

Thi Ngoc Diep Do, Laurent Besacier and Eric Castelli [presentation]


Research paper: Statistical Analysis of Alignment Characteristics for Phrase-based Machine Translation

Patrik Lambert, Simon Petitrenaud, Yanjun Ma and Andy Way [presentation]


User report: Language Technology Resource Center,

Jennifer DeCamp [presentation]


User report: MT engine selection in the enterprise

Heidi Depraetere and Pablo Vazquez [presentation]


Poster presentations: Thursday 27 May


Research paper: Query translation using Wikipedia-based resources for analysis and disambiguation

Benoit Gaillard, Malek Boualem and Olivier Collin


Research paper: What counts as useful advice in a university post-editing training context? Report on a case

Ilse Depraetere


Research paper: Linguistic-based Evaluation Criteria to identify Statistical Machine Translation Errors

Mireia Farrús, Marta R. Costa-jussà, José B. Mariño and José A. R. Fonollosa


Research paper: Rule-based Breton to French machine translation

Francis Tyers


User report: Correlating Translation Product and Translation Process Data of Professional and Student Translators

Michael Carl and Matthias Buch-Kromann


Research paper: Robust Estimation of Feature Weights in Statistical Machine Translation

Cristina España-Bonet and Lluís Màrquez


Research paper: Potential scope of a fully-integrated architecture for speech translation

Alicia Pérez, M. Inés Torres and Francisco Casacuberta

Research paper: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system

Marta R. Costa-jussa, Vidas Daudaravicius, and Rafael E.Banchs


Research paper: On the Use of Confidence Measures within an Interactive-predictive Machine Translation System

Jesús González-Rubio, Daniel Ortíz-Martínez and Francisco Casacuberta


Research paper: Analysing Post-Editing Performance: Correlations with Years of Translation Experience

Giselle de Almeida and Sharon O'Brien


Research paper: A Deterministic Annealing-Based Training Algorithm For Statistical Machine Translation Models

Pascual Martínez Gómez, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda and Germán Sanchis-Trilles


Research paper: English to Bangla Phrase-Based Machine Translation

Md. Zahurul Islam, Jörg Tiedemann and Andreas Eisele


Research paper: Translation practice in the workplace and Machine Translation

Nikiforos Karamanis, Saturnino Luz and Gavin Doherty


User report: TMX Markup: A Challenge When Adapting SMT to the Localisation Environment

Jinhua Du, Johann Roturier and Andy Way


Research paper: Speeding up the Implementation Process of a Shallow Transfer Machine Translation System

Jernej Vičič and Petr Homola


User report: Bridging the Gap – EuroTermBank Terminology Delivered to Users’ Environment

Tatiana Gornostay, Andrejs Vasiljevs, Signe Rirdance and Roberts Rozis


Research paper: Lattice Score Based Data Cleaning for Phrase-Based Statistical Machine Translation

Jie Jiang, Andy Way and Julie Carson-Berndsen


Research paper: Chunk-Based EBMT

Jae Dong Kim, Ralf Brown and Jaime Carbonell


Research paper: Source reordering using MaxEnt classifiers and supertags

Maxim Khalilov and Khalil Sima'an


Regular papers: Friday 28 May


Research paper: Domain Adaptation in Statistical Machine Translation using Factored Translation Models

Jan Niehues and Alex Waibel [presentation]


Research paper: Online Language Model adaptation via N-gram Mixtures for Statistical Machine Translation

Germán Sanchis-Trilles and Mauro Cettolo [presentation]


Research paper: Learning an Expert from Human Annotations in Statistical Machine Translation: the Case of Out-of-Vocabulary Words

Wilker Aziz, Marc Dymetman, Lucia Specia and Shachar Mirkin [presentation]


Research paper: The Impact of Source–Side Syntactic Reordering on Hierarchical Phrase-based SMT

Jinhua Du and Andy Way [presentation]


Research paper: Hierarchical Hybrid Translation between English and German

Yu Chen and Andreas Eisele [presentation]


Research paper: A Novel Statistical Pre-Processing Model for Rule-Based Machine Translation System

Yanli Sun, Sharon O’Brien, Minako O’Hagan, Fred Hollowood [presentation]


User report: Using the Apertium Spanish-Brazilian Portuguese machine translation system for localization

Masselot François, Ribiczey Petra and Ramírez-Sánchez Gema [presentation]


User report: Employing Machine Translation in Glocalization Tasks – A Use Case Study

Jörg Schütz and Sven Christian Andrä [presentation]


User report: Haitian Creole: How to Build and Ship an MT Engine from Scratch in 4 days, 17 hours, & 30 minutes

William Lewis [presentation]


Research paper: Bottom-up Transfer in Example-based Machine Translation

Vincent Vandeghinste and Scott Martens [presentation]


Research paper: A Bootstrapped Interlingua-Based SMT Architecture

Manny Rayner, Paula Estrella and Pierrette Bouillon [presentation]


Research paper: Automatic Determination of Number of clusters for creating Templates in Example-Based Machine Translation

Rashmi Gangadharaiah, Ralf Brown and Jaime Carbonell [presentation]


European Community supported projects: Friday 28 May


              iTranslate4.eu: Internet translators for all European languages.

              PANACEA – platform for the automatic, normalized annotation and cost-effective acquisition of language resources.

              PluTO: patent language translation online.

              TTC:terminology extraction, translation tools, comparable corpora

Also presented:





Best Thesis Awards: Friday 28 May


Best Thesis Award for 2008: Empirical machine translation and its evaluation.

              Jesús Giménez


Best Thesis Award for 2009: Constrained word alignment models for statistical machine translation.

 Yanjun Ma


List of participants