EAMT-2009

 

Proceedings of the

13th Annual Conference

of the European Association

for Machine Translation

edited by

Lluís Màrquez and Harold Somers

14-15 May 2009

Universitat Politècnica de Catalunya

Barcelona, Spain

 

Table of Contents

 

Foreword, Message from the Programme Committee Chairs, Message from the Local Organising Committee, EAMT-2009 Organizers, Conference Program

Invited Papers

"The Jungle is Neutral" - Newcomer Languages Face New Media

Nicholas Ostler........................................................................................................ 1

Adaptable, Community-Controlled, Language Technologies for Language Maintenance

Lori Levin............................................................................................................... 8

Regular Papers

Character-Based PSMT for Closely Related Languages

Jörg Tiedemann..................................................................................................... 12

TS3: an Improved Version of the Bilingual Concordancer TransSearch

Stéphane Huet, Julien Bourdaillet and Philippe Langlais..................................... 20

Estimating the Sentence-Level Quality of Machine Translation Systems

Lucia Specia, Marco Turchi, Nicola Cancedda, Marc Dymetman and Nello Cristianini            28

       Evaluation-Guided Pre-Editing of Source Text: Improving MT-Tractability of Light Verb Constructions
Bogdan Babych, Anthony Hartley and Serge Sharoff......................................... 36

Learning Labelled Dependencies in Machine Translation Evaluation

Yifan He and Andy Way...................................................................................... 44

Improving a Catalan-Spanish Statistical Translation System using Morphosyntactic Knowledge

Mireia Farrús, Marta R. Costa-jussà, Marc Poch, Adolfo Hernández and José B. Mariño         52

       Use of Rich Linguistic Information to Translate Prepositions and Grammar Cases to Basque
Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor and Kepa
Sarasola................................................................................................................. 58

Gappy Translation Units under Left-to-Right SMT Decoding

Josep M. Crego and François Yvon...................................................................... 66

Relevance of Different Segmentation Options on Spanish-Basque SMT

Arantza Díaz de Ilarraza, Gorka Labaka and Kepa Sarasola............................... 74

English-Latvian Toponym Processing: Translation Strategies and Linguistic Patterns

Tatiana Gornostay and Inguna Skadiņa................................................................ 81

An Environment for Named Entity Recognition and Translation

Filip Graliński, Krzysztof Jassem and Michał Marcińczuk................................... 88

Optimal Bilingual Data for French-English PB-SMT

Sylwia Ozdowska and Andy Way....................................................................... 96

Word- and Sentence-Level Confidence Measures for Machine Translation

Sylvain Raybaud, Caroline Lavecchia, David Langlois and Kamel Smaïli........ 104

Translating Questions for Cross-Lingual QA

Jörg Tiedemann................................................................................................... 112


Developing Prototypes for Machine Translation between Two Sami Languages

Francis M. Tyers, Linda Wiechetek and Trond Trosterud................................... 120

Collocations in a Rule-Based MT System: A Case Study Evaluation of their Translation Adequacy

Eric Wehrli, Violeta Seretan, Luka Nerima and Lorenza Russo......................... 128

Automatic Translation of Norwegian Noun Compounds

Lars Bungum and Stephan Oepen...................................................................... 136

Marker-Based Filtering of Bilingual Phrase Pairs for SMT

Felipe Sánchez-Martínez and Andy Way............................................................ 144

Tree-Based Target Language Modeling

Vincent Vandeghinste......................................................................................... 152

Language Model Adaptation for Difficult to Translate Phrases

Behrang Mohit, Frank Liberato and Rebecca Hwa............................................ 160

A Phrase-Based Hidden Semi-Markov Approach to Machine Translation

Jesús Andrés-Ferrer and Alfons Juan................................................................. 168

Building Strong Multilingual Aligned Corpora

Reza Bosagh Zadeh............................................................................................. 176

A Constraint Satisfaction Approach to Machine Translation

Sander Canisius and Antal van den Bosch......................................................... 182

Introducing the Autshumato Integrated Translation Environment

Hendrik J. Groenewald and Wildrich Fourie...................................................... 190

A New Subtree-Transfer Approach to Syntax-Based Reordering for Statistical Machine Translation

Maxim Khalilov, José A. R. Fonollosa and Mark Dras....................................... 197

On Extracting Multiword NP Terminology for MT

Svetlana Sheremetyeva........................................................................................ 205

Rule-Based Augmentation of Training Data in Breton-French Statistical Machine Translation

Francis M. Tyers.................................................................................................. 213

Can Semantic Role Labeling Improve SMT?

Dekai Wu and Pascale Fung............................................................................... 218

Are Unaligned Words Important for Machine Translation?

Yuqi Zhang, Evgeny Matusov and Hermann Ney............................................. 226

Using Supertags as Source Language Context in SMT

Rejwanul Haque, Sudip Kumar Naskar, Yanjun Ma and Andy Way................ 234

On LM Heuristics for the Cube Growing Algorithm

David Vilar and Hermann Ney........................................................................... 242

Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation

Yanjun Ma, Patrik Lambert and Andy Way....................................................... 250

 

 

Workshops held in conjunction with the Conference, Wednesday 13 May

 

Statistical Multilingual Analysis for Retrieval and Translation

 

Introduction

          Nicola Cancedda

Empirical machine translation and its evaluation

          Jesús Giménez

Online learning for CAT applications

         Nicolò Cesa-Bianchi and Gabriele Reverberi

Sinuhe – statistical machine translation with a globally trained conditional exponential family translation model

         Matti Kääriäinen

Large scale maximum margin regression based, structural learning approach to phrase translations

         Sandor Szedmak, Esther Galbrun, Craig Saunders, and Yizhao Ni

Learning to translate: a statistical and computational analysis

         Marco Turchi, Tijl de Bie, and Nelo Cristianini

Improving SMT by learning translation direction

         Cyril Goutte, David Kurokawa, and Pierre Isabelle

Multi-view CCA and regression CCA

         Jan Rupnik

Large-margin structural prediction via linear programming

        Zuoran Wang, John Shawe-Taylor, & Sándor Szedmák

Sentence-level confidence estimation for MT

        Lucia Specia, Nicola Cancedda, Marc Dymetman, Craig Saunders, Marco Turchi, Nello Cristianini, Zhuoran Wang and John Shawe-Taylor

Closing remarks

        Nicola Cancedda

 

Machine Technologies for Historical Linguistics

           [proceedings not yet available]

 

Challenges in machine (aided) translation for historical languages

           Walther von Hahn

Mass digitization in the humanities and results of DARIAH-Network

            Mike Priddy

Machine (aided) translation for historical texts – an overview of current solutions

            Cristina Vertan

Tagging Old Russian texts via modern translations – applications of and resources for historical MT

            Roland Meyer

Hypothetical and explicit lexical resources supporting access to historical documents

            Christoph Ringlstetter

Automatically building bilingual dictionaries for Greek and Latin

            David Bammann