EAMT-2009
Proceedings of the
13th
Annual Conference
of the European Association
for Machine Translation
edited by
Lluís Màrquez and Harold Somers
14-15 May 2009
Universitat Politècnica de Catalunya
Table of Contents
Invited Papers
"The Jungle is Neutral" - Newcomer Languages Face New Media
Nicholas Ostler............................................................................................................................................................... 1
Adaptable, Community-Controlled, Language Technologies for Language Maintenance
Lori Levin....................................................................................................................................................................... 8
Regular Papers
Character-Based PSMT for Closely Related Languages
Jörg Tiedemann.......................................................................................................................................................... 12
TS3: an Improved Version of the Bilingual Concordancer TransSearch
Stéphane Huet, Julien Bourdaillet and Philippe Langlais.................................................................................... 20
Estimating the Sentence-Level Quality of Machine Translation Systems
Lucia Specia, Marco Turchi, Nicola Cancedda, Marc Dymetman and Nello Cristianini.............................. 28
Evaluation-Guided
Pre-Editing of Source Text: Improving MT-Tractability of Light Verb Constructions
Bogdan Babych, Anthony Hartley and
Serge Sharoff........................................................................................ 36
Learning Labelled Dependencies in Machine Translation Evaluation
Yifan He and
Improving a Catalan-Spanish Statistical Translation System using Morphosyntactic Knowledge
Mireia Farrús, Marta R. Costa-jussà, Marc Poch, Adolfo Hernández and José B. Mariño.......................... 52
Use of
Rich Linguistic Information to Translate Prepositions and Grammar Cases to
Basque
Eneko Agirre, Aitziber Atutxa, Gorka Labaka, Mikel Lersundi, Aingeru Mayor and Kepa
Sarasola........................................................................................................................................................................ 58
Gappy Translation Units under Left-to-Right SMT Decoding
Josep M. Crego and François Yvon......................................................................................................................... 66
Relevance of Different Segmentation Options on Spanish-Basque SMT
Arantza Díaz de Ilarraza, Gorka Labaka and Kepa Sarasola.......................................................................... 74
English-Latvian Toponym Processing: Translation Strategies and Linguistic Patterns
Tatiana Gornostay and Inguna Skadiņa................................................................................................................ 81
An Environment for Named Entity Recognition and Translation
Filip Graliński, Krzysztof Jassem and Michał Marcińczuk................................................................................. 88
Optimal Bilingual Data for French-English PB-SMT
Sylwia Ozdowska and Andy Way.......................................................................................................................... 96
Word- and Sentence-Level Confidence Measures for Machine Translation
Sylvain Raybaud, Caroline Lavecchia, David Langlois and Kamel Smaïli................................................. 104
Translating Questions for Cross-Lingual QA
Jörg Tiedemann........................................................................................................................................................ 112
Developing Prototypes for Machine Translation between Two Sami Languages
Francis M. Tyers, Linda Wiechetek and Trond Trosterud................................................................................. 120
Collocations in a Rule-Based MT System: A Case Study Evaluation of their Translation Adequacy
Eric Wehrli, Violeta Seretan, Luka Nerima and Lorenza Russo...................................................................... 128
Automatic Translation of Norwegian Noun Compounds
Lars Bungum and Stephan Oepen........................................................................................................................ 136
Marker-Based Filtering of Bilingual Phrase Pairs for SMT
Felipe Sánchez-Martínez and
Tree-Based Target Language Modeling
Vincent Vandeghinste.............................................................................................................................................. 152
Language Model Adaptation for Difficult to Translate Phrases
Behrang Mohit, Frank Liberato and Rebecca Hwa.......................................................................................... 160
A Phrase-Based Hidden Semi-Markov Approach to Machine Translation
Jesús Andrés-Ferrer and Alfons Juan.................................................................................................................... 168
Building Strong Multilingual Aligned Corpora
Reza Bosagh Zadeh................................................................................................................................................. 176
A Constraint Satisfaction Approach to Machine Translation
Sander Canisius and Antal van den Bosch.......................................................................................................... 182
Introducing the Autshumato Integrated Translation Environment
Hendrik J. Groenewald and Wildrich Fourie........................................................................................................ 190
A New Subtree-Transfer Approach to Syntax-Based Reordering for Statistical Machine Translation
Maxim Khalilov, José A. R. Fonollosa and Mark Dras..................................................................................... 197
On Extracting Multiword NP Terminology for MT
Svetlana Sheremetyeva.......................................................................................................................................... 205
Rule-Based Augmentation of Training Data in Breton-French Statistical Machine Translation
Francis M. Tyers....................................................................................................................................................... 213
Can Semantic Role Labeling Improve SMT?
Dekai Wu and Pascale Fung.................................................................................................................................. 218
Are Unaligned Words Important for Machine Translation?
Yuqi Zhang, Evgeny Matusov and Hermann Ney............................................................................................ 226
Using Supertags as Source Language Context in SMT
Rejwanul Haque, Sudip
Kumar Naskar, Yanjun Ma and
On LM Heuristics for the Cube Growing Algorithm
David Vilar and Hermann Ney.............................................................................................................................. 242
Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation
Yanjun Ma, Patrik Lambert and
Workshops held in conjunction with the Conference, Wednesday
13 May
Statistical
Multilingual Analysis for Retrieval and Translation
Nicola Cancedda
Empirical machine translation and its evaluation
Jesús
Giménez
Online learning for CAT applications
Nicolò Cesa-Bianchi and Gabriele Reverberi
Matti Kääriäinen
Large scale maximum margin regression based,
structural learning approach to phrase translations
Sandor Szedmak, Esther Galbrun, Craig
Saunders, and Yizhao Ni
Learning to translate: a statistical and
computational analysis
Marco
Turchi, Tijl de Bie, and Nelo Cristianini
Improving SMT by learning translation direction
Cyril
Goutte, David Kurokawa, and
Pierre Isabelle
Multi-view
CCA and regression CCA
Jan Rupnik
Large-margin structural prediction via linear
programming
Zuoran
Wang, John Shawe-Taylor, & Sándor
Szedmák
Sentence-level
confidence estimation for MT
Lucia Specia, Nicola Cancedda, Marc Dymetman, Craig Saunders, Marco Turchi,
Nello Cristianini, Zhuoran Wang and John Shawe-Taylor
Nicola Cancedda
Machine
Technologies for Historical Linguistics
[proceedings
not yet available]
Challenges
in machine (aided) translation for historical languages
Walther von Hahn
Mass
digitization in the humanities and results of DARIAH-Network
Mike Priddy
Machine
(aided) translation for historical texts – an overview of current solutions
Cristina Vertan
Tagging
Old Russian texts via modern translations – applications of and resources for
historical MT
Roland Meyer
Hypothetical
and explicit lexical resources supporting access to historical documents
Christoph Ringlstetter
Automatically
building bilingual dictionaries for Greek and Latin
David Bammann