Machine Translation Archive

Index to methods, techniques, processes

Publications 2005-2009

For other periods go to: Publications since 2010; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

To return to home page click here

Alignment

(2009) Sergio Barrachina, Oliver Bender, Francisco Casacuberta, Jorge Civera, Elsa Cubel, Shahram Khadivi, Antonio Lagarda, Hermann Ney, Jesús Tomás, Enrique Vidal, & Juan-Miguel Vilar: Statistical approaches to computer-assisted translation. Computational Linguistics 35 (1), pp.3-28. [PDF, 252KB]

(2009) Reza Bosagh Zadeh: Building strong multilingual aligned corpora. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.176-181. [PDF, 497KB]

(2009) Gosse Bouma, Sergio Duarte, & Zahurul Islam: Cross-lingual alignment and completion of Wikipedia templates. NAACL-HLT-2009: The Third International Workshop on Cross Lingual Information Access: Addressing the Information Need of Multilingual Societies (CLIAWS3), Proceedings of the Workshop, June 4, 2009, Boulder, Colorado; pp.21-29. [PDF, 160KB]

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: The SAWA corpus: a parallel corpus English – Swahili. Proceedings of the EACL Workshop on Language Technologies for African Languages (AfLaT 2009), Athens, Greece, 31 March 2009; pp.9-16. [PDF, 129KB]

(2009) Thi-Ngoc-Diep Do, Viet-Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Mining a comparable text corpus for a Vietnamese-French statistical machine translation system.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.165-172. [PDF, 152KB]

(2009) Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer, & Grzegorz Kondrak: DirecTL: a language-independent approach to transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 128KB]

(2009) Adrien Lardilleux, Jonathan Chevelu, Yves Lepage, Ghislain Putois, & Julien Gosme: Lexicons or phrase tables? An investigation in sampling-based multilingual alignment. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.45-52. [PDF, 345KB]; presentation [PDF of PPT, 256KB]

(2009) Adrien Lardilleux & Yves Lepage: Sampling-based multilingual alignment. [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp. 214-218. [PDF, 210KB]

(2009) Yves Lepage, Adrien Lardilleux, & Julien Gosme: The GREYC translation memory for the IWSLT 2009 evaluation campaign: one step beyond translation memory. IWSLT 2009: Proceedings of the International Workshop on Spoke n Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 45-49. [PDF, ]; presentation [PDF of PPT,

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Samat Mamitimin & Min Hou: Chinese-Uyghur sentence alignment: an approach based on anchor sentences. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.38-45. [PDF, 215KB]

(2009) David Mareček: Analysis and alignment of parallel data in TectoMT. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.57-58 [PDF, 122KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Hideharu Okuma, Kazuo Hara, Masashi Shimbo, & Yuji Matsumoto: Bypassed alignment graph for learning coordination in Japanese sentences. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.5-8. [PDF, 474KB]

(2009) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Interactive machine translation based on partial statistical phrase-based alignments. [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp.330-336. [PDF, 275KB]

(2009) Vladimir Pervouchine, Haizhou Li, & Bo Lin: Transliteration alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.136-144. [PDF, 572KB]

(2009) Emmanuel Prochasson, Emmanuel Morin & Kyo Kageura: Anchor points for bilingual lexicon extraction from small comparable corpora. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.284-291. [PDF, 138KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Ken Samuel, Alan Rubenstein, Sherri Condon & Alex Yeh: Name matching between Chinese and Roman scripts: machine complements human. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 454KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Felipe Sánchez-Martínez & Andy Way: Marker-based filtering of bilingual phrase pairs for SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.144-151. [PDF , 386KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

 (2009) Anders Søgaard: On the complexity of alignment problems in two synchronous grammar formalisms.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.60-68. [PDF, 161KB]

(2009) Anders Søgaard & Jonas Kuhn: Empirical lower bounds on alignment error rates in syntax-based machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.19-27. [PDF, 138KB]

(2009) Sanghoun Song & Francis Bond: Online search interface for the Sejong Korean-Japanese bilingual corpus and auto-interpretation of phrase alignment. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.146-149. [PDF, 153KB]

(2009) Jun Sun, Min Zhang, & Chew Lim Tan: A non-contiguous tree sequence alignment-based model for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.914-922. [PDF, 459KB]

(2009) Masao Utiyama & Hitoshi Isahara: Mining patents for parallel corpora. In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.): Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.41-58.

(2009) Balakrishnan Vardarajan & Delip Rao: Î-extension Hidden Markov Models and weighted transducers for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.120-123. [PDF, 129KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2009) Ventsislav Zhechev: Unsupervised generation of parallel treebanks through sub-tree alignment. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.89-98. [PDF, 478KB]  [presentation at MT Marathon 2009, 3281KB]

(2008) Necip Fazil Ayan, Jing Zheng, & Wen Wang: Improving alignments for better confusion networks for combining machine translation systems. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.33-40. [PDF, 228KB]

(2008) Alberto Barrón-Cedeño, Paolo Rosso, David Pinto, & Alfons Juan: On cross-lingual plagiarism analysis using a statistical model. ECAI 2008 Workshop on Uncovering Plagiarism, Authorship and Social Software Misuse, July 22, Patras, Greece; 5pp. [PDF, 132KB]

(2008) Marta R.Costa-jussà, José A.R.Fonollosa, & Enric Monte: Using reordering in statistical machine translation based on recursive alignment block classification.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 96KB]

(2008) John DeNero & Dan Klein: The complexity of phrase alignment problems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 25-28. [PDF, 180KB]

(2008) John DeNero, Alexandre Bouchard-Côté, & Dan Klein: Sampling alignment structure under a Bayesian translation model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.314-323. [PDF, 356KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Chris Dyer, Alex Mont, Aaron Cordova, & Jimmy Lin: SMT model building with MapReduce. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 91 slides. [PDF, 1850KB]

(2008) Mark Fishel & Heiki-Jaan Kaalep: Experiments on processing overlapping parallel corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 186KB]

(2008) Surya Ganesh, Sree Harsha, Prasad Pingali, & Vasudeva Varma: Statistical transliteration for cross language information retrieval using HMM alignment and CRF. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.42-47. [PDF, 127KB]

(2008) Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, & Robert Moore: Indirect HMM-based hypothesis alignment for combining outputs from machine translation systems.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.98-107. [PDF, 280KB]

(2008) Young-Sook Hwang, YoungKil Kim, & SangKyu Park: Paraphrasing depending on bilingual context toward generalization of translation knowledge. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.327-334. [PDF, 952KB]

(2008) Einav Itamar & Alon Itai: Using movie subtitles for creating a large-scale bilingual corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 120KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Adrien Lardilleux & Yves Lepage: Multilingual alignments by monolingual string differences.  Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.55-58. [PDF, 164KB]

(2008) Adrien Lardilleux & Yves Lepage: A truly multilingual, high coverage, accurate, yet simple, subsentential alignment method. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.125-132. [PDF, 633KB]

(2008) Yves Lepage, Adrien Lardilleux, Julien Gosme, & Jean-Luc Manguin: The GREYC machine translation system for the IWSLT 2008 evaluation campaign. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 39-45. [PDF, 184KB]; presentation [PDF, 142KB]

(2008) Yanjun Ma, John Tinsley, Hany Hassan, Jinhua Du, & Andy Way: Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 26-33. [PDF, 198KB]; presentation [PDF, 559KB]

(2008) Lieve Macken, Els Lefever, & Veronique Hoste:  Linguistically-based sub-sentential alignment for terminology extraction from a bilingual automotive corpus. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.529-536. [PDF, 145KB]

(2008) Kazuaki Maeda, Xiaoyi Ma, & Stephanie Strassel: Creating sentence-aligned parallel text corpora from a large archive of potential parallel text using BITS and Champollion.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 68KB]

(2008) David Mareček, Zdeněk Žabokrtský, & Václav Novák: Automatic alignment of Czech and English deep syntactic dependency trees. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.104-113. [PDF, 586KB]

(2008) Toshiaki Nakazawa & Sadao Kurohashi: Linguistically-motivated tree-based probabilistic phrase alignment. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.163-171. [PDF, 603KB]

(2008) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Phrase-level alignment generation using a smoothed loglinear phrase-based statistical alignment model. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.160-169. [PDF, 536KB]

(2008) Antti-Veikko I. Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.183-186. [PDF, 61KB]

(2008) Lei Shi & Ming Zhou: Improved sentence alignment on parallel web pages using a stochastic tree alignment model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.505-513. [PDF, 483KB]

(2008) Anders Søgaard: Learning context-sensitive synchronous rules. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.170-175. [PDF, 525KB]

(2008) Zhiyi Song & Stephanie Strassel: Entity translation and alignment in the ACE-07 ET task. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 135KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Jörg Tiedemann: Synchronizing translated movie subtitles. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 171KB]

(2008) Julia Trushkina, Lieve Macken, & Hans Paulussen: Sentence alignment in DPC: maximizing precision, minimizing human effort.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 52KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2008) Clare R. Voss, Jamal Laoudi, & Jeffrey Micher: Exploitation of an Arabic language resource for MT evaluation: using Buckwalter-based lookup tool to augment CMU alignment algorithm.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 246KB]

(2008) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, & Sheng Li: A tree sequence alignment-based tree-to-tree translation model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 559-567. [PDF, 317KB]

(2007) Fernando Diaz & Donald Metzler: Pseudo-aligned multilingual corpora. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.2727-2732. [PDF, 126KB]

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

(2007) Mary Hearne, John Tinsley, Ventsislav Zhechev, & Andy Way: Capturing translational divergences with a statistical tree-to-tree aligner. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.85-94 [PDF, 394KB]; presentation [PDF, 462KB]

(2007) Sarvnaz Karimi, Falk Scholer, & Andrew Turpin: Collapsed consonant and vowel models: new approaches for English-Persian transliteration and back-transliteration.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 648-655 [PDF, 195KB]

(2007) Jae Dong Kim & Stephan Vogel: Iterative refinement of lexicon and phrasal alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.281-288 [PDF, 121KB]

(2007) Tadashi Kumano, Hideki Tanaka, & Takenobu Tokunaga: Extracting phrasal alignments from comparable corpora by using joint probability SMT model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.95-103 [PDF, 366KB]

(2007) Patrik Lambert, Rafael E.Banchs, & Josep M.Crego: Discriminative alignment training without annotated data for machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.85-88 [PDF, 114KB]

(2007) Ding Liu & Daniel Gildea: Source-language features and maximum correlation training for machine translation evaluation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.41-48 [PDF, 201KB]

(2007) Lieve Macken: Analysis of translational correspondence in view of sub-sentential alignment.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 9pp. [PDF, 182KB]

(2007) Toshiaki Nakazawa, Yu Kun, & Sadao Kurohashi: Structural phrase alignment based on consistency criteria. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.337-344 [PDF, 202KB]

(2007) Nasredine Semmar & Christian Fluhr: Arabic to French sentence alignment: exploration of a cross-language information retrieval approach. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 73-80 [PDF, 156KB]

(2007) John Tinsley, Ventsislav Zhechev, Mary Hearne, & Andy Way: Robust language pair-independent sub-tree alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.467-474 [PDF, 1310KB]

(2007) Akira Ushioda: Phrase alignment for integration of SMT and RBMT resources. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.8-12. [PDF, 66KB]

(2007) Akira Ushioda: Phrase alignment based on bilingual parsing. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.241-250 [PDF, 312KB]

(2007) Min Zhang, Hongfei Jiang, Ai Ti Aw, Jun Sun, Sheng Li, & Chew Lim Tan: A tree-to-tree alignment-based model for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.535-542 [PDF, 243KB]

(2007) Qibo Zhu, Diana Inkpen & Ash Asudeh: Automatic extraction of translations from web-based bilingual materials [abstract]. Machine Translation 21 (3), September 2007; pp.139-163.

(2006) Alexandru Ceauşu, Dan Ştefănescu, & Dan Tufiş: Acquis Communautaire sentence alignment using support vector machines.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2134-2137 [PDF, 381KB]

(2006) Brooke Cowan, Ivona Kučerová, & Michael Collins: A discriminative model for tree-to-tree translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 232-241. [PDF, 338KB]

(2006) Hal Daumé III & Daniel Marcu: Induction of word and phrase alignments for automatic document summarization. Computational Linguistics 31 (4), pp. 505-530. [PDF, 507KB]

(2006) John DeNero, Dan Gillick, James Zhang, & Dan Klein: Why generative phrase models underperform surface heuristics.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 31-38 [PDF, 393KB]

(2006)Yonggang Deng & William Byrne: MTTK: an alignment toolkit for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 265-268 [PDF, 135KB]

(2006) Adria de Gispert & José B.Mariño: Linguistic tuple segmentation in n-gram-based statistical machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1049; abstract [PDF, 90KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Mark Hopkins & Jonas Kuhn: A framework for incorporating alignment information in parsing. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.9-16 [PDF, 311 KB]

(2006) Radu Ion, Alexandru Ceauşu, & Dan Tufiş: Dependency-based phrase alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1290-1293 [PDF, 430KB]

(2006) Ding Liu & Daniel Gildea: Stochastic iterative alignment for machine translation evaluation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.539-546. [PDF, 201KB]

(2006) Xiaoyi Ma: Champollion: a robust parallel text sentence aligner.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.489-492 [PDF, 313KB]

(2006) Arne Mauser, Evgeny Matusov, & Hermann Ney: Training a statistical machine translation system without GIZA++.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.714-720 [PDF, 325KB]

(2006) Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: Contextual bitext-derived paraphrases in automatic MT evaluation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 86-93 [PDF, 222KB]

(2006) Gábor Pohl: English-Hungarian NP alignment in MetaMorpho TM. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.69-74 [PDF, 81KB]

(2006) J.A.Sánchez & J.M.Benedí: Stochastic inversion transduction grammars for obtaining word phrases for phrase-based statistical machine translation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 130-133 [PDF, 91KB]

(2006) Bettina Schrader: ATLAS – a new text alignment architecture. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.715-722. [PDF, 139KB]

(2006) Nasredine Semmar & Christian Fluhr: Using cross-language information retrieval for sentence alignment.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.95-104. [PDF, 254KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Dan Ştefănescu & Dan Tufiş: Aligning multilingual thesauri. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.475-478 [PDF, 461KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2006) David Vilar, Maja Popovic, & Hermann Ney: AER: do we need to “improve” our alignments? International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 205-212 [PDF, 108KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

(2006) Martin Volk, Sofia Gustafson-Capková, Joakim Lundborg, Torsten Marek, Yvonne Samuelsson, & Frida Tidström: XML-based phrase alignment in parallel treebanks. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.93-96. [PDF, 103KB]

(2006) Jia Xu, Richard Zens, & Hermann Ney: Partitioning parallel documents using binary segmentation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 78-85 [PDF, 433KB]

(2006) Hao Zhang & Daniel Gildea: Efficient search for inversion transduction grammar.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 224-231. [PDF, 172KB]

(2005) proceedings of ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Emmanuel Giguet: Multi-grained alignment of parallel texts with endogenous resources. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.12-17 [PDF, 194KB]

(2005) Declan Groves & Andy Way: Hybrid data-driven models of machine translation [abstract]. Machine Translation 19 (3-4), 2005; pp. 301-323.

(2005) Sanjika Hewavitharana, Stephan Vogel, & Alex Waibel: Augmenting a statistical translation system with a translation memory. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 126132. [PDF, 74KB]

(2005) Jae Dong Kim, Ralf D. Brown, Peter J. Jansen, & Jaime G. Carbonell: Symmetric probabilistic alignment for example-based translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 153-159. [PDF, 63KB]

(2005) Philipp Koehn: Europarl: a parallel corpus for statistical machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.79-86. [PDF, 123KB]

(2005) Takeshi Kutsumi, Takehiko Yoshimi, Katsunori Kotani, Ichiko Sata, & Hitoshi Isahara: Selection of entries for a bilingual dictionary from aligned translation equivalents using support vector machines. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.11-16. [PDF, 253KB]

(2005) Jong-Hoon Oh & Key-Sun Choi: Automatic extraction of English-Korean translations for constituents of technical terms. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.67-72. [PDF, 226KB]

(2005) Stephan Vogel: PESA: phrase pair extraction as sentence splitting. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.251-258. [PDF, 192KB]

(2005) Jia Xu, Richard Zens, & Hermann Ney: Sentence segmentation using IBM word alignment model 1. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 280-287. [PDF, 145KB]

(2005) Hao Zhang & Daniel Gildea: Stochastic lexicalized inversion transduction grammar for alignment. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 4787-482. [PDF, 102KB]

(2005) Ying Zhang & Stephan Vogel: An efficient phrase-to-phrase alignment model for arbitrarily long phrase and large corpora. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 294-301. [PDF, 187KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.85-90. [PDF, 936KB]

(2005) Yujie Zhang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Building an annotated Japanese-Chinese parallel corpus – a part of NICT multilingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.71-78. [PDF, 1139KB]

Analogies and analogical modelling (see also Example-based methods)

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Harold Somers, Sandipan Dandapat, & Sundip Kumar Naskar: A review of EBMT using proportional analogies. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.53-60. [PDF, 337KB]

(2007) Etienne Denoual: Analogical translation of unknown words in a statistical machine translation framework. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.135-141 [PDF, 94KB]

(2007) Philippe Langlais & Alexandre Patry: Translating unknown words using analogical learning. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 877-886. [PDF, 155KB]

(2005) Yves Lepage & Etienne Denoual: ALEPH: an EBMT system based on the preservation of proportional analogies between sentences across langauges. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 388KB]

(2005) Yves Lepage & Etienne Denoual: Purest ever example-based machine translation: detailed presentation and assessment [abstract]. Machine Translation 19 (3-4), 2005; pp.251-282.

Case-based methods

(2005) J.Akshay Iyer & Pushpak Bhattacharyya: Using semantic information to improve case retrieval in case-based reasoning systems. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.347-358 [abstract, PDF, 15KB]

Character alignment

(2009) Dipankar Bose & Sudeshna Sarkar: Learning multi character alignment rules and classification of training data for transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.61-64. [PDF, 289KB]

(2009) Jörg Tiedemann: Character-based PSMT for closely related languages. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.12-19 [PDF, 399KB]

Context-based methods

(2009) Jamie Brunning, Adrià de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Sander Canisius & Antal van den Bosch: A constraint satisfaction approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.182-189. [PDF, 533KB]

(2009) Jaime Carbonell: New paradigms for machine translation. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 35pp. [PDF of PPT, 462KB]

(2009) Marta R.Costa-jussà & Rafael E.Banchs: Barcelona Media SMT system description for the IWSLT 2009: introducing source context information. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 24-28. [PDF, 265KB]; poster [PDF, 281KB[

(2009) Jinhua Du, Yanjun Ma & Andy Way: Source-side context-informed hypothesis alignment for combining outputs from machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.230-27. [PDF, 277KB]

(2009) Rejwanul Haque, Sandipan Dandapat, Ankit Kumar Srivastava, Sudip Kumar Naskar & Andy Way: English-Hindi transliteration using context-informed PB-SMT: the DCU system for NEWS 2009. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.104-107. [PDF, 98KB]

(2009) Ruhi Sarikaya, Sameer Maskey, Rong Zhang & Ea-Ee Jan: Iterative sentence-pair extraction from quasi-parallel corpora for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Libin Shen, Jinxi Xu, Bing Zhang, Spyros Matsoukas & Ralph Weischedel: Effective use of linguistic and contextual information for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.72-80. [PDF, 122KB]

(2009) Rie Tanaka, Yohei Murakami, & Toru Ishida: Context-based approach for pivot translation services. IJCAI-09: Twenty-first International Joint conference on Artificial Intelligence, Pasadena, Californai, USA, 11-17 July 2009; pp.1555-1561. [PDF, 505KB]

(2009) Maarten van Gompel, Antal van den Bosch, & Peter Berck: Extending memory-based machine translation to phrases. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.61-68. [PDF, 207KB]

(2009) Ying Zhang & Nguyen Bach: Virtual Babel: towards context-aware machine translation in virtual worlds. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 387-394. [PDF, 290KB]

(2008) Ralf D.Brown: Exploiting document-level context for data-driven machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.46-55. [PDF, 496KB]

(2008) Maxine Carpuat & Dekai Wu: Evaluation of context-dependent phrasal translation lexicons for statistical machine translation. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

(2008) Kevin Gimpel & Noah A. Smith: Rich source-side context for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.9-17. [PDF, 211KB]

(2007) Marine Carpuat & Dekai Wu: Context-dependent phrasal translation lexicons for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.73-80 [PDF, 425KB]

(2007) Jesús Giménez & Lluís Màrquez: Context-aware discriminative phrase selection for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 159-166 [PDF, 1046KB]

(2007) Nicolas Stroppa, Antal van den Bosch, & Andy Way: Exploiting source similarity for SMT using context-informed features. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.231-240 [PDF, 309KB]; presentation [PDF, 149KB]

(2006) Jaime Carbonell, Steve Klein, David Miller, Michael Steinbaum, Tomer Grassiany, & Jochen Frei: Context-based machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.19-28 [PDF, 338KB]

(2006) Andrei Popescu-Belis, Paula Estrella, Margaret King, & Nancy Underwood: A model for context-based evaluation of language processing systems and its application to machine translation evaluation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.691-696 [PDF, 536KB]

(2006) Evan Ratliff: Me translate pretty one day. Wired 14 (12), 2006; pp.210-213. [PDF, 32KB]

(2005) Young-Sook Hwang & Yutaka Sasaki: Context-dependent SMT model using bilingual verb-noun collocation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 549-556. [PDF, 1005KB]

Disambiguation (see also Word sense disambiguation)

(2009) Pi-Chuan Chang, Dan Jurafsky, & Christopher D.Manning: Disambiguating “DE” for Chinese-English machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.215-223. [PDF, 276KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Gan Jin: Some problems in a French-Chinese machine translation system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.138-142 [abstract]

(2009) Shoichi Yokoyama & Masumi Okuyama: Translation disambiguation of patent sentences using case frames. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; pp. 25-32. [PDF, 75KB]

(2009) Zhifei Li, Jason Eisner, & Sanjeev Khudanpur: Variational decoding for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.593-601. [PDF, 217KB]

(2008) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Orthographic disambiguation incorporating transliterated probability. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.48-55. [PDF, 525KB]

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Aurélien Max, Rafik Makhloufi, & Philippe Langlais: Explorations in using grammatical dependencies for contextual phrase translation disambiguation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.114-119. [PDF, 568KB]

(2008) Wolodja Wentland, Johannes Knopp, Carina Silberer, & Matthias Hartung: Building a multilingual lexical resource for named entity disambiguation, translation and transliteration.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 451KB]

(2007) Marine Carpuat & Dekai Wu: Context-dependent phrasal translation lexicons for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.73-80 [PDF, 425KB]

(2007) Marine Carpuat & Dekai Wu: How phrase sense disambiguation outperforms word sense disambiguation for statistical machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.43-52 [PDF, 313KB]

(2006) Mohammed A. Attia: An ambiguity-controlled morphological analyzer for modern standard Arabic modelling finite state networks.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.48-67. [PDF, 290KB]

(2006) Mary Hearne & Andy Way: Disambiguation strategies for data-oriented translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.59-68 [PDF, 164KB]

(2006) Ted Pedersen, Anagha Kulkarni, Roxana Angheluta, Zornitsa Kozareva, & Thamar Solorio: Improving name discrimination: a language salad approach. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.25-32 [PDF, 268 KB]

(2006) Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff & Oren Etzioni: Ambiguity reduction for machine translation: human-computer collaboration.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.193-202 [PDF, 247KB]

(2005) Igor M.Boguslavsky, Leonid L.Iomdin & Victor G.Sizov: Interactive enconversion by means of the Etap-3 system. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.230-240 [abstract, PDF, 72KB]

(2005) Niladri Chatterjee, Shailly Goyal, Anjali Naithani: Resolving pattern ambiguity for English to Hindi machine translation using WordNet. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.18-25 [PDF, 245KB]

(2005) Gregor Thurmair: Improving machine translation quality. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit: invited paper [PDF, 209KB]

(2005) Ying Zhang & Phil Vines: Using the web for translation disambiguation: RMIT University at NTCIR-5 Chinese-English CLIR.  Proceedings of NTCIR-5 Workshop Meeting, December 6-9, 2005, Tokyo, Japan; 6pp. [PDF, 394KB]

Generation (see also Text prediction)

(2009) Michael Auli, Adam Lopez, Hieu Hoang & Philipp Koehn: A systematic analysis of translation model search spaces. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.224-232. [PDF, 101KB]

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2009) Nizar Habash, Bonnie Dorr, & Christof Monz: Symbolic-to-statistical hybridization: extending generation-heavy machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.23-63.

(2009) Karin Harbusch & Gerard Kempen: Generating clausal coordinate ellipsis multilingually: a uniform approach based on postediting. Proceedings of the 12th European Workshop on Natural Language Generation, March 30-31, 2009, Athens, Greece; pp.138-145. [PDF, 389KB]

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

(2009) Jin-Ji Li, Jungi Kim, Dong-Il Kim, & Jong-Hyeok Lee: Chinese syntactic reordering for adequate generation of Korean verbal phrases in Chinese-to-Korean SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.190-196. [PDF, 323KB]

(2009) Kathleen McKeown: Query-focused summarization using text-to-text generation: when information comes from multilingual sources. [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Language Generation and Summarisation ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; p.3. [PDF, 33KB]

(2009) Behrang Mohit, Frank Liberato, & Rebecca Hwa: Language model adaptation for difficult to translate phrases. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.160-167. [PDF, 343KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Khaled Shaalan, Azza Abdel Monem & Ahmed Rafea: Syntactic generation of Arabic in interlingua-based machine translation framework. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 574KB]

(2009) Vincent Vandeghinste: Tree-based target language modeling. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.152-159. [PDF, 391KB]

(2009) David Vilar & Hermann Ney: On LM heuristics for the cube growing algorithm. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.242-249. [PDF, 459KB]

(2008) Atsushi Fujita & Satoshi Sato: A probabilistic model for measuring grammaticality and similarity of automatically generated paraphrases of predicate phrases. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.225-232. [PDF, 498KB]

(2008) Yuqing Guo, Josef van Genabith, & Haifeng Wang: Dependency-based n-gram models for general purpose sentence realisation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.297-304. [PDF, 138KB]

(2008) Long Jiang & Ming Zhou: Generating Chinese couplets using a statistical MT approach. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.377-384. [PDF, 284KB]

(2008) Azza Abdel Monem, Khaled Shaalan, Ahmed Rafea, & Hoda Baraka: Generating Arabic text in multilingual speech-to-speech machine translation framework [abstract]. Machine Translation 22 (4), December 2008; pp.205-258.

(2008) Kristina Toutanova, Hisami Suzuki, & Achim Ruopp: Applying morphology generation models to machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 514-522. [PDF, 586KB]

(2008) Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, & Ming Zhou: Measure word generation for English-Chinese SMT systems. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 89-96. [PDF, 237KB]

(2007) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Statistical machine translation through global lexical selection and sentence reconstruction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 152-159 [PDF, 458KB]

(2007) Bernd Bohnet: The induction and evaluation of word order rules using corpora based on the two concepts of topological models. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.38-45 [PDF, 1835KB]

(2007) Pierrette Bouillon, Sonia Halimi, Manny Rayner, & Beth Ann Hockey: Adapting a medical speech to speech translation system (MedSLT) to Arabic. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 41-48 [PDF, 145KB]

(2007) Yvette Graham, Deirdre Hogan, & Josef van Genabith: Automatic evaluation of generation and parsing for machine translation with automatically acquired transfer rules. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.5-12 [PDF, 1796KB]

(2007) Nizar Habash: NLG is still relevant to MT.  MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.64-65 [PDF, 478KB]

(2007) Keith Hall & Petr Němec: Generation in machine translation from deep syntactic trees. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.57-64 [PDF, 237KB]

(2007) David Hardcastle: Generalizing syntactic collocates for creative language generation. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.13-21 [PDF, 2114KB]

(2007) Matt Huenerfauth, Liming Zhao, Erdan Gu, & Jan Allbeck: Design and evaluation of an American Sign Language generator. ACL 2007: Proceedings of the Workshop on Embodied Language Processing, June 29, 2007, Prague, Czech Republic; pp.51-58. [PDF, 204KB]

(2007) Kevin Knight: Automatic language translation generation help needs badly. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.1-4 [PDF, 1093KB]

 (2007) Zhanyi Liu, Heifeng Wang, & Hua Wu: Log-linear generation models for example-based machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.305-312 [PDF, 178KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Andrew Mutton, Mark Dras, Stephen Wan, & Robert Dale: GLEU: automatic evaluation of sentence-level fluency. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 344-351 [PDF, 157KB]

(2007) Andrei Popescu-Belis: Evaluation of NLG: some analogies and differences with machine translation and reference resolution. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.66-68 [PDF, 683KB]

(2007) Germán Sanchis & Francisco Casacuberta: Reordering via n-best lists for Spanish-Basque translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.191-198 [PDF, 309KB]; poster [PDF, 115KB]

(2007) Smriti Singh, Mrugank Dalal, Vishal Vachhani, Pushpak Bhattacharyya, & Om P.Damani: Hindi generation from interlingua. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.421-428 [PDF, 206KB]

(2007) Gregor Thurmair: Generation issues in machine translation. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.69-70 [PDF, 466KB]

(2007) Kristina Toutanova & Hisami Suzuki: Generating case markers in machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.49-56 [PDF, 168KB]

(2007) Sebastian Varges: One-way translation: an opportunity for NLG and MT research to interact. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.71-72 [PDF, 497KB]

(2007) Sriram Venkatapathy & Srinivas Bangalore: Three models for discriminative machine translation using global lexical selection and sentence reconstruction. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.96-102 [PDF, 191KB]

(2007) Michael White, Rajakrishnan Rajkumar, & Scott Martin: Towards broad coverage surface realization with CCG. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.22-30 [PDF, 2020KB]

(2007) Yuk Wah Wong & Raymond J.Mooney: Generation by inverting a semantic parser that uses statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.172-179 [PDF, 192KB]

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2006) Tod Allman & Stephen Beale: A natural language generator for minority languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.39-46. [PDF, 301KB]

(2006) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Sequence classification for machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1722; abstract [PDF, 72KB]

(2006) Marta R. Costa-jussà & José A.R. Fonollosa: Statistical machine reordering.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 70-76. [PDF, 254KB]

(2006) Nizar Habash, Bonnie Dorr & Christof Monz: Challenges in building an Arabic-English GHMT system with SMT components. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.56-65 [PDF, 116KB]

(2006) Daniel T. Heinze, Alexander Turchin & V. Jagannathan: Automated interpretation of clinical encounters with cultural cues and electronic health record generation.  HLT-NAACL 2006: Proceedings of the  Workshop on Medical Speech Translation, 9 June 2006, New York, NY, USA; pp.24-31 [PDF, 251KB]

(2006) Howard Johnson, Fatiha Sadat, George Foster, Roland Kuhn, Michel Simard, Eric Joanis, & Samuel Larkin: PORTAGE, with smoothed phrase tables and segment choice models.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 134-137 [PDF, 118KB]

(2006) Roland Kuhn, Denis Yuen, Michel Simard, Patrick Paul, George Foster, Eric Joanis, & Howard Johnson: Segment choice models: feature-rich models for global distortion in statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 25-32 [PDF, 95KB]

(2006) Zhanyi Liu, Haifeng Wang, & Hua Wu: Example-based machine translation on tree-string correspondences and statistical generation [abstract]. Machine Translation 20 (1),2006; pp.25-41.

(2006) Daniel Marcu, Wei Wang, Abdessamad Echihabi, & Kevin Knight: SPMT: statistical machine translation with syntactified target language phrases. EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 44-52. [PDF, 164KB]

(2006) Marian Olteanu, Pasin Suriyentrakorn, & Dan Moldovan: Language models and reranking for machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 150-153 [PDF, 115KB]

(2006) Khaled Shaalan, Azza Abdel Monem, Ahmed Rafea, & Hoda Baraka: Mapping interlingua representations to feature structures of Arabic sentences. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.149-159. [PDF,278KB]

(2006) Radu Soricut & Daniel Marcu: Stochastic language generation using WIDL-expressions and its application in machine translation and summarization.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1105-1112. [PDF, 233KB]

(2006) Wei Wang, Kevin Knight & David Marcu: Capitalizing machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 1-8 [PDF, 171KB]

(2006) Taro Watanabe, Hajime Tsukada, & Hideki Isozaki: Left-to-right target generation for hierarchical phrase-based translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.777-784. [PDF, 224KB]

(2006) John Wogan, Brian Ó Raghallaigh, Áine Ní Bhriain, Eric Zoerner, Harald Berthelsen, Ailbhe Ní Chasaide, & Christer Gobl: Developing a spoken corpus and a synthesiser for Irish (Gaelic). LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.113-116. [PDF, 248KB]

(2005) Stephen Beale, Sergei Nirenburg, Marjorie McShane, & Tod Allman: Document authoring the Bible for minority language translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.63-70. [PDF, 415KB]

(2005) Anja Belz & Ehud Reiter: Evaluation in NLG. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 35pp. [PDF from PPT, 2032KB]

(2005) Etienne Blanc: About and around the French enconverter and the French deconverter. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.157-166 [abstract, PDF, 21KB]

(2005) Igor Boguslavsky: Some lexical issues of UNL. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.101-108 [abstract, PDF, 54KB]

(2005) Jill Burstein & Daniel Marcu: Translation exercise assistant: automated generation of translation exercises for native-Arabic speakers learning English. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 16-17. [PDF, 53KB]

(2005) Stephan Busemann: Issues in generating text from interlingua representations. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.38-50 [abstract, PDF, 14KB]

(2005) Jesús Cardeñosa, Carolina Gallardo & Edmundo Tovar: Standardization of the generation process in a multilingual environment. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.10-24 [abstract, PDF, 15KB]

(2005) Michel Carl, Paul Schmidt, & Jörg Schütz: Reversible template-based Shake & Bake generation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.17-25. [PDF, 367KB]

(2005) Daoud Maher Daoud: Arabic generation in the framework of the Universal Networking Language.  In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.195-209 [abstract, PDF, 58KB]

(2005) Kuntal Dey & Pushpak Bhattacharyya: Universal Networking Language based analysis and generation for Bengali case structure constructs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.215-229 [abstract, PDF, 88KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

(2005) Matt Huenerfauth: American Sign Language generation: multimodal NLG with multiple linguistic channels. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 37-42. [PDF, 91KB]

(2005) Shyamsundar Jayaraman & Alon Lavie: Multi-engine machine translation guided by explicit word matching.  ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 101-104. [PDF, 117KB]

(2005) Stephan Kanthak, David Vilar, Evgeny Matusov, Richard Zens, & Hermann Ney: Novel reordering approaches in phrase-based statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 167-174. [PDF, 172KB]

(2005) Shankar Kumar & William Byrne: Local phrase reordering models for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 161-168. [PDF, 193KB]

(2005) Yves Lepage & Etienne Denoual: Automatic generation of paraphrases to be used as translation references in objective evaluation measures of machine translation. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 57-64. [PDF, 131KB]

(2005) Jorge Marques Pelizzoni & Maria das Graças Volpe Nunes: Flexibility, configurability and optimality in UNL deconversion via multiparadigm programming . In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute);  pp.175-194 [abstract, PDF, 13KB]

(2005) Xiaodong Shi & Yidong Chen: A UNL deconverter for Chinese. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.167-174 [abstract, PDF, 13KB]

(2005) Liu Zhanyi, Wang Haifeng & Wu Hua: Example-based machine translation based on TSC and statistical generation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.25-32. [PDF, 476KB]

(2005) Stella Markantonatou, Sokratis Sofianopoulos, Vassiliki Spilioti, Yiorgos Tambouratzis, Marina Vassiliou, Olga Yannoutsou, & Nikos Ioannou: Monolingual corpus-based MT using chunks MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.91-98. [PDF, 694KB]

(2005) Christoph Neumann: A human-aided machine translation system for Japanese-English patent translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.1-7. [PDF, 139KB]

(2005) Richard Power & Donia Scott: Automatic generation of large-scale paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 73-79. [PDF, 49KB]

(2005) Svetlana Sheremetyeva: Embedding MT for generating patent claims in English from a multilingual interface MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.8-15. [PDF, 883KB]

(2005) Hirokazu Suzuki & Akira Kumano: Learning translations from monolingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.33-40. [PDF, 439KB]

(2005) Erik Velldal & Stephan Oepen: Maximum entropy models for realization ranking. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.109-116. [PDF, 217KB]

(2005) Michael Carl, Ecaterina Rascu, & Paul Schmidt: Using template-grammars for shake & bake paraphrasing. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 66-72. [PDF, 107KB]

Lexical transfer

(2009) Marianna Apidianaki: Data-driven semantic analysis for multilingual WSD and lexical selection in translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.77-85. [PDF, 364KB]

(2009) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Statistical machine translation through global lexical selection and sentence reconstruction.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.185-203.

(2009) Marine Carpuat: One translation per discourse. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.19-27. [PDF, 112KB]

(2009) Bruno Cartoni: Lexical morphology in machine translation: a feasibility study.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.130-138. [PDF, 129KB]

(2009) Jesús Giménez & Lluís Màrquez: Discriminative phrase selection for SMT.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.205-236.

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

(2009) Krister Lindén: Guessers for finite-state transducer lexicons [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 103KB]

(2009) Arne Mauser, Saša Hasan & Hermann Ney: Extending statistical machine translation with discriminative and trigger-based lexicon models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.210-218. [PDF, 266KB]

(2009) Le Thi Sinh: French-Vietnamese noun phrase translation.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.158-163 [abstract]

(2009) Ravi Sinha, Diana McCarthy, & Rada Mihalcea: SemEval-2010 Task 2: cross-lingual lexical substitution. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.76-81. [PDF,

(2009) István Varga & Soichi Yokoyama: Transfer rule generation for a Japanese-Hungarian machine translation system. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.356-362. [PDF, 152KB]

(2009) Billy Wong & Chunyu Kit: ATEC: automatic evaluation of machine translation via word choice and word order [abstract]. Machine Translation 23 (2/3), September 2009; pp.141-151.

(2009) Kun Yu & Junichi Tsujii: Extracting bilingual dictionary from comparable corpora with dependency heterogeneity. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.121-124. [PDF, 132KB]

(2008) Bogdan Babych, Serge Sharoff, & Anthony Hartley: Generalising lexical translation strategies for MT using comparable corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 412KB]

(2008) Maxine Carpuat & Dekai Wu: Evaluation of context-dependent phrasal translation lexicons for statistical machine translation. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 122KB]

(2008) Chi-Ho Li, Dongdong Zhang, Mu Li, Ming Zhou, & Hailei Zhang: An empirical study in source word deletion for phrase-based statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.1-8. [PDF, 490KB]

(2008) Sibansu Mukhopadhyay, Biswath Barik, & Sudeshna Sarkar: Problems of transfer grammar: the Hindi postpositional correlatives in Bangla. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 180KB]

(2008) Violaine Prince & Jacques Chauche: Building a bilingual representation of the Roget thesaurus for French to English machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 142KB]

(2008) Michael Subotin: Generalizing local translation models. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.28-36. [PDF, 148KB]

(2008) Takashi Tsunakawa, Naoaki Okazaki, & Jun’ichi Tsujii: Building bilingual lexicons using lexical translation probabilities via pivot languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 66KB]

(2007) Bogdan Babych, Anthony Hartley, Serge Sharoff, & Olga Mudraya: Assisting translators in indirect lexical transfer.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 136-143 [PDF, 285KB]

(2007) Bogdan Babych, Anthony Hartley, & Serge Sharoff: A dynamic dictionary for discovering indirect translation equivalents.  Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 10pp. [PDF, 150KB]

(2007) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Statistical machine translation through global lexical selection and sentence reconstruction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 152-159 [PDF, 458KB]

(2007) Oren Etzioni, Kobi Reiter, Stephen Soderland, & Marcus Sammer: Lexical translation with application to image searching on the web. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.175-182 [PDF, 603KB]

(2007) Jacek Gintrowicz & Krzysztof Jassem: Using regular expressions in translation memories. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.2, October 15-17, 2007, Wisla, Poland; pp.87-92. [PDF, 200KB]

(2007) Maite Melero, Antoni Oliver, Toni Badia & Teresa Suñol: Dealing with bilingual divergences in MT using target language n-gram models.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 63KB]

(2007) Ruhi Sarikaya, Yonggang Deng, & Yuqing Gao: Context dependent word modeling for statistical machine translation using part-of-speech tags. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2821-2824; abstract [PDF, 23KB]

(2007) R.Mahesh K. Sinha: Using rich morphology in resolving certain Hindi-English machine translation divergence. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.429-433 [PDF, 48KB]

(2007) Vincent Vandeghinste: Removing the distinction between a translation memory, a bilingual dictionary and a parallel corpus. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 21pp. [PDF, 90KB]

(2007) Sriram Venkatapathy & Srinivas Bangalore: Three models for discriminative machine translation using global lexical selection and sentence reconstruction. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.96-102 [PDF, 191KB]

(2006) Lea Cyrus: Building a resource for studying translation shifts.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.697-702 [PDF, 358KB]

(2006) Teruko Mitamura, Mengqiu Wang, Hideki Shima, & Frank Lin: Keyword translation accuracy and cross-lingual question answering in Chinese and Japanese. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.31-38 [PDF, 617KB]

(2006) Lars Nygaard, Jan Tore Lønning, Torbjørn Nordgård, & Stephan Oepen: Using a bi-lingual dictionary in lexical transfer. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.233-238 [PDF, 133KB]

(2006) Agam Patel & Dragomir R.Radev: Lexical similarity can distinguish between automatic and manual translations. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1230-1235 [PDF, 326KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Sara Stymne & Lars Ahrenberg: A bilingual grammar for translation of English-Swedish verb frame divergences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.9-17 [PDF, 139KB]

(2006) David Talbot & Miles Osborne: Modelling lexical redundancy for machine translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.969-976. [PDF, 207KB]

(2005) Clara Cabezas & Philip Resnick: Using WSD techniques for lexical selection in statistical machine translation. College Park, ND: Institute for Advanced Computer Studies, University of Maryland, July 2005 (CS-TR-4736, LAMP-TR-124, UMIACS-TR-2005-42) 12pp. [PDF, 240KB]

Morphological analysis (see also Segmentation, Tagging)

(2009) Ibrahim Badr, Rabih Zbib, & James Glass: Syntactic phrase reordering for English-to-Arabic statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.86-93. [PDF, 115KB]

(2009) Nicola Bertoldi, Arianna Bisazza, Mauro Cettolo, Germán Sanchis-Trilles, & Marcello Federico: FBK @ IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 37-44. [PDF, 346KB]; poster [PDF, 411KB]

(2009) Arianna Bisazza & Marcello Federico: Morphological pre-processing for Turkish to English statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 129-135. [PDF, 373KB]; presentation [PDF of PPT, 302KB]

(2009) Fethi Bougares, Laurent Besacier, & Hervé Blanchon: LIG approach for IWSLT09: using multiple morphological segmenters for spoken language translation of Arabic. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 60-64. [PDF, 377KB]; presentation [PDF of PPT, 668KB]

(2009) Marine Carpuat: Toward using morphology in French-English phrase-based SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.150-154. [PDF, 92KB]

(2009) Bruno Cartoni: Lexical morphology in machine translation: a feasibility study.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.130-138. [PDF, 129KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Relevance of different segmentation options on Spanish-Basque SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.74-80. [PDF, 390KB]

(2009) Abu Zaher Md. Faridee & Francis M.Tyers: Development of a morphological analyser for Bengali. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 43-50. [PDF, 351KB]

(2009) Mireia Farrús, Marta R.Costa-jussà, Marc Poch, Adolfo Hernández, & José B.Mariño: Improving a Catalan-Spanish statistical translation system using morphosyntactic knowledge. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.52-57. [PDF, 399KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Rauf Fatullayev, Sevinc Mammadova, & Abulfat Fatullayev: Translating composite sentences in Azerbaijani-English MT system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.104-113 [abstract]

(2009) Alexander Fraser: Experiments in morphosyntactic processing for translating to and from German.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.115-119. [PDF, 75KB]

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2009) Adrià de Gispert, Sami Virpioja, Mikko Kurimo, & William Byrne: Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.73-76. [PDF, 72KB]

(2009) Hieu Hoang & Philipp Koehn: Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379. [PDF, 184KB]

(2009) Maria Holmqvist, Sara Stymne, Jody Foo, & Lars Ahrenberg: Improving alignment for SMT by reordering and augmenting the training corpus.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.120-124. [PDF, 102KB]

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

 (2009) Gorka Labaka: EUSMT: incorporating linguistic information to statistical machine translation for a morphologically rich language. Its use in preliminary SMT-RBMT-EBMT hybridization. Summary of Dissertation submitted to the Department of Computer Languages and Systems, University of the Basque Country, 2009; 80pp. [PDF, 1244KB]

(2009) Pierre Mahé & Nicola Cancedda: Linguistically enriched word-sequence kernels for discriminative language modeling.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.111-128.

(2009) Coşkun Mermer, Hamza Kaya, & Mehmet Uğur Doğan: The TÜBİTAK-UEKAE statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 113-117. [PDF, 457KB]; poster [PDF, 2530KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Ibrahim M.Saleh & Nizar Habash: Automatic extraction of lemma-based bilingual dictionaries for morphologically rich languages. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 622KB]

(2009) Mehrnoush Shamsfard, Soheila Kiani & Yaser Shahedi: STeP-1: standard text preparation for Persian language. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 659KB]

(2009) Wade Shen, Brian Delaney, A.Ryan Aminzadeh, Tim Anderson, & Ray Slyh: The MIT-LL/AFRL IWSLT-2009 MT system. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 71-78. [PDF, 398KB]; presentation [PDF of PPT, 1730KB]

(2009) Oscar Täckström: The impact of morphological errors in phrase-based statistical machine translation from English and German into Swedish. Proceedings of 4th Langauge and Technology Conference, November 6-8, Poznań, Poland; 5pp. [PDF, 126KB]

(2008) Karunesh Arora, Michael Paul & Eiichiro Sumita: Translation of unknown words in phrase-based statistical machine translation for languages of rich morphology. First International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU-2008), Hanoi University of Technology, Hanoi, Vietnam, May 5-7, 2008; pp. 70-75. [PDF, 187KB]

(2008) Eleftherios Avramidis & Philipp Koehn: Enriching morphologically poor languages for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 763-770. [PDF, 685KB]

(2008) Amittai Axelrod, Mei Yang, Kevin Duh, & Katrin Kirchhoff: The University of Washington machine translation system for ACL WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.123-126. [PDF, 65KB]

(2008) Rafael E. Banchs & Haizhou Li: Exploring Spanish-morphology effects on Chinese-Spanish SMT. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 49-53. [PDF, 263KB]

(2008) Nicola Cancedda: Shaping research from user requirements, and other exotic things. [Abstract of] Invited talk at: EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; p.4. [PDF, 449KB]

(2008) Bruno Cartoni: Lexical resources for automatic translation of constructed neologisms: the case study of relational adjectives. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 57KB]

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

(2008) Peter A. Chew & Ahmed Abdelali: The effects of language relatedness on multilingual information retrieval: a case study with Indo-European and Semitic languages. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.1-9. [PDF, 196KB]

(2008) Peter A.Chew, Brett W.Bader, & Ahmed Abdelali: Latent morpho-semantic analysis: multilingual information retrieval with character n-grams and mutual information. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.129-136. [PDF, 117KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Inductive detection of language features via clustering minimal pairs: towards feature-rich grammars in machine translation.  Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.78-86. [PDF, 246KB]

(2008) Béatrice Daille & Emmanuel Morin: A effective compositional model for lexical alignment. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp 95-102. [PDF, 401KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Rauf Fatullayev, Ali Abbasov, & Abulfat Fatullayev: Set of active suffix chains and its role in development of the MT system for Azerbaijani. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 20-22 October 2008; pp.363-368. [PDF, 288KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Vishal Goyal & Gurpreet Singh Lehal: Hindi morphological analyzer and generator. Proceedings of the First International Conference on Emerging Trends in Engineering and Technology, July 16-18, 2008 (IEEE Computer Society Press, 2008); pp. 1156-1159. [PDF, 213KB]

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

(2008) Maxim Khalilov, Adolfo Hernández H., Marta R. Costa-jussà, Josep M. Crego, Carlos A. Henríquez Q., Patrik Lambert, José A.R. Fonollosa, José B. Mariño, & Rafael E. Banchs: The TALP-UPC ngram-based statistical machine translation systems for ACL-WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.127-130. [PDF, 128KB]

(2008) Igor Mel’čuk & Leo Wanner: Morphological mismatches in machine translation [abstract]. Machine Translation 22 (3), September 2008; pp.101-152.

(2008) ThuyLinh Nguyen & Stephan Vogel: Context-based Arabic morphological analysis for machine translation. CoNLL: proceedings of the  Twelfth Conference on Computational Natural Language Learning, 16-17 August 2008, Manchester, UK; pp.135-142. [PDF, 123KB]

(2008) Ananthakrishnan Ramanathan, Pushpak Bhattacharyya, Jayprasad Hegde, Ritesh M.Shah, & Sasikumar M: Simple syntactic and morphological processing can help English-Hindi statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.513-520. [PDF, 368KB]

(2008) Holger Schwenk, Yannick Estève, & Sadaf Abdul Rauf: The LIUM Arabic/English statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 63-68. [PDF, 120KB]; presentation [PDF, 49KB]

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

(2008) Drahomíra (Johanka) Spoustová: Combining statistical and rule-based approaches to morphological tagging of Czech texts. Prague Bulletin of Mathematical Linguistics, no.89, June 2008; pp.23-40. [PDF, 174KB]

(2008) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Effects of morphological analysis in translation between German and English. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.135-138. [PDF, 63KB]

(2008) Kristina Toutanova, Hisami Suzuki, & Achim Ruopp: Applying morphology generation models to machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 514-522. [PDF, 586KB]

(2008) Xianchao Wu, Naoaki Okazaki, Takashi Tsunakawa, & Jun’ichi Tsujii: Improving English-to-Chinese translation for technical terms using morphological information. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.202-211 [PDF, 799KB]

(2008) Simon Zwarts & Mark Dras: Morphosyntactic target language matching in statistical machine translation. Australasian Language Technology Workshop 2008 (ALTW2008): Proceedings of the Workshop, 8-10 December 2008, CSIRO ICT Centre, Hobart; pp.169-177. [PDF, 62KB]

(2007) Ondřej Bojar: English-to-Czech factored machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 232-239 [PDF, 158KB]

(2007) Marta R.Costa-jussà & José A.R. Fonollosa: Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a statistical machine translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 171-176 [PDF, 424KB]

(2007) Christopher J. Dyer: The ‘noisier channel’: translation from morphologically complex languages. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 207-211 [PDF, 127KB]

(2007) Chris Dyer: Decoder-guided backoff: using word lattices to improve translation from morphologically complex languages. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 54pp. [PDF of PPT presentation, 1158KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2007) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Getting to know Moses: initial experiments on German-English factored translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 181-184 [PDF, 123KB]

(2007) Ian Lane, Andreas Zollmann, Thuy Linh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying Zhang, & Alex Waibel: The CMU-UKA statistical machine translation systems for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 150KB]; presentation [PDF, 393KB]

(2007) Einat Minkov, Kristina Toutanova, & Hisami Suzuki: Generating complex morphology for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 128-135 [PDF, 219KB]

(2007) Kemal Oflazer & İlknur Durgar El-Kahlout: Exploring different representational units in English-to-Turkish statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 25-32 [PDF, 447KB]

(2007) Aaron B. Phillips, Violetta Cavalli-Sforza, & Ralf D. Brown: Improving example-based machine translation through morphological generalization and adaptation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.369-375 [PDF, 115KB]

(2007) Ruhi Sarikaya & Yonggang Deng: Joint morphological-lexical language modeling for machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.145-148 [PDF, 157KB]

(2007) Nasredine Semmar & Christian Fluhr: Arabic to French sentence alignment: exploration of a cross-language information retrieval approach. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 73-80 [PDF, 156KB]

(2007) R.Mahesh K. Sinha: Using rich morphology in resolving certain Hindi-English machine translation divergence. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.429-433 [PDF, 48KB]

(2007)  S.Saleem, K.Subramanian, R.Prasad, David Stallard, Chia-Lin Kao, P.Natarajan, & R.Suleiman: Improvements in machine translation for English/Iraqi speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2445-2448; abstract [PDF, 23KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: Machine translation between Turkic languages. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 189-192 [PDF, 135KB]

(2007) Sami Virpioja, Jaako J.Väyrynen, Mathias Creutz, & Markus Sadeniemi: Morphology-aware statistical machine translation based on morphs induced in an unsupervised manner. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.491-498 [PDF, 105KB]

(2007) Yokoyama Shoichi & Kennendai Shigehiro: Error correcting system for analysis of Japanese patent sentences. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.24-27. [PDF, 73KB]

(2007) Ruiqiang Zhang & Eiichiro Sumita: Boosting statistical machine translation by lemmatization and linear interpolation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 181-184 [PDF, 91KB]

(2007) Simon Zwarts & Mark Dras: Statistical machine translation of Australian aboriginal languages: morphological analysis with languages of differing morphological richness. Australasian Language Technology Workshop 2007 (ALTW2007): Proceedings of the Workshop, 10-11 December 2007, Melbourne Zoo; pp.134-142. [PDF, 297KB]

(2006) Mohammed A. Attia: An ambiguity-controlled morphological analyzer for modern standard Arabic modelling finite state networks.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.48-67. [PDF, 290KB]

(2006) Anas El Isbihani, Shahram Khadivi, Oliver Bender, & Hermann Ney: Morpho-syntactic Arabic preprocessing for Arabic-to-English statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 15-22 [PDF, 156KB]

(2006) Jesús Giménez & Lluís Màrquez: The LDV-COMBO system for SMT.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 166-169 [PDF, 183KB]

(2006) Deepa Gupta & Marcello Federico: Exploiting word transformation in statistical machine translation from Spanish to English. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.75-80 [PDF, 140KB]

(2006) Nizar Habash & Owen Rambow: MAGEAD: a morphological analyzer and generator for the Arabic dialects. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.681-688. [PDF, 112KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2006) Youssef Kadri & Jian-Yun Nie: Effective stemming for Arabic information retrieval.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.68-74. [PDF, 210KB]

(2006) Nizar Habash & Fatiha Sadat: Arabic preprocessing schemes for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 49-52 [PDF, 72KB]

(2006) Slim Mesfar: Standard Arabic formalization and linguistic platform for its analysis. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.84-94. [PDF, ]

(2006) Thai Phuong Nguyen & Akira Shimazu: Improving phrase-based statistical machine translation with morpho-syntactic analysis and transformation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.138-147 [PDF, 282KB]

(2006) Thai Phuong Nguyen & Akira Shimazu: Improving phrase-based statistical machine translation with morphosyntactic transformation [abstract]. Machine Translation 20 (3),2006; pp.147-166.

(2006) Maja Popović, Adrià de Gispert, Deepa Gupta, Patrik Lambert, Hermann Ney, José B. Mariño, Marcello Federico, & Rafael Banchs: Morpho-syntactic information for automatic error analysis of statistical machine translation output.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 1-6 [PDF, 306KB]

(2006) Ari Rappoport & Tsahi Levent-Levi: Induction of cross-language affix and letter sequence correspondence. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 332 KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Fatiha Sadat & Nizar Habash: Combination of Arabic preprocessing schemes for statistical machine translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1-8. [PDF, 104KB]

(2006) Eric Wehrli: TwicPen: hand-held scanner and translation software for non-native readers. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Interactive Presentation Sessions, Sydney, July 2006; pp.61-64. [PDF, 229KB]

(2006) Mei Yang & Katrin Kirchhoff: Phrase-based backoff models for machine translation of highly inflected languages. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.41-48 [PDF, 209KB]

(2006) Pavel V. Zheltov: An attribute-sample database system for describing Chuvash affixes.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.109-112. [PDF, 103KB]

(2006) Andreas Zollmann, Ashish Venugopal, & Stefan Vogel: Bridging the inflection morphology gap for Arabic statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 201-204 [PDF, 147KB]

(2005) Elliott Franco Drábek & David Yarowsky: Induction of fine-grained part-of-speech taggers via classifier combination and crosslingual projection.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 49-56. [PDF, 109KB]

(2005) Emmanuel Giguet: Multi-grained alignment of parallel texts with endogenous resources. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.12-17 [PDF, 194KB]

(2005) Adrià de Gispert: Phrase linguistic classification and generalization for improving statistical machine translation. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 67-72. [PDF, 90KB]

(2005) Sharon Goldwater & David McClosky: Improving statistical MT through morphological analysis.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 676-683. [PDF, 152KB]

(2005) Gregory Grefenstette, Nasredine Semmar, & Faïza Elkateb-Gara: Modifying a natural language processing system for European languages to treat Arabic in information processing and information retrieval applications.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 31-38. [PDF,

(2005) Nizar Habash & Owen Rambow: Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 573-580. [PDF, 107KB]

(2005) Nizar Habash, Owen Rambow & George Kiraz: Morphological analysis and generation for Arabic dialects.  ACL-2005: Workshop on Computational Approaches to Semitic Languages, University of Michigan, Ann Arbor, 29 June 2005; pp. 17-24. [PDF, 121KB]

(2005) Petr Homola & Vladislav Kuboň: A machine translation system into a minority language. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; p.31-35 [PDF, 178KB]

(2005) Maja Popović, David Vilar, Hermann Ney, Slobodan Jovičić, & Zoran Šarić: Augmenting a small parallel text with morpho-syntactic language resources for Serbian-English statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 41-48. [PDF, 211KB]

(2005) R. Mahesh K. Sinha & Anil Thakur: Machine translation of bi-lingual Hindi-English (Hinglish) text. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.149-156. [PDF, 270KB]

Post-editing and post-processing [see also Crowd sourcing]

(2009) Hervé Blanchon, Christian Boitet, & Cong-Phap Huynh: A web service enabling gradable post-edition of pre-translations produced by existing translation tools: practical use to provide high-quality translation of an online encyclopedia. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 2435KB]

(2009) Jing-Shin Chang & Sheng-Sian Lin: Improving translation fluency with search-based decoding and monolingual statistical machine translation model for automatic post-editing. ROCLING 2009: Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, Taichung, Taiwan, 2009; pp.195-207. [PDF, 590KB]

(2009) Loic Dugast, Jean Senellart, & Philipp Koehn: Statistical post editing and dictionary extraction: Systran/Edinburgh submissions for ACL-WMT2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.110-114. [PDF, 245KB]

(2009) Atefeh Farzindar: Automatic translation management system for legal texts. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 417-424. [PDF, 284KB]

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

(2009) Declan Groves & Dag Schmidtke: Identification and analysis of post-editing patterns for MT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 429-436. [PDF, 304KB]

(2009) Ana Guerberof: Productivity and quality in MT post-editing. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 375KB]

(2009) Karin Harbusch & Gerard Kempen: Generating clausal coordinate ellipsis multilingually: a uniform approach based on postediting. Proceedings of the 12th European Workshop on Natural Language Generation, March 30-31, 2009, Athens, Greece; pp.138-145. [PDF, 389KB]

(2009) Philipp Koehn: A process study of computer-aided translation [abstract]. Machine Translation 23 (4), November 2009; pp.241-263.

(2009) Philipp Koehn: Human translation and machine translation – invited talk. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; 80pp. [PDF of PPT, 599KB]

(2009) Philipp Koehn & Barry Haddow: Interactive assistance to human translators using statistical machine translation methods. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 73-80. [PDF, 217KB]

(2009) Philipp Koehn: A web-based interactive computer aided translation tool. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Software Demonstrations, Suntec, Singapore, 3 August 2009; pp.17-20. [PDF, 127KB]

(2009) A.-L.Lagarda, V.Alabau, F.Casacuberta, R.Silva, & E.Díaz-de-Liaño: Statistical post-editing of a rule-based machine translation system. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.217-220. [PDF, 83KB]

(2009) Elina Lagoudaki: Translation editing environments. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 9pp. [PDF, 453KB]

(2009) Wei-Yun Ma & Kathleen McKeown: Where’s the verb? Correcting machine translation during question answering. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.333-336. [PDF, 170KB]

(2009) Sharon O’Brien, Johann Roturier, & Giselle de Almeida: Post-editing MT output: views from the researcher, trainer, publisher and practitioner. [Tutorial at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 77pp. [PDF of PPT presentation, 876KB]; handout [PDF, 23KB]

(2009) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Interactive machine translation based on partial statistical phrase-based alignments. [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp.330-336. [PDF, 275KB]

(2009) Johann Roturier: Deploying novel MT technology to raise the bar for quality: a review of key advantages and challenges. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 1-8. [PDF, 162KB]; presentation [PDF of PPT, 835KB]

(2009) Holger Schwenk, Loïc Barrault, Yannick Estève, & Patrik Lambert: LIUM’s statistical machine translation system for IWSLT 2009.  IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 65-70. [PDF, 344KB]; poster [PDF, 117KB]

(2009) Khaled Shaalan, Azza Abdel Monem & Ahmed Rafea: Syntactic generation of Arabic in interlingua-based machine translation framework. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 574KB]

(2009) Midori Tatsumi: Correlation between automatic evaluation metric scores, post-editing speed, and some other factors. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.332-339. [PDF, 515KB]

(2009) Phan Thi Thanh Thao: Grammatical and lexical errors analysis of English-Vietnamese translation texts with the Google & EVTRAN engines and post-editing tasks.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.190-197 [abstract]

(2009) Holger Schwenk, Sadaf Abdul-Rauf, Loïc Barrault, & Jean Senellart: SMT and SPE machine translation system for WMT’09.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.130-134. [PDF, 73KB]

(2009) Irina Temnikova & Constantin Orasan: Post-editing experiments with MT for a controlled language.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.249-255 [abstract]

(2008) Takeshi Abekawa & Kyo Kageura: What prompts translators to modify draft translations? An anlysis of basic modification patterns for use in the automatic notification of awkwardly translated text. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.241-248. [PDF, 393KB]

(2008) A. Diaz de Ilarraza, G. Labaka, & K. Sarasola: Statistical post-editing: a valuable method in domain adaptation of RBMT systems for less-resourced languages. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 35-40. [PDF, 364KB]

(2008) Jennifer Doyon, Christine Doran, C.Donald Means, & Domenique Parr: Automated machine translation improvement through post-editing techniques: analyst and translator experiments. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.346-353. [PDF, 712KB]

(2008) Petra Gieselmann: Architecture of the Lucy translation system. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 28 slides. [PDF, 225KB]

(2008) Meghan Lammie Glenn, Stephanie Strassel, Lauren Friedman, & Haejoong Lee: Management of large annotation projects involving multiple human judges: a case study of GALE machine translation post-editing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 144KB]

(2008) Cong-Phap Huynh, Christian Boitet, & Hervé Blanchon: SECTra_w.1: an online collaborative system for evaluating, post-editing and presenting MT translation corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 824KB]

(2008) Lene Offersgaard, Claus Povlsen, Lisbeth Almsten, & Bente Maegaard: Domain specific MT in use. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.150-159. [PDF, 554KB]

(2008) Tim Schlippe, ThuyLinh Nguyen, & Stephan Vogel: Diacritization as a machine translation problem and as a sequence labeling problem. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.270-278 [PDF, 701KB]

(2008) Jörg Schütz: Artificial cognitive MT post-editing intelligence. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.448-453. [PDF, 655KB]

(2008) Nicola Ueffing, Jens Stephan, Evgeny Matusov, Loïc Dugast, George Foster, Roland Kuhn, Jean Senellart, & Jin Yang: Tighter integration of rule-based and statistical MT in serial system combination. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.913-920. [PDF, 205KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2007) Takako Aikawa, Lee Schwartz, Ronit King, Mo Corston-Oliver, & Carmen Lozano: Impact of controlled language on translation quality and post-editing in a statistical machine translation environment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.1-7 [PDF, 314KB]

(2007) Julia Aymerich & Hermes Camelo: Automatic extraction of entries for a machine translation dictionary using bitexts. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.21-27 [PDF, 88KB]

(2007) Loïc Dugast, Jean Senellart, & Philipp Koehn: Statistical post-editing on  SYSTRAN’s rule-based translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 220-223 [PDF, 128KB]

(2007) Rafael Guzmán: Manual MT post-editing: “if it’s not broken, don’t fix it!” Translation Journal 11 (4), October 2007; 6pp. [PDF, 113KB]

(2007) Ehara Terumasa: Rule based machine translation combined with statistical post editor for Japanese to English patent translation. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.13-18. [PDF, 355KB]

(2007) Ariadna Font Llitjós, Jaime Carbonell, & Alon Lavie: Improving transfer-based MT systems with automatic refinements. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.183-190 [PDF, 339KB]

 (2007) Ariadna Font Llitjós & William A. Ridmann: The inner works of an automatic rule refiner for machine translation. METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 412KB]

 (2007) Pierre Isabelle, Cyril Goutte, & Michel Simard: Domain adaptation of MT systems through automatic post-editing.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.255-261 [PDF, 151KB]

(2007) Stephen Richardson: Microsoft machine translation: from research to real user. Invited talk at MT Summit XI, 10-14 September 2007, Copenhagen, Denmark; 35pp. [PDF of PPT presentation, 3535KB]

 (2007) Young-Ae Seo, Chang-Hyun Kim, Seong-Il Yang, & Young-gil Kim: Getting professional translation through user interaction. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.413-419 [PDF, 479KB]

 (2007) Michel Simard, Nicola Ueffing, Pierre Isabelle, & Roland Kuhn: Rule-based translation with statistical phrase-based post-editing. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 203-206 [PDF, 130KB]

 (2007) Michel Simard, Cyril Goutte, & Pierre Isabelle: Statistical phrase-based post-editing. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.508-515 [PDF, 224KB]

(2007) Vincent Vandeghinste: Removing the distinction between a translation memory, a bilingual dictionary and a parallel corpus. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 21pp. [PDF, 90KB]

 (2006) Ariadna Font-Llitjós: Can the Internet help improve machine translation?  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 219-222 [PDF, 135KB]

(2006) Matthew Snover, Bonnie Dorr, Richard Schwartz, Linnea Micciulla, & John Makhoul: A study of translation edit rate with targeted human annotation.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.223-231 [PDF, 185KB]

(2005) Christian Boitet: Gradable quality translations through mutualisation of human translation and revision, and UNL-based MT and coedition. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.395-412 [abstract, PDF, 41KB]

(2005) Sharon O’Brien: Methodologies for measuring the correlations between post-editing effort and machine translatability [abstract]. Machine Translation 19 (1), 2005; pp.37-58.

(2005) Sitthaa Phaholphinyo, Teerapong Modhiran, Nattapol Kritsuthikul, & Thepchai Supnithi: A practical of memory-based approach for improving accuracy of MT. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.41-46. [PDF, 297KB]

(2005) Ariadna Font Llitjós, Jaime G. Carbonell, & Alon Lavie: A framework for interactive and automatic refinement of transfer-based machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 87-96. [PDF, 468KB]

(2005) Michael Kliffer: An experiment in MT post-editing by a class of intermediate/advanced French majors. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 160-165. [PDF, 43KB]

Pre-editing and Pre-processing (see also Controlled languages)

(2009) Bogdan Babych, Anthony Hartley, & Serge Sharoff: Evaluation-guided pre-editing of source text: improving MT-tractability of light verb constructions. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.36-43 [PDF, 336KB]

(2009) Anabela Barreiro & Luís Miguel Cabral: ReEscreve: a translator-friendly multi-pupose paraphrasing software tool. MT Summit XII - Workshop: Beyond Translation Memories: New Tools for Translators MT, August 29, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 594KB]; presentation, 37pp. [PDF of PPT, 1344KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2007) Jakob Elming & Nizar Habash: Combination of statistical word alignments based on multiple preprocessing schemes. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.25-28 [PDF, 123KB]

(2006) Kiyotaka Uchimoto, Naoko Hayashida, Toru Ishida, & Hitoshi Isahara: Automatic detection and semi-automatic revision of non-machine-translatable parts of a sentence.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.703-708 [PDF, 330KB]

Rapid development of MT systems

(2008) Toni Badia, Maite Melero, & Oriol Valentin: Rapid deployment of a new METIS language pair: Catalan-English. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 122KB]

(2008) Christopher Dyer, Aaron Cordova, Alex Mont, & Jimmy Lin: Fast, easy, and cheap: construction of statistical machine translation models with MapReduce. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.199-207. [PDF, 228KB]

(2008) C.-L. Kao, S.Saleem, R.Prasad, F.Choi, P.Natarajan, D.Stallard, K.Krstovski, & M.Kamali: Rapid development of an English/Farsi speech-to-speech translation system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.166-173 [PDF, 143KB]; presentation [PDF, 95KB]

(2008) Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime Carbonell, Robert Frederking, Erik Peterson, & Katharina Probst: Linguistic structure and bilingual informants help induce machine translation of lesser-resourced languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2007) Nguyen Bach, Matthais Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2007) Tanja Schultz, Alan W.Black, Sameer Badaskar, Matthew Hornyak, & John Kominek: SPICE: web-based tools for rapid language adaptation in speech processing systems. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2125-2128; abstract [PDF, 23KB]

(2007) Sylvain Surcin, Elke Lange, & Jean Senellart: Rapid development of new language pairs at SYSTRAN. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.443-449 [PDF, 114KB]

(2007) Svetlana Sheremetyeva: On portability of resources for a quick ramp up of multilingual MT of patent claims. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.28-33. [PDF, 66KB]

(2007) Jernej Vicic: Rapid development of RBMT systems for related languages. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 12pp. [PDF, 154KB]

(2006) Ahmed Abdelali, James Cowie, Steve Helmreich, Wanying Jin, Maria Pilar Milagros, Bill Ogden, Hamid Mansouri Rad & Ron Zacharski: Guarani: a case study in resource development for quick ramp-up MT. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.1- 9 [PDF, 344KB]

(2005) Herman A. Engelbrecht & Tanja Schultz: Rapid development of an Afrikaans English speech-to-speech translator. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 171KB]

(2005) Hemali Majithia, Philip Rennart & Evelyne Tzoukermann: Rapid ramp-up for statistical machine translation: minimal training for maximal coverage . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.438-444. [PDF, 160KB]

Re-ordering (see also Generation, Syntactic transfer)

(2009) Nguyen Bach, Qin Gao, & Stephan Vogel: Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24. [PDF, 211KB]

(2009) Ibrahim Badr, Rabih Zbib, & James Glass: Syntactic phrase reordering for English-to-Arabic statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.86-93. [PDF, 115KB]

(2009) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Statistical machine translation through global lexical selection and sentence reconstruction.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.185-203.

(2009) Alexandra Birch, Phil Blunsom & Miles Osborne: A quantitative analysis of reordering phenomena. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.197-205. [PDF, 808KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) John DeNero, Adam Pauls, & Dan Klein: Asynchronous binarization for synchronous grammars. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.141-144. [PDF, 183KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Reordering on Spanish-Basque SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 207-213. [PDF, 327KB]

(2009) Jakob Elming & Nizar Habash: Syntactic reordering for English-Arabic phrase-based machine translation. Proceedings of the EACL 2009 Workhop on Computational Approaches to Semitic Languages, Athens, Greece, 31 March 2009; pp.69-77. [PDF, 155KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Kei Hashimoto, Hirofumi Yamamoto, Hideo Okuma, Eiichiro Sumita, & Keiichi Tokuda: Reordering model using syntactic information of a source tree for statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.69-77. [PDF, 383KB]

(2009) Kenneth Heafield, Greg Hanneman, & Alon Lavie: Machine translation system combination with flexible word ordering. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.56-60. [PDF, 104KB]

(2009) Hieu Hoang & Philipp Koehn: Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379. [PDF, 184KB]

(2009) Maria Holmqvist, Sara Stymne, Jody Foo, & Lars Ahrenberg: Improving alignment for SMT by reordering and augmenting the training corpus.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.120-124. [PDF, 102KB]

 (2009) Maxim Khalilov & José A.R.Fonollosa: N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.424-432. [PDF, 564KB]

(2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: A new subtree-transfer approach to syntax-based reordering for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.197-204. [PDF, 1078KB]

(2009) Philipp Koehn & Barry Haddow: Edinburgh’s submission to all tracks of the WMT2009 shared task with reordering and speed improvements to Moses.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.160-164. [PDF, 105KB]

(2009) Jin-Ji Li, Jungi Kim, Dong-Il Kim, & Jong-Hyeok Lee: Chinese syntactic reordering for adequate generation of Korean verbal phrases in Chinese-to-Korean SMT.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.190-196. [PDF, 323KB]

(2009) Hwidong Na, Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Improving fluency by reordering target constituents using MST parser in English-to-Japanese phrase-based SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.276-283. [PDF, 381KB]

(2009) Toshiaki Nakazawa & Sadao Kurohashi: Statistical phrase alignment model using dependency relation probability.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18. [PDF, 670KB]

(2009) Vinh Van Nguyen, Akira Shimazu, Minh Le Nguyen & Thai Phuong Nguyen: Improving a lexicalized hierarchical reordering model using maximum entropy. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 89-96. [PDF, 172KB]

(2009) Yizhao Ni, Craig J.Saunders, Sandor Szedmak, & Mahesan Niranjan: Handling phrase reorderings for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 4 August 2009; pp.241-244. [PDF, 465KB]

(2009) Jan Niehues, Teresa Herrmann, Muntsin Kolss & Alex Waibel: The Universität Karlsruhe translation system for the EACL-WMT 2009. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.80-84. [PDF, 92KB]

(2009) Jan Niehues & Muntsin Kolss: A POS-based model for long-range reorderings in SMT. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.206-214. [PDF, 200KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Hendra Setiawan, Min-Yen Kan, Haizhou Li, & Philip Resnik: Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332. [PDF, 504KB]

(2009) Roy Tromble & Jason Eisner: Learning linear ordering problems for better translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1007-1016. [PDF, 178KB]

(2009) Peng Xu, Jaeho Kang, Michael Ringgard, & Franz Och: Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253. [PDF, 273KB]

(2009) Sirvan Yahyaei & Christof Monz: Decoding by dynamic chunking for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.160-167. [PDF, 127KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, Chew Lim Tan: Forest-based tree sequence to string  translation model. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.172-180. [PDF, 254KB]

(2009) Min Zhang & Haizhou Li: Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.698-707. [PDF, 294KB]

(2008) Marta R. Costa-jussà & José A.R.Fonollosa: Computing multiple weighted reordering hypotheses for a statistical machine translation phrase-based system. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.82-88. [PDF, 623KB]

(2008) Marta R.Costa-jussà, José A.R.Fonollosa, & Enric Monte: Using reordering in statistical machine translation based on recursive alignment block classification.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 96KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Jakob Elming: Syntactic reordering integrated with phrase-based SMT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.46-54. [PDF, 141KB]

(2008) Jakob Elming: Syntactic reordering integrated with phrase-based SMT. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.209-216. [PDF, 140KB]

(2008) Michel Galley & Christopher D.Manning: A simple and effective hierarchical phrase reordering model. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.848-856. [PDF, 230KB]

(2008) Niyu Ge, Abe Ittycheriah, & Kishinore Papineni: Multiple reorderings in phrase-based machine translation. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.61-68. [PDF, 1-6KB]

(2008) Yanqing He & Chengqing Zong: A generalized reordering model for phrase-based statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.117-124. [PDF, 661KB]

(2008) Hieu Hoang: Short range re-ordering using POS tags. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 24 slides. [PDF, 66KB]

(2008) Jason Katz-Brown & Michael Collins: Syntactic reordering in preprocessing for Japanese®English translation: MIT system description for NTCIR-7 patent translation task. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 409-414. [PDF, 424KB]

(2008) Maxim Khalilov, Adolfo Hernández H., Marta R. Costa-jussà, Josep M. Crego, Carlos A. Henríquez Q., Patrik Lambert, José A.R. Fonollosa, José B. Mariño, & Rafael E. Banchs: The TALP-UPC ngram-based statistical machine translation systems for ACL-WMT 2008. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.127-130. [PDF, 128KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60. [PDF, 98KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Using syntactic coupling features for discriminating phrase-based translations (WMT-08 shared translation task). ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.159-162. [PDF, 68KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: A linguistically annotated reordering model for BTG-based statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 149-152. [PDF, 319KB]

(2008) Vineet Yadav, Sriram Venkatapathy, & Karthik Gali: Comparative analysis of phrasal cohesion of English-Hindi with English-French. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 76KB]

(2008) Hirofumi Yamamoto, Hideo Okuma, & Eiichiro Sumita: Imposing constraints from the source tree on ITG constraints for SMT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.1-9. [PDF, 171KB]

(2008) Mikio Yamamoto, Jyunya Norimatsu, Mitsuru Koshikawa, Takahiro Fukutomi, Taku Nishio, Kugatsu Sadamitsu, & Takehito Utsuro: Diversion of hierarchical phrases as reordering templates. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 466-470. [PDF, 917KB]

(2008) Dakun Zhang, Le Sun, & Wenbo Li: A structured prediction approach for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.649-654. [PDF, 521KB]

(2008) Jiajun Zhang, Chengqing Zong, & Shoushan Li: Sentence type based reordering model for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1089-1096. [PDF, 151KB]

(2008) Min Zhang, Hongfei Jiang, Aiti Aw, Haizhou Li, Chew Lim Tan, & Sheng Li: A tree sequence alignment-based tree-to-tree translation model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 559-567. [PDF, 317KB]

(2008) Bing Zhao & Yaser Al-Onaizan: Generalizing local and non-local word-reordering patterns for syntax-based machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581. [PDF, 538KB]

(2007) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Statistical machine translation through global lexical selection and sentence reconstruction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 152-159 [PDF, 458KB]

(2007) Pi-Chuan Chang & Kristina Toutanova: A discriminative syntactic word order model for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 9-16 [PDF, 235KB]

(2007) Marta R. Costa-jussà, Josep M.Crego, Patrik Lambert, Maxim Khalilov, José A.R.Fonollosa, José B.Mariño, & Rafael E.Banchs: Ngram-based statistical machine translation enhanced with multiple weighted reordering hypotheses.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 167-170 [PDF, 208KB]

(2007) Marta R.Costa-jussà & José A.R. Fonollosa: Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a statistical machine translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 171-176 [PDF, 424KB]

(2007) John DeNero & Dan Klein: Tailoring word alignments to syntactic machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 17-24 [PDF, 415KB]

(2007) Markus Dreyer, Keith Hall, & Sanjeev Khudanpur: Comparing reordering constraints for SMT using efficient BLEU oracle computation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.103-110 [PDF, 206KB]

(2007) Chi-Ho Li, Dongdong Zhang, Mu Li, Ming Zhou, Minghui Li, & Yi Guan: A probabilistic approach to syntax-based reordering for statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 720-727 [PDF, 7105KB]

(2007) Kay Rottmann & Stephan Vogel: Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180 [PDF, 3725KB]; presentation [PDF, 1129KB]

(2007) Germán Sanchis & Francisco Casacuberta: Reordering via n-best lists for Spanish-Basque translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.191-198 [PDF, 309KB]; poster [PDF, 115KB]

(2007) Hendra Setiawan, Min-Yen Kan, & Haizhou Li: Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719 [PDF, 213KB]

(2007) Chao Wang, Michael Collins, & Philipp Koehn: Chinese syntactic reordering for statistical machine translation.  EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 737-745. [PDF, 161KB]

(2007) Dongdong Zhang, Mu Li, Chi-Ho Li, & Ming Zhou: Phrase reordering model integrating syntactic knowledge for SMT. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 533-540. [PDF, 325KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Improved chunk-level reordering for statistical machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 255KB]; presentation [PDF, 1644KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2007) Simon Zwarts & Mark Dras: Syntax-based word reordering in phrase-based statistical machine translation: why does it work? MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.559-566 [PDF, 98KB]

(2006) Josep Maria Crego & José B.Mariño: Improving statistical MT by coupling reordering and decoding [abstract]. Machine Translation 20 (3),2006; pp.199-215.

(2006) Josep M. Crego, Adrià de Gispert, Patrik Lambert, Marta R. Costa-jussà, Maxim Khalilov, Rafael E.Banchs, José B.Mariño, & José A.R.Fonollosa: N-gram-based SMT system enhanced with reordering patterns.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 162-165 [PDF, 209KB]

(2006) Masaaki Nagata, Kuniko Saito, Kazuhide Yamamoto, & Kazuteru Ohashi: A clustered global phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.713-720. [PDF, 182KB]

(2006) Deyi Xiong, Qun Liu, & Shouxun Lin: Maximum entropy based phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.521-528. [PDF, 420KB]

(2006) Richard Zens & Hermann Ney: Discriminative reordering models for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 55-63 [PDF, 153KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

(2006) Simon Zwarts & Mark Dras: This phrase-based SMT system is out of order: generalized word reordering in machine translation. Australasian Language Technology Workshop 2006 (ALTW2006): Proceedings of the Workshop, 30 November - 1 December 2006, University of Sydney; pp.149-156. [PDF, 86KB]

(2005) Stephan Kanthak, David Vilar, Evgeny Matusov, Richard Zens, & Hermann Ney: Novel reordering approaches in phrase-based statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 167-174. [PDF, 172KB]

Reusability

(2009) Kepa Sarasola: Matxin: developing sustainable machine translation for a less-resourced language [abstract]. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 1.

(2009) Kevin Unhammer & Trond Trosterud: Reuse of free resources in machine translation between Nynorsk and Bokmål. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 35-42. [PDF, 219KB]

(2008) I.Alegria, X.Arregi, A.Diaz de Ilarraza, G.Labaka, M.Lersundi, A.Mayor, & K.Sarasola: Strategies for sustainable MT for Basque: incremental design, reusability, standardization and open source. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.59-64. [PDF, 74KB]

(2008) Stephan Oepen: Grammar-based processing and re-usable hybrid MT – an experience report in the wilderness. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 32 slides. [PDF, 254KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2007) Svetlana Sheremetyeva: On portability of resources for a quick ramp up of multilingual MT of patent claims. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.28-33. [PDF, 66KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2006) G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, & Pavel Pecina: Leveraging reusability: cost-effective lexical acquisition for large-scale ontology translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.945-952. [PDF, 159KB]

(2006) Bogdan Sacaleanu & Günter Neumann: Cross-cutting aspects of cross-language question answering systems. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multilingual Question Answering (MLQA06), Trento, Italy, April 4, 2006; pp.15-22 [PDF, 652KB]

(2005) Ondřej Bojar, Petr Homola, & Vladislav Kuboň: Problems of reusing an existing MT system. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.179-184. [PDF, 73KB]

(2005) Ondřej Bojar, Petr Homola & Vladislav Kuboň: An MT system recycled. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.380-387. [PDF, 242KB]

Reversibility

(2009) Andrew Finch & Eiichiro Sumita: Bidirectional phrase-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1124-1132. [PDF, 164KB]

(2007) Pierrette Bouillon, Glenn Flores, Marianne Starlander, Nikos Chatzichrisafis, Marianne Santaholma, Nikos Tsourakis, Manny Rayner, & Beth Ann Hockey: A bidirectional grammar-based medical speech translator. ACL 2007: Proceedings of the Workshop on Grammar-based approaches to spoken language processing (SPEECHGRAM 2007), June 29, 2007, Prague, Czech Republic; pp.41-48. [PDF, 216KB]

(2005) Michel Carl, Paul Schmidt, & Jörg Schütz: Reversible template-based Shake & Bake generation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.17-25. [PDF, 367KB]

Segmentation (see also Morphological analysis, Tagging)

(2009) Timothy Baldwin: The hare and the tortoise: speed and accuracy in translation retrieval [abstract]. Machine Translation 23 (4), November 2009; pp.195-240.

(2009) Arianna Bisazza & Marcello Federico: Morphological pre-processing for Turkish to English statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 129-135. [PDF, 373KB]; presentation [PDF of PPT, 302KB]

(2009) Fethi Bougares, Laurent Besacier, & Hervé Blanchon: LIG approach for IWSLT09: using multiple morphological segmenters for spoken language translation of Arabic. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 60-64. [PDF, 377KB]; presentation [PDF of PPT, 668KB]

(2009) Lars Bungum & Stephan Oepen: Automatic translation of Norwegian noun compounds. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.136-143. [PDF, 465KB]

(2009) Tagyoung Chung & Daniel Gildea: Unsupervised tokenization for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.718-726. [PDF, 197KB]

(2009) Arantza Díaz de Ilarraza, Gorka Labaka, & Kepa Sarasola: Relevance of different segmentation options on Spanish-Basque SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.74-80. [PDF, 390KB]

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Chris Dyer, Hendra Setiawan, Yuval Marton, & Philip Resnik: The University of Maryland statistical machine translation system for the Fourth Workshop on Machine Translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.145-149. [PDF, 120KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Alexander Fraser: Experiments in morphosyntactic processing for translating to and from German.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.115-119. [PDF, 75KB]

(2009) Satoshi Kamatani, Tetsuro Chino & Kazuo Sumita: Hybrid spoken language translation using sentence splitting based on syntax structure. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 57-64. [PDF, 167KB]

(2009) Maoxi Li, Jiajun Zhang, Yu Zhou, & Chengqing Zong: The CASIA statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 83-90. [PDF, 317KB]; presentation [PDF of PPT, 463KB]

(2009) Yanjun Ma & Andy Way: Bilingually motivated domain-adapted word segmentation for statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.549-557. [PDF, 325KB]

(2009) Yanjun Ma, Tsuyoshi Okita, Özlem Çetinoğlu, Jinhua Du, & Andy Way: Low-resource machine translation using MaTrEx: the DCU machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 29-36. [PDF, 305KB]; presentation [PDF of PPT, 466KB]

(2009) Preslav Nakov, Chang Liu, Wei Lu, & Hwee Tou Ng: The NUS statistical machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 91-98. [PDF, 338KB]; presentation [PDF of PPT, 787KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Mehrnoush Shamsfard, Soheila Kiani & Yaser Shahedi: STeP-1: standard text preparation for Persian language. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 659KB]

(2009) Massoud Sharifi-Atashgah & Mahmood Bijankhan: Corpus-based analysis for multi-token units in Persian. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 817KB]

(2009) Ankit Srivastava, Sergio Penkale, Declan Groves, & John Tinsley: Evaluating syntax-driven approaches to phrase extraction for MT.  Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.19-28. [PDF, 250KB]; presentation [PDF of PPT, 329KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2009) Maarten van Gompel, Antal van den Bosch, & Peter Berck: Extending memory-based machine translation to phrases. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.61-68. [PDF, 207KB]

(2009) Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura & Sadaoki Furui: Combining a two-step conditional random field and a joint source channel model for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.72-75. [PDF, 110KB]

(2009) Fan Yang, Jun Zhao, & Kang Liu: A Chinese-English organization name translation system using heuristic web mining and asymmetric alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.387-395. [PDF, 178KB]

(2008) Ibrahim Badr, Rabih Zbib, & James Glass: Segmentation for English-to-Arabic statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers., June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 153-156. [PDF, 89KB]

(2008) Ming-Hong Bai, Keh-Jiann Chen, & Jason S.Chang: Improving word alignment by adjusting Chinese word segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.249-256. [PDF, 560KB]

(2008) Graeme Blackwood, Adrià de Gispert, & William Byrne: Phrasal segmentation models for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.19-22. [PDF, 112KB]

(2008) Hailong Cao, Masao Utiyama & Eiichiro Sumita: Chinese word segmentation: adaptation for statistical machine translation. CWMT’2008: the 4th China Workshop on Machine Translation, November 27-28, 2008, Beijing; 15 slides [PDF of PPT, 128KB]

(2008) Pi-Chuan Chang, Michel Galley, & Christopher D. Manning: Optimizing Chinese word segmentation for machine translation performance. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.224-232. [PDF, 165KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Nikesh Garera & David Yarowsky: Translating compounds by learning component gloss translation models via multiple languages. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.403-410. [PDF, 497KB]

(2008) Jonghoon Lee, Donghyeon Lee & Gary Geunbae Lee: A transformation-based sentence-splitting method for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; 6pp. [PDF, 195KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Stefan Riezler, Yi Liu, & Alexander Vasserman: Translating queries into snippets for improved query expansion.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.737-744. [PDF, 137KB]

(2008) Benjamin Snyder & Regina Barzilay: Unsupervised multilingual learning for morphological segmentation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 737-745. [PDF, 254KB]

(2008) Sara Stymne & Maria Holmqvist: Processing of Swedish compounds for phrase-based statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.182-191. [PDF, 532KB]

(2008) Jia Xu, Jianfeng Gao, Kristina Toutanova, & Hermann Ney: Bayesian semi-supervised Chinese word segmentation for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1017-1024. [PDF, 137KB

(2008) Ruiqiang Zhang, Keiji Yasuda, & Eiichiro Sumita: Chinese word segmentation and statistical machine translation. ACM Transactions on Speech and Language Processing 5 (2), art.4. [abstract, PDF, 13KB]

(2008) Ruiqiang Zhang & Eiichiro Sumita: Chinese unknown word translation by subword re-segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.225-232. [PDF, 360KB]

(2008) Ruiqiang Zhang, Keiji Yasuda, & Eiichiro Sumita: Improved statistical machine translation by multiple Chinese word segmentation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.216-223. [PDF, 250KB]

(2007) Yi Chang, Ying Zhang, Stephan Vogel, & Jie Yang: Enhancing image-based Arabic document translation using noisy channel correction model. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.89-95 [PDF, 157KB]

(2007) Christian Fügen & Muntsin Kolss: The influence of utterance chunking on machine translation performance. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2837-2840; abstract [PDF, 61KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Alignment-guided chunking. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.114-121 [PDF, 357KB]; presentation [PDF, 377KB]

(2007) Alicia Pérez, Víctor Guijarrubia, Raquel Justo, M. Inés Torres, & Francisco Casacuberta: A comparison of linguistically and statistically enhanced models for speech-to-speech machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 1234KB]; presentation [PDF, 2546KB]

(2007) Sharath Rao, Ian Lane, & Tanja Schultz: Optimizing sentence segmentation for spoken language translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2845-2848; abstract [PDF, 23KB]

(2007) Antal van den Bosch, Nicolas Stroppa, & Andy Way: A memory-based classification approach to marker-based EBMT.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 10pp. [PDF, 251KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2006) Stephen Armstrong, Andy Way, Colm Caffrey, Marian Flanagan, Dorothy Kenny, & Minako O’Hagan: Improving the quality of automated DVD substitles via example-based machine translation. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 13pp. [PDF, 159KB]

(2006) Jianfeng Gao, Mu Li, Andi Wu, & Chang-Ning Huang: Chinese word segmentation and named entity recognition: a pragmatic approach. Computational Linguistics 31 (4), pp. 531-574. [PDF, 1638KB]

(2006) Adria de Gispert & José B.Mariño: Linguistic tuple segmentation in n-gram-based statistical machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1049; abstract [PDF, 90KB]

(2006) Howard Johnson, Fatiha Sadat, George Foster, Roland Kuhn, Michel Simard, Eric Joanis, & Samuel Larkin: PORTAGE, with smoothed phrase tables and segment choice models.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 134-137 [PDF, 118KB]

(2006) Roland Kuhn, Denis Yuen, Michel Simard, Patrick Paul, George Foster, Eric Joanis, & Howard Johnson: Segment choice models: feature-rich models for global distortion in statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 25-32 [PDF, 95KB]

(2006) Evgeny Matusov, Arne Mauser, & Hermann Ney: Automatic sentence segmentation and punctuation prediction for spoken language translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 158-165 [PDF, 117KB]

(2006) Bart Mellebeek, Karolina Owczarzak, Josef Van Genabith & Andy Way: Multi-engine machine translation by recursive sentence decomposition. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.110-118 [PDF, 145KB]

(2006) Jason Riesa & David Yarowsky: Minimally supervised morphological segmentation with applications to machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.185-192 [PDF, 613KB]

(2006) Jia Xu, Richard Zens, & Hermann Ney: Partitioning parallel documents using binary segmentation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 78-85 [PDF, 433KB]

(2005) Etienne Denoual & Yves Lepage: BLEU in characters: towards automatic MT evaluation in languages without word delimiters. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.79-84. [PDF, 495KB]

(2005) Mickel Grönroos & Ari Becks: Bringing intelligence to translation memory technology. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 11pp. [PDF, 80KB]

(2005) John Henderson & William Morgan: Gaming fluency: evaluating the bounds and expectations of segment-based translation memory. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 175-182. [PDF, 194KB]

(2005) Krisda Khankasikam & Nuttanart Muansuwan: Thai word segmentation a lexical semantic approach . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.331-338. [PDF, 155KB]

(2005) Evgeny Matusov, Gregor Leusch, Oliver Bender, & Hermann Ney: Evaluating machine translation output with automatic sentence segmentation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 113KB]

(2005) Emmanuel Planas: SIMILIS: second-generation translation memory software. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 7pp. [PDF, 121KB]

(2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

(2005) Jia Xu, Evgeny Matusov, Richard Zens, & Hermann Ney: Integrated Chinese word segmentation in statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 105KB]

(2005) Jia Xu, Richard Zens, & Hermann Ney: Sentence segmentation using IBM word alignment model 1. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 280-287. [PDF, 145KB]

(2005) Andy Way & Nano Gough: Controlled translation in an example-based environment: what do automatic evaluation metrics tell us? [abstract]. Machine Translation 19 (1), 2005; pp.1-36.

(2005) Ying Zhang & Stephan Vogel: Competitive grouping in integrated phrase segmentation and alignment model.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 159-162. [PDF, 437KB]

Semantic analysis and representation (see also Disambiguation, Latent semantic analysis, Treebanks)

(2009) Marianna Apidianaki: Data-driven semantic analysis for multilingual WSD and lexical selection in translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.77-85. [PDF, 364KB]

(2009) Kathy Baker, Steven Bethard, Michael Bloodgood, Ralf Brown, Chris Callison-Burch, Glen Coppersmith, Bonnie Dorr, Wes Filardo, Kendall Giles, Anni Irvine, Mike Kayser, Lori Levin, Justin Martineau, Jim Mayfield, Scott Miller, Aaron Phillips, Andrew Philpot, Christine Piatko, Lane Schwartz, & David Zajic: Semantically informed machine translation (SIMT). Final report of the 2009 Summer Camp for Applied Language Exploration [John Hopkins University], November 2009; 152pp. [PDF, 5705KB]

(2009) R.Basili, D.De Cao, D.Croce, B.Coppola, & A.Moschitti: Cross-language frame semantic transfer in bilingual corpora [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 76KB]

(2009) Anders Björkelund, Love Hafdell, & Pierre Nugues: Multilingual semantic role labeling.  CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.43-48. [PDF, 313KB]

(2009) Xavier Blanco: Remarks about linguistic analysis, normalization and translation of Spanish “What to do in case of fire” texts. ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.43-48 [abstract]

(2009) Julian Brooke, Milan Tofiloski, & Maite Taboada: Cross-linguistic sentiment analysis: from English to Spanish. [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp.50-57. [PDF, 65KB]

(2009) Marine Carpuat: One translation per discourse. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.19-27. [PDF, 112KB]

(2009) Pi-Chuan Chang, Huihsin Tseng, Dan Jurafsky, & Christopher D.Manning: Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59. [PDF, 558KB]

(2009) Wanxiang Che, Zhenghua Li, Yongqiang Li, Yuhang Guo, Bing Qin, & Ting Liu: Multingual dependency-based syntactic and semantic parsing. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.49-54. [PDF, 633KB]

(2009) Dmitry Davidov & Ari Rappoport: Enhancements of lexical concepts using cross-lingual web mining. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.852-861. [PDF, 162KB]

(2009) Dmitry Davidov & Ari Rappoport: Translation and extension of concepts across languages. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.175-183. [PDF, 141KB]

(2009) Beate Dorow, Florian Laws, Lukas Michelbacher, Christian Scheible, & Jason Utt: A graph-theoretic algorithm for automatic extension of translation lexicons. Proceedings of the EACL 2009 Workshop on GEMS: Geometrical Models of Natural Language Semantics, Athens, Greece, 31 March 2009; pp.91-95. [PDF, 81KB]

(2009) Andrea Gesmundo, James Henderson, Paola Merlo, & Ivan Titov: A latent variable model of synchronous syntactic-semantic parsing for multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.37-42. [PDF, 120KB]

(2009) Jesús Giménez & Lluís Màrquez: On the robustness of syntactic and semantic features for automatic MT evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.250-258. [PDF, 171KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

 (2009) Alon Halevy, Peter Norvig, & Fernando Pereira: The unreasonable effectiveness of data. IEEE Intelligent Systems, March-April 2009; pp.8-12. [PDF, 376KB]

(2009) Beth Ann Hockey & Manny Rayner: Using paraphrases of deep semantic representations to support regression testing in spoken dialogue systems. NAACL-HLT-2009: Software Engineering, Testing and Quality Assurance for Natural Language Processing (SETQA-NLP 2009), Proceedings of the Workshop, June 5, 2009, Boulder, Colorado; pp.14-21. [PDF, 139KB]

(2009) Baoli Li, Martin Emms, Saturnino Luz, & Carl Vogel: Exploring multilingual semantic role labeling. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.73-78. [PDF, 140KB]

(2009) Pierre Mahé & Nicola Cancedda: Linguistically enriched word-sequence kernels for discriminative language modeling.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.111-128.

(2009) Ivan Meza-Ruiz & Sebastian Riedel: Multilingual semantic role labelling with Markov logic. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.85-90. [PDF, 150KB]

(2009) Sebastian Padó, Daniel Cer, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Measuring machine translation quality as semantic equivalence: a metric based on entailment features [abstract]. Machine Translation 23 (2/3), 2009; pp.181-193.

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky & Chris Manning: Robust machine translation evaluation with entailment features. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.297-305. [PDF, 437KB]

(2009) Sebastian Padó, Michel Galley, Dan Jurafsky, & Christopher D.Manning: Textual entailment features for machine translation evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.37-41. [PDF, 378KB]

(2009) Davide Picca, Alfio Massimiliano Gliozzo, & Simone Campora: Bridging languages by supersense entity tagging. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.136-142. [PDF, 159KB]

(2009) Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, & Pushpak Bhattacharyya: Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.800-808. [PDF, 221KB]

(2009) Han Ren, Donghong Ji, Jing Wan, & Mingyao Zhang: Parsing syntactic and semantic dependencies for multiple languages with a pipeline approach. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.97-102. [PDF, 228KB]

(2009) Bernard Scott & Anabela Barreiro: OpenLogos MT and the SAL representation language. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 19-26. [PDF, 253KB]

(2009) Oscar Täckström: Multilingual semantic parsing with a pipeline of linear classifiers. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 103KB]

(2009) Yotaro Watanabe, Masayuki Asahara, & Yuji Matsumoto: Multilingual syntactic-semantic dependency parsing with three-stage approximate max-margin linear models. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 727KB]

(2009) Dekai Wu & Pascale Fung: Can semantic role labeling improve SMT? EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.218-225. [PDF, 1141KB]

(2009) Dekai Wu & Pascale Fung: Semantic roles for SMT: a hybrid two-pass model.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.13-16. [PDF, 286KB]

(2008) David Ellis, Mathias Creutz, Timo Honkela, & Mikko Kurimo: Speech to speech machine translation: Biblical chatter from Finnish to English.  IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp. 123-130. [PDF, 104KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008) Nikesh Garera & David Yarowsky: Minimally supervised multilingual taxonomy and translation lexicon induction. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.465-472. [PDF, 356KB]

(2008) Gerhard Kremer, Andrea Abel, & Marco Baroni: Cognitively salient relations for multilingual lexicography. Coling 2008: Proceedings of the Workshop on Cognitive Aspects of the Lexicon, 24 August 2008, Manchester, UK; pp.94-101. [PDF, 84KB]

(2008) Ding Liu & Daniel Gildea: Improved tree-to-string transducer for machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.62-69. [PDF, 208KB]

(2008) Igor Mel’čuk & Leo Wanner: Morphological mismatches in machine translation [abstract]. Machine Translation 22 (3), September 2008; pp.101-152.

(2008) Rajat Kumar Mohanty & Pushpak Bhattacharyya: Lexical resources for semantic extraction.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 148KB]

(2008) Rajat Mohanty, Sandeep Limaye, M.Krishna, & Pushpak Bhattacharyya: Semantic graph from English sentences. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 173KB]

(2008) Petya Osenova, Kiril Simov, & Eelco Mossel: Language resources for semantic document annotation and crosslingual retrieval.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 465KB]

(2008) Manny Rayner, Pierrette Bouillon, Beth Ann Hockey, & Yukie Nakao: Almost flat functional semantics for speech translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.713-720. [PDF, 96KB]

(2008) Yorick Wilks: On whose shoulders? Computational Linguistics 34(4), pp. 471-486. [PDF, 113KB]

(2007) Anja Belz & Sebastian Varges (eds.): MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark. Proceedings; prelims and contents  [PDF, 655KB]

(2007) Pascale Fung, Zhaojun Wu, Yongsheng Yang, & Dekai Wu: Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.75-84 [PDF, 477KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, & Minghui Dong: Semantic transliteration of personal names. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 120-127 [PDF, 152KB]

(2007) Lluís Màrquez, Luis Villarejo, M.A.Martí, & Mariona Taulé: SemEval-2007 task 09: multilevel semantic annotation of Catalan and Spanish. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.42-47 [PDF, 89KB]

(2007) Roser Morante & Bertjan Busser: ILK2: semantic role labelling for Catalan and Spanish using TiMBL. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.183-186 [PDF, 120KB]

(2007) Marcus Sammer & Stephen Soderland: Building a sense-distinguished multilingual lexicon from monolingual corpora and bilingual lexicons. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.399-406 [PDF, 263KB]

(2007) Yuk Wah Wong & Raymond J.Mooney: Generation by inverting a semantic parser that uses statistical machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.172-179 [PDF, 192KB]

(2006) Thierry Declerck, Asunción Gómez Pérez, Ovidiu Vela, Zeno Gantner, & David Manzano-Macho: Multilingual lexical semantic resources for ontology translation. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1492-1495 [PDF, 248KB]

(2006) Gaolin Fang, Hao Yu, & Fumihito Nishino: Chinese-English term translation mining based on semantic prediction.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.199-206. [PDF, 240KB]

(2006) Eduard Hovy: A gentle introduction to ontologies.  Tutorial at AMTA 2006 conference, August 8, 2006, Cambridge, Massachusetts, USA; 60pp. [PDF of PPT presentation, 2050KB]

(2006) Eduard Hovy, Mitchell Marcus, Martha Palmer, Lance Ramshaw & Ralph Weischedel: OntoNotes: the 90% solution.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 57-60 [PDF, 44KB]

(2006) Kentaro Inui, Toru Hirano, Ryu Iida, Atsushi Fujita, & Yuji Matsumoto: Augmenting a semantic verb lexicon with large scale collection of example sentences.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.365-368 [PDF, 283KB]

(2006) Svetla Koeva, Svetlozara Lesseva, & Maria Todorova: Bulgarian sense tagged corpus.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. 5th SALTMIL Workshop on Minority Languages: “Strategies for developing machine translation for minority languages”, Genoa, Italy, 23 May 2006; pp.79-86. [PDF, 494KB]

(2006) Qing Li, Sung-Hyon Myaeng, Yun Jin, & Be-yeong Kang: Concept unification of terms in different languages for IR. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.641-648. [PDF, 234KB]

(2006) Jan Tore Lønning & Stephan Oepen: Re-usable tools for precision machine translation.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.53-56. [PDF, 213KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Stephan Oepen & Jan Tore Lønning: Discriminant-based MRS banking.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1250-1255 [PDF, 430KB]

(2006) Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, & Scott Piao: ASSIST: automated semantic assistance for translators. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.139-142 [PDF, 69KB]

(2006) Sara Stymne & Lars Ahrenberg: A bilingual grammar for translation of English-Swedish verb frame divergences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.9-17 [PDF, 139KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2006) Eiko Yamamoto, Kyoko Kanzaki, & Hitoshi Isahara: Detection of inconsistencies in concept classifications in a large dictionary: toward an improvement of the EDR electronic dictionary.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2325-2330 [PDF, 569KB]

(2005) Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Research on Computing Science, 12). Mexico City: National Polytechnic Institute; 443pp. [full set of abstracts]

(2005) Youjin Chung & Jong-Hyeok Lee: Practical word-sense disambiguation using co-occurring concept codes [abstract]. Machine Translation 19 (1),2005; pp.59-82.

(2005) Mike Dillinger: Interlinguas and semantic roles. MT Summit X, Phuket, Thailand, September 12, 2005, Tutorial Note. [PDF of PPT presentation, 1945KB]

(2005) Andrew Finch, Young-Sook Hwang, & Eiichiro Sumita: Using machine translation evaluation techniques to determine sentence-level semantic equivalence. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 17-24. [PDF, 111KB]

(2005) Dan Flickinger, Jan Tore Lønning, Helge Dyvik, Stephan Oepen, & Francis Bond: SEM-I rational MT: enriching deep grammars with a semantic interface for scalable machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.165-172. [PDF, 249KB]

(2005) Alfio Gliozzo & Carlo Strapparava: Cross language text categorization by acquiring mulitingual domain models from comparable corpora. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 9-16. [PDF, 259KB]

(2005) Toshiyuki Kanamaru, Masaki Murata, Kow Kuroda, & Hitoshi Isahara: Obtaining Japanese lexical units for semantic frames from Berkeley FrameNet using a bilingual corpus. IJCNLP-05: Sixth International Workshop on Linguistically Interpreted Corpora (LINC 2005). Proceedings of the workshop, 15 October 2005, Jeju Island, Korea; pp. 11-20. [PDF, 106KB]

(2005) Marjorie McShane, Sergei Nirenburg, & Stephen Beale: An NLP lexicon as a largely language-independent resource [abstract].  Machine Translation 19 (2), 2005; pp.139-173.

(2005) Rajat Kumar Mohanty, Anupama Dutta, & Pushpak Bhattacharyya: Semantically relatable sets: building blocks for representing semantics. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.101-108. [PDF, 157KB]

(2005) Shigeko Nariyama, Eric Nichols, Francis Bond, Takaaki Tanaka, & Hiromi Nakaiwa: Extracting representative arguments from dictionaries for resolving zero pronouns. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.3-10. [PDF, 665KB]

(2005) Einat H. Nir & Geoffrey L. Melnick: Constant-sense connection paths MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.21-28. [PDF, 147KB]

(2005) Diganta Saha & Sivaji Bandyopadhyay: A semantics-based English-Bengali EBMT system for translating news headlines MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.125-133. [PDF, 429KB]

(2005) R. M. K. Sinha, & Anil Thakur: Dealing with replicative words in Hindi for machine translation to English. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.157-164. [PDF, 298KB]

(2005) Karen Spärck Jones: Some points in a time. Computational Linguistics 31 (1), pp. 1-14. [PDF, 1825KB]

(2005) Bing Zhao, Eric P.Xing, & Alex Waibel: Bilingual word spectral clustering for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 25-32. [PDF, 467KB]

Speech recognition

(2009) Nguyen Bach, Roger Hsiao, Matthias Eck, Paisarn Charoenpornsawat, Stephan Vogel, Tanja Schultz, Ian Lane, Alex Waibel, & Alan W.Black: Incremental adaptation of speech-to-speech translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009;  pp.149-152. [PDF, 111KB]

(2009) L.Baghai-Ravary: Moderating strong accents. Translating and the Computer 31, 19-20 November 2009, London; 13pp. [PDF, 186KB]

(2009) Younès Bahou, Lamia Hadrich Belguith, & Abdelmajid Ben Hamadou: Disfluency and out-of-vocabulary word processing in Arabic speech understanding. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 825KB]

(2009) Llio Humphreys: Automated English subtitling of Welsh TV programs. [SEPLN 2009] SALTMIL 2009, Donostia-San Sebastián, Spain. “Information retrieval and information extraction for less resourced languages”, Donostia-San Sebastián, September 7; pp.63-72. [PDF, 322KB]

(2009) Frederick Jelinek: The dawn of statistical ASR and MT. [Paper given on receipt of the ACL’s Life-time Achievement Award in 2009] Computational Linguistics 35 (4), pp. 483-494 [PDF, 952KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) Aarthi Reddy, Richard Rose, Hani Safadi, Samuel Larkin & Gilles Boulianne: Incorporating knowledge of source language text in a system for dictation of document translations. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 112-119. [PDF, 167KB]

(2009) Sebastian Stüker, Laurent Besacier, & Alex Waibel: Human translations guided language discovery for ASR systems. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) David Suendermann, Jackson Liscombe, Krishna Dayanidhi, & Roberto Pieraccini: Localization of speech recognition in spoken dialog systems: how machine translation can make our lives easier. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) Alain Désilets, Marta Stojanovic, Jean-François Lapointe, Rick Rose, & Aarthi Reddy: Evaluating productivity gains of hybrid ASR-MT systems for translation dictation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.158-165 [PDF, 421KB]; presentation [PDF, 548KB]

(2008) Kathleen Egan, Francis Kubala, & Allen Sears: User-centered development and implementation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.354-363. [PDF of PPT presentation, 1017KB]

(2008) Arnar Jensson, Koji Iwano, & Sadaoki Furui: Development of a speech recognition system for Icelandic using machine translated text. First International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU-2008), Hanoi University of Technology, Hanoi, Vietnam, May 5-7, 2008; pp. 18-21. [PDF, 116KB]

(2008) C.-L. Kao, S.Saleem, R.Prasad, F.Choi, P.Natarajan, D.Stallard, K.Krstovski, & M.Kamali: Rapid development of an English/Farsi speech-to-speech translation system. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.166-173 [PDF, 143KB]; presentation [PDF, 95KB]

(2008) Muntsin Kolss, Stephan Vogel, & Alex Waibel: Stream decoding for simultaneous spoken language translation. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2735-2738; abstract [PDF, 58KB]

(2008) Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, & Robert Frederking: NineOneOne: recognizing and classifying speech for handling minority language emergency calls.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 95KB]

(2008) Alexandre Patry & Philippe Langlais: MISTRAL: a statistical machine translation decoder for speech recognition lattices.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 265KB]

(2008) Michael Paul, Hideo Okuma, Hirofumi Yamamoto, Eiichiro Sumita, Shigeki Matsuda, Tohru Shimizu, & Satoshi Nakamura: Multilingual mobile-phone translation services for world travelers. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.165-168. [PDF, 373KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Martin Franz, Daniel Kiecza, Jerome L.Quinn, Ganesh Ramaswamy, Amit Srivastava, & Paola Virga: Aggregating distributed STT, MT, and information extraction engines: the GALE interoperability-demo system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2743-2746; abstract [PDF, 63KB]

(2008) John F.Pitrelli, Burn L.Lewis, Edward A.Epstein, Jerome L.Quinn, & Ganesh Ramaswamy: A data format enabling interoperation of speech recognition, translation and information extraction engines: the GALE type system. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.1654-1657; abstract [PDF, 67KB]

(2008) Allison L.Powell & Allison Blodgett: The use of machine-generated transcripts during human translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.427-434. [PDF, 649KB]

(2008) Hammam Riza & Oskar Riandi: Toward Asian speech translation system: developing speech recognition and machine translation for Indonesian language. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; 6pp. [PDF, 81KB]

(2008) Sakriani Sakti, Eka Kelana, Hammam Riza, Shinsuke Sakai, Konstantin Markov, & Satoshi Nakamura: Development of Indonesian large vocabulary continuous speech recognition system within A-STAR project. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; 6pp. [PDF, 93KB]

(2008) Hassan Sawaf, Braddock Gaskill, & Michael Veronis: Hybrid machine translation applied to media monitoring. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.440-447. [PDF, 963KB]

(2008) Sebastian Stüker & Alex Waibel: Towards human translations guided language discovery for ASR systems. First International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU-2008), Hanoi University of Technology, Hanoi, Vietnam, May 5-7, 2008; pp. 76-79. [PDF, 218KB]

(2007) Nguyen Bach, Matthais Eck, Paisarn Charoenpornsawat, Thilo Köhler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, & Alan W.Black: The CMU TransTac 2007 eyes-free two-way speech-to-speech translation system. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 489KB]; presentation [PDF, 1043KB]

(2007) Erik A. Chapla: Statistical machine translation of Japanese. MSc thesis, Air Force Institute of Technology, Wright-Patterson Air Force Base, Ohio, March 2007 (AFIT/GE/ENG/07-06). 96pp. [PDF, 1407KB]

(2007) Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arısoy, Murat Saraçlar, & Andreas Stolcke: Analysis of morph-based speech recognition and the modeling of out-of-vocabulary words across languages. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.380-387 [PDF, 136KB]

(2007) Christian Fügen & Muntsin Kolss: The influence of utterance chunking on machine translation performance. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2837-2840; abstract [PDF, 61KB]

(2007) Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, & Hermann Ney: Improving speech translation with automatic boundary prediction. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2449-2452; abstract [PDF, 61KB]

(2007) Aarthi Reddy, Richard Rose, Alain Désilets: Integration of ASR and machine translation models in a document translation task. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2457-2460; abstract [PDF, 61KB]

(2007) Tanja Schultz, Alan W.Black, Sameer Badaskar, Matthew Hornyak, & John Kominek: SPICE: web-based tools for rapid language adaptation in speech processing systems. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2125-2128; abstract [PDF, 23KB]

(2006) Shahram Khadivi, Richard Zens, & Hermann Ney: Integration of speech to computer-assisted translation using finite-state automata. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.467-474. [PDF, 348KB]

(2006) Sameer Maskey, Bowen Zhou, & Yuqing Gao: A phrase-level machine translation approach for disfluency detection using weighted finite state transducers. Interspeech 2006: ICSLP Ninth International Conference on Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1886; abstract [PDF, 81KB]

(2006) Djamal Mostefa, Olivier Hamon, & Khalid Choukri: Evaluation of automatic speech recognition and speech language translation within TC-STAR: results from the first evaluation campaign.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.149-154 [PDF, 388KB]

(2006) Michael Paul: Overview of the IWSLT06 evaluation campaign. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp.1-15  [PDF, 134KB]

(2006) Henk van den Heuvel, Khalid Choukri, Christian Gollan, Asuncion Moreno, & Djamal Mostefa: TC-STAR: new langauge resources for ASR and SLT purposes. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2570-2573 [PDF, 271KB]

(2005) Edouard Geoffrois: Speech recognition evaluation: practices and issues. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 20pp. [PDF from PPT, 148KB]

(2005) Douglas Jones, Edward Gibson, Wade Shen, Neil Granoien, Martha Herzog, Douglas Reynolds, & Clifford Weinstein: Measuring human readability of machine generated text: three case studies in speech recognition and machine translation. Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 18-23, 2005, Philadelphia, PA, USA; vol.5, pp. 1009-1012 [PDF, 233KB]

(2005) Shahram Khadivi, András Zolnay, & Hermann Ney: Automatic text dictation in computer-assisted translation. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2265-2268; abstract [PDF, 55KB]

(2005) M.Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, & Alex Waibel: Document driven machine translation enhanced ASR. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2261-2264; abstract [PDF, 59KB]

(2005) Manny Rayner, Nikos Chatzichrisafis, Pierrette Bouillon, Yukie Nakao, Hitoshi Isahara, Kyoko Kanzaki, Beth Ann Hockey, Marianne Santaholma, & Marianne Starlander: Japanese speech understanding using grammar specialization. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 26-27. [PDF, 54KB]

(2005) Manny Rayner, Pierrette Bouillon, Nikos Chatzichrisafis, Beth Ann Hockey, Marianne Santaholma, Marianne Starlander, Hitoshi Isahara, Kyoko Kanzaki, & Yukie Nakao: A methodology for comparing grammar-based and robust approaches to speech understanding. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.1877-1880 [PDF, 52KB]; abstract [PDF, 55KB]

(2005) L.Rodríguez, J.Civera, E.Vidal, Francisco Casacuberta, & C.Martínez: On the use of speech recognition in computer assisted translation. Interspeech 2005 - Eurospeech: 9th European  Conference on  Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005; pp.2269-2272; abstract [PDF, 54KB]

(2005) Toshiyuki Takezawa, Keiji Yasuda, Masahide Mizushima, & Genichiro Kikui: Assessing degradation of spoken language translation by measuring speech recognizer's output against non-native speakers' listening capabilities. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.203-210. [PDF, 2073KB]

(2005) Ruiqiang Zhang, Genichiro Kikui, Hirofumi Yamamoto, & Wai-Kit Lo: A decoding algorithm for word lattice translation in speech translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 91KB]

(2005) Ruiqiang Zhang, Genichiro Kikui, Hirofumi Yamamoto: Using multiple recognition hypotheses to improve speech translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 7pp. [PDF, 78KB]

Speech synthesis

(2009) Peter Cahill, Jinhua Du, Andy Way, & Julie Carson-Berndsen: Using same-language machine translation to create alternative target sequences for text-to-speech synthesis. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Mohammad Mehdi Arab & Ali Azimizadeh: Construction of a Persian letter-to-sound conversion system based on classification and regression tree. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 676KB]

(2009) Taraka Rama, Anil Kumar Singh, & Sudbeer Kolachina: Modeling letter-to-phoneme conversion as a phrase based statistical machine translation problem with minimum error rate training.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.90-95. [PDF, 404KB]

(2009) Sebastian Stüker, Laurent Besacier, & Alex Waibel: Human translations guided language discovery for ASR systems. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2007) Tanja Schultz, Alan W.Black, Sameer Badaskar, Matthew Hornyak, & John Kominek: SPICE: web-based tools for rapid language adaptation in speech processing systems. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2125-2128; abstract [PDF, 23KB]

(2007) Sebastian Varges: One-way translation: an opportunity for NLG and MT research to interact. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.71-72 [PDF, 497KB]

(2006) Graliński Filip, Jassem Krzysztof, Wagner Agnieszka, & Wypych Mikołaj: Text normalization as a special case of machine translation. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 6-10 November 2006; pp.51-56. [PDF, 318KB]

(2005) Nick Campbell: Speech synthesis evaluation. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 14pp. [PDF from PPT, 57KB]

(2005) Harald Höge: Evaluation of speech synthesis technology. ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 2pp. [PDF from PPT, 16KB]

Syntactic analysis and representation (see also Case grammar and case frames, Chart parsing, Dependency relations, Discontiguous relations, Tagging)

(2009) see Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu and David Chiang, NAACL HLT 2009, Boulder, Colorado, June 5, 2009.

(2009) Takako Aikawa & Achim Ruopp: Chained system: a linear combination of different types of statistical machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.184-189. [PDF, 229KB]

(2009) Vamshi Ambati, Alon Lavie, & Jaime Carbonell: Extraction of syntactic translation models from parallel data using syntax from source and target languages. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.190-197. [PDF, 251KB]

(2009) Vamshi Ambati & Jaime Carbonell: Proactive learning for building machine translation systems for minority languages. NAACL-HLT-2009: Active Learning for Natural Language Processing (ALNLP-09), Proceedings of the workshop, June 5, 2009, Boulder, Colorado; pp.58-61. [PDF, 101KB]

(2009) Xavier Carreras & Michael Collins: Non-projective parsing for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.200-209. [PDF, 174KB]

(2009) Wanxiang Che, Zhenghua Li, Yongqiang Li, Yuhang Guo, Bing Qin, & Ting Liu: Multingual dependency-based syntactic and semantic parsing. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.49-54. [PDF, 633KB]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Jonathan Clark & Greg Hanneman: Czech-to-English translation [on Stat-XFER]. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.29-31. [PDF, 1116KB]

(2009) Fabien Cromières & Sadao Kurohashi: An alignment algorithm using belief propagation and a structure-based distortion model. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.166-174. [PDF, 221KB]

(2009) Xiangyu Duan, Deyi Xiong, Hui Zhang, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 50-54. [PDF, 294KB]; poster [PDF, 40KB]

(2009) Mireia Farrús, Marta R.Costa-jussà, Marc Poch, Adolfo Hernández, & José B.Mariño: Improving a Catalan-Spanish statistical translation system using morphosyntactic knowledge. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.52-57. [PDF, 399KB]

(2009) Rauf Fatullayev, Sevinc Mammadova, & Abulfat Fatullayev: Translating composite sentences in Azerbaijani-English MT system.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.104-113 [abstract]

(2009) Andrew Finch & Eiichiro Sumita: Bidirectional phrase-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1124-1132. [PDF, 164KB]

(2009) Alexander Fraser: Experiments in morphosyntactic processing for translating to and from German.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.115-119. [PDF, 75KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) Michel Galley & Christopher D.Manning: Quadratic-time dependency parsing for machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.773-781. [PDF, 235KB]

(2009) Andrea Gesmundo, James Henderson, Paola Merlo, & Ivan Titov: A latent variable model of synchronous syntactic-semantic parsing for multiple languages. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.37-42. [PDF, 120KB]

(2009) Jesús Giménez & Lluís Màrquez: On the robustness of syntactic and semantic features for automatic MT evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.250-258. [PDF, 171KB]

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

(2009) Greg Hanneman, Vamshi Ambati, Jonathan H.Clark, Alok Parlikar, & Alon Lavie: An improved statistical transfer system for French-English machine translation. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.140-144. [PDF, 101KB]

(2009) Rejwanul Haque, Sudip Kumar Naskar, Yanjun Ma & Andy Way: Using supertags as source language context in SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.234-241. [PDF, 247KB]

(2009) Maria Holmqvist, Sara Stymne, Jody Foo, & Lars Ahrenberg: Improving alignment for SMT by reordering and augmenting the training corpus.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.120-124. [PDF, 102KB]

(2009) Liang Huang, Wenbin Jiang & Qun Liu: Bilingually-constrained (monolingual) shift-reduce parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1222-1231. [PDF, 169KB]

(2009) Gonzalo Iglesias, Adrià de Gispert, Eduardo R.Banga & William Byrne: Rule filtering by pattern for efficient hierarchical translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.380-388. [PDF, 109KB]

(2009) Hongfei Jiang, Muyun Yang, Tiejun Zhao, Sheng Li & Bo Wang: A statistical machine translation model based on a syntactic synchronous grammar. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.125-128. [PDF, 514KB]

(2009) Marcin Junczys-Dowmunt: It’s all about the trees – towards a hybrid syntax-based MT system. Proceedings of the International Multiconference on Computer Science and Information Technology, Mrągowo, Poland, 12-14 October 2009; pp.219-226. [PDF, 125KB]

(2009) Satoshi Kamatani, Tetsuro Chino & Kazuo Sumita: Hybrid spoken language translation using sentence splitting based on syntax structure. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 57-64. [PDF, 167KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) Sadao Kurohashi: Fully syntactic example-based machine translation [abstract]. Invited talk [at] Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; p.1. [PDF, 22KB]; presentation [PDF of PPT, 2353KB]

(2009) Alon Lavie: Stat-XFER: a general framework for search-based syntax-driven MT. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.19-28 [PDF, 740KB]

(2009) Zhifei Li, Chris Callison-Burch, Sanjeev Khudanpur, & Wren Thornton: Decoding Joshua: open source, parsing-based machine translation. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.47-56. [PDF, 146KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevich, Sanjeev Khudanpur, Lane Schwartz, Wren Thornton, Jonathan Weese, & Omar Zaidan: Joshua: open source toolkit for parsing-based machine translation. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 23pp.  [PDF, 933KB]

(2009) Yanjun Ma, Patrik Lambert & Andy Way: Tuning syntactically enhanced word alignment for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.250-257. [PDF, 488KB]

(2009) Pierre Mahé & Nicola Cancedda: Linguistically enriched word-sequence kernels for discriminative language modeling.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.111-128.

(2009) Haitao Mi, Yang Liu, Tian Xia, Xinyan Xiao, Yang Feng, Jun Xie, Hao Xiong, Zhaopeng Tu, Daqi Zheng, Yajuan Lu, & Qun Liu: The ICT statistical machine translation systems for the IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 55-59. [PDF, 393KB]; poster [PDF, 199KB]

(2009) Maja Popović & Hermann Ney: Syntax-oriented evaluation measures for machine translation output.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.29-32. [PDF, 278KB]

(2009) Michael Pust & Kevin Knight: Faster MT decoding through pervasive laziness.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.141-144. [PDF, 108KB]

(2009) Han Ren, Donghong Ji, Jing Wan, & Mingyao Zhang: Parsing syntactic and semantic dependencies for multiple languages with a pipeline approach. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.97-102. [PDF, 228KB]

(2009) Felipe Sánchez-Martínez & Andy Way: Marker-based filtering of bilingual phrase pairs for SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.144-151. [PDF , 386KB]

(2009) Ankit Srivastava, Sergio Penkale, Declan Groves, & John Tinsley: Evaluating syntax-driven approaches to phrase extraction for MT.  Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.19-28. [PDF, 250KB]; presentation [PDF of PPT, 329KB]

(2009) Ankit K.Srivastava & Andy Way: Using percolated dependencies for phrase extraction in SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.316-323. [PDF, 91KB]

(2009) Jun Sun, Min Zhang, & Chew Lim Tan: A non-contiguous tree sequence alignment-based model for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.914-922. [PDF, 459KB]

(2009) John Tinsley & Andy Way: Automatically generated parallel treebanks and their exploitability in machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.1-22.

(2009) Vincent Vandeghinste: Tree-based target language modeling. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.152-159. [PDF, 391KB]

(2009) Ashish Venugopal & Andreas Zollmann: Grammar based statistical MT on Hadoop: an end-to-end toolkit for large scale PSCFG based MT. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.67-78. [PDF, 142KB]

(2009) Ashish Venugopal, Andreas Zollmann, Noah A.Smith, & Stephan Vogel: Preference grammars: softening syntactic constraints to improve statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.37-45. [PDF, 212KB]

(2009) Yotaro Watanabe, Masayuki Asahara, & Yuji Matsumoto: Multilingual syntactic-semantic dependency parsing with three-stage approximate max-margin linear models. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.103-108. [PDF, 727KB]

(2009) Benjamin Wellington, Joseph Turian & I.Dan Melamed: Toward purely discriminative training for tree-structured translation models.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.131-149.

(2009) Xianchao Wu, Takuya Matsuzaki, Naoaki Okazaki, Yusuke Miyao, & Jun’ichi Tsujii: The UOT system: improve string-to-tree translation using head-driven phrase structure grammar and predicate-argument structures. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 99-106. [PDF, 496KB]; poster [PDF, 304KB]

(2009) Deyi Xiong, Min Zhang, Aiti Aw & Haizhou Li: A syntax-driven bracketing model for phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.315-323. [PDF, 441KB]

(2009) Hao Xiong, Wenwen Xu, Haitao Mi, Yang Liu, & Qun Liu: Sub-sentence division for tree-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.137-140. [PDF, 126KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, & Chew Lim Tan: Fast translation rule matching for syntax-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1037-1045. [PDF, 262KB]

(2009) Yi Zhang, Rui Wang, & Stephan Oepen: Hybrid multilingual parsing with HPSG for SRL. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.31-36. [PDF, 115KB]

(2009) Hai Zhao, Wenliang Chen, Jun’ichi Kazama, Kiyotaka Uchimoto, & Kentaro Torisawa: Multilingual dependency learning: exploiting rich features for tagging syntactic and semantic dependencies. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning: Shared Task, June 4, 2009, Boulder, Colorado; pp.61-66. [PDF, 364KB]

(2009) Jing Zheng, Necip Fazil Ayan, Wen Wang, & David Burkett: Using syntax in large-scale audio document translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2008) proceedings of SSST-2: Second Workshop on Syntax and Structure in Statistical Translation, David Chiang and Dekai Wu (editors), ACL-08: HLT, Columbus, Ohio, USA, 20 June 2008.

(2008) Vamshi Ambati & Alon Lavie: Improving syntax driven translation models by re-structuring divergent and non-isomorphic parse tree structures. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.235-244 [PDF, 734KB]

(2008) Eleftherios Avramidis & Philipp Koehn: Enriching morphologically poor languages for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 763-770. [PDF, 685KB]

(2008) Phil Blunsom & Miles Osborne: Probabilistic inference for machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.215-223. [PDF, 283KB]

(2008) Ondřej Bojar, Silvie Cinková, & Jan Ptáček: Towards English-to-Czech MT via tectogrammatical layer. Prague Bulletin of Mathematical Linguistics, no.90, December 2008; pp.57-68. [PDF, 169KB]

(2008) Ondřej Bojar & Jan Hajič: Phrase-based and deep syntactic English-to-Czech statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.143-146. [PDF, 94KB]

(2008) Ondřej Bojar & Adam Lopez: Tree-based translation. [tutorial handout at] Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 7pp. [PDF, 248KB]

(2008) David Burkett & Dan Klein: Two languages are better than one (for syntactic parsing).  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.877-886. [PDF, 405KB]

(2008) Chris Callison-Burch: Syntactic constraints on paraphrases extracted from parallel corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.196-205. [PDF, 187KB]

(2008) Mauro Cettolo, Marcello Federico, Daniele Pighin, & Nicola Bertoldi: Shallow-syntax phrase-based translation: joint versus factored string-to-chunk models. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 56-64. [PDF, 176KB]

(2008) Colin Cherry & Chris Quirk: Discriminative, syntactic language modeling through latent SVMs. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.65-74. [PDF, 609KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Victoria Fossum, Kevin Knight, & Steven Abney: Using syntax to improve word alignment precision for syntax-based machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.44-52. [PDF, 150KB]

(2008) Zhongjun He, Qun Liu, & Shouxun Lin: Improving statistical machine translation using lexicalized rule selection.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.321-328. [PDF, 134KB]

(2008) Qun Liu, Zhongjun He, Yang Liu, & Shouxun Lin: Maximum entropy based rule selection model for syntax-based statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.89-97. [PDF, 916KB]

(2008) Yang Liu, Zhongjun He, Haitao Mi, Yun Huang, Yang Feng, Wenbin Jiang, Yajuan Lu, & Qun Liu: The ICT system description for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 52-57. [PDF, 201KB]; presentation [PDF, 531KB]

(2008) Yanjun Ma, John Tinsley, Hany Hassan, Jinhua Du, & Andy Way: Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 26-33. [PDF, 198KB]; presentation [PDF, 559KB]

(2008) Yuval Marton & Philip Resnik: Soft syntactic constraints for hierarchical phrased-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1003-1011. [PDF, 153KB]

(2008) Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, & Joakim Nivre: Swedish-Turkish parallel treebank. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 80KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2008) Toshiaki Nakazawa & Sadao Kurohashi: Linguistically-motivated tree-based probabilistic phrase alignment. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.163-171. [PDF, 603KB]

(2008) Preslav Nakov: Improving English-Spanish statistical machine translation: experiments in domain adaptation, sentence paraphrasing, tokenization, and recasing. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.147-150. [PDF, 110KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Ananthakrishnan Ramanathan, Pushpak Bhattacharyya, Jayprasad Hegde, Ritesh M.Shah, & Sasikumar M: Simple syntactic and morphological processing can help English-Hindi statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.513-520. [PDF, 368KB]

(2008) Germán Sanchis-Trilles & Joan Andreu Sánchez: Using parsed corpora for estimating stochastic inversion transduction grammars.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 3pp. [PDF, 48KB]

(2008) Lei Shi & Ming Zhou: Improved sentence alignment on parallel web pages using a stochastic tree alignment model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.505-513. [PDF, 483KB]

(2008) Anders Søgaard: Range concatenation grammars for translation. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.103-106. [PDF, 130KB]

(2008) Kathrin Spreyer, Jonas Kuhn, & Bettina Schrader: Identification of comparable argument-head relations in parallel corpora.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 138KB]

(2008) Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201. [PDF, 614KB]

(2008) David Vilar, Daniel Stein, & Hermann Ney: Analysing soft syntax features and heuristics for hierarchical phrase based machine translation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.190-197 [PDF, 181KB]; presentation [PDF, 112KB]

(2008) Yushi Xu & Stephanie Seneff: Two-stage translation: a combined linguistic and statistical machine translation framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.222-231 [PDF, 671B]

(2008) Vineet Yadav, Sriram Venkatapathy, & Karthik Gali: Comparative analysis of phrasal cohesion of English-Hindi with English-French. ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 10pp. [PDF, 76KB]

(2008) Daniel Zeman & Philip Resnik: Cross-language parser adaptation between related languages. IJCNLP 2008: Workshop on NLP for Less Privileged Languages. Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.35-42. [PDF, 150KB]

(2008) Dakun Zhang, Le Sun, & Wenbo Li: A structured prediction approach for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.649-654. [PDF, 521KB]

(2008) Bing Zhao & Yaser Al-Onaizan: Generalizing local and non-local word-reordering patterns for syntax-based machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581. [PDF, 538KB]

(2008) Andreas Zollmann, Ashish Venugopal, & Stephan Vogel: The CMU syntax-augmented machine translation system: SAMT on Hadoop with n-best alignments.  IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 18-25. [PDF, 208KB]; presentation [PDF, 109KB]

(2008) Simon Zwarts & Mark Dras: Choosing the right translation: a syntactically informed classification approach. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1153-1160. [PDF, 194KB]

(2008) Simon Zwarts & Mark Dras: Morphosyntactic target language matching in statistical machine translation. Australasian Language Technology Workshop 2008 (ALTW2008): Proceedings of the Workshop, 8-10 December 2008, CSIRO ICT Centre, Hobart; pp.169-177. [PDF, 62KB]

(2007) proceedings of SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY.

(2007) Alexandra Birch, Miles Osborne, & Philipp Koehn: CCG supertags in factored statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 9-16 [PDF, 175KB]

(2007) Matthias Buch-Kromann: Breaking the barrier of context-freeness: towards a linguistically adequate probabilistic dependency model of parallel texts. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.31-40 [PDF, 432KB]; presentation [PDF, 1285KB]

(2007) Michael Carl & Paul Schmidt: A modular architecture for separating hypothesis formation from hypothesis evaluation in data-driven machine translation.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 155KB]

(2007) Wen-Han Chao & Zhou-Jun Li: Incorporating constituent structure constraint into discriminative word alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.97-103 [PDF, 112KB]

(2007) Boxing Chen, Jun Sun, Hongfei Jiang, Min Zhang, & Ai Ti Aw: I2R Chinese-English translation system for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 91KB]; presentation [PDF, 204KB]

(2007) David Chiang: Hierarchical phrase-based translation. Computational Linguistics 33 (2), pp. 202-228. [PDF, 597KB]

(2007) Josep M. Crego & José B. Mariño: Syntax-enhanced n-gram-based SMT. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.111-118 [PDF, 188KB]

(2007) Steve DeNeefe, Kevin Knight, Wei Wang & Daniel Marcu: What can syntax-based MT learn from phrase-based MT? EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 755-763. [PDF, 240KB]

(2007) Peter Dirix, Vincent Vandeghinste, & Ineke Schuurman: Demonstration of the Dutch-to-English METIS-II MT system. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.53-54 [PDF, 316KB]; presentation [PDF, 6104KB]

(2007) Yvette Graham, Deirdre Hogan, & Josef van Genabith: Automatic evaluation of generation and parsing for machine translation with automatically acquired transfer rules. MT Summit XI Workshop: Using corpora for natural language generation:  language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.5-12 [PDF, 1796KB]

(2007) Nizar Habash: Syntactic preprocessing for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.215-222 [PDF, 118KB]

(2007) Jan Hajič: Tree-based translation with tectogrammatical representation. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 32pp. [PDF of PPT presentation, 695KB]

(2007) Hany Hassan, Khalil Sima’an, & Andy Way: Supertagged phrase-based statistical machine translation.  ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 288-295 [PDF, 175KB]

(2007) Zhongjun He, Haitao Mi, Yang Liu, Deyi Xiong, Weihua Luo, Yun Huang, Zhixiang Ren, Yajuan Lu, & Qun Liu: The ICT statistical machine translation systems for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 102KB]; presentation [PDF, 2153KB]

(2007) Hieu Hoang: Asynchronous factored translation. First Machine Translation Marathon, Edinburgh, April 16-20, 2007;  18pp. [PDF of PPT presentation, 33KB]

(2007) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Getting to know Moses: initial experiments on German-English factored translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 181-184 [PDF, 123KB]

(2007) Mark Hopkins & Jonas Kuhn: Deep grammars in a tree labeling approach to syntax-based statistical machine translation. ACL 2007: proceedings of the Workshop on Deep Linguistic Processing, Prague, Czech Republic, 28 June 2007; pp.33-40 [PDF, 156KB]

(2007) Mark Hopkins & Jonas Kuhn: Machine translation as tree labeling. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 79pp. [PDF of PPT presentation, 454KB]

(2007) Ian Lane, Andreas Zollmann, Thuy Linh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying Zhang, & Alex Waibel: The CMU-UKA statistical machine translation systems for IWSLT 2007. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 7pp. [PDF, 150KB]; presentation [PDF, 393KB]

(2007) Irene Langkilde-Geary: Declarative syntactic processing of natural language using concurrent constraint programming and probabilistic dependency modeling. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.55-63 [PDF, 1969KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Alignment-guided chunking. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.114-121 [PDF, 357KB]; presentation [PDF, 377KB]

(2007) Jonathan May & Kevin Knight: Syntactic re-alignment models for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 360-368. [PDF, 180KB]

(2007) Preslav Nakov & Marti Hearst: UCB system description for the WMT 2007 shared task. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 212-215 [PDF, 129KB]

(2007) Joakim Nivre: Data-driven dependency parsing across languages and domains: perspectives from the CoNLL 2007 Shared Task. IWPT 2007: Proceedings of the 10th International Conference on Parsing Technologies, June 23-24, 2007, Prague, Czech Republic; pp.168-170. [PDF, 69KB]

(2007) M. Paulik, K.Rottmann, J.Niehues, S.Hildebrand, & S.Vogel: The ISL phrase-based MT system for the 2007 ACL Workshop on statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 197-202 [PDF, 175KB]

(2007) Nasredine Semmar & Christian Fluhr: Arabic to French sentence alignment: exploration of a cross-language information retrieval approach. ACL 2007: proceedings of the Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources, Prague, Czech Republic, 28 June 2007; pp. 73-80 [PDF, 156KB]

(2007) Jiadong Sun, Tiejun Zhao, & Huashen Liang: Meta-structure transformation model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 64-71 [PDF, 172KB]

(2007) Akira Ushioda: Phrase alignment for integration of SMT and RBMT resources. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.8-12. [PDF, 66KB]

(2007) Akira Ushioda: Phrase alignment based on bilingual parsing. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.241-250 [PDF, 312KB]

(2007) Ashish Venugopal & Andreas Zollmann: Hierarchical and syntax structured MT. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 52pp. [PDF of PPT presentation, 383KB]

(2007) Ashish Venugopal, Andreas Zollmann, & Stephan Vogel: An efficient two-pass approach to synchronous-CFG driven statistical MT. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.500-507 [PDF, 279KB]

(2007) Eric Wehrli: Fips, a “deep” linguistic multilingual parser. ACL 2007: proceedings of the Workshop on Deep Linguistic Processing, Prague, Czech Republic, 28 June 2007; pp.120-127 [PDF, 102KB]

(2007) Fai Wong : The parsing algorithm of translation corresponding tree (TCT) grammar . Third Asian-Pacific Congress on Computational Mechanics (APCOM'07) in conjunction with Eleventh International Conference on the Enhancement and Promotion of Computational Methods in Engineering and Science (EPMESC XI), December 3-6, 2007, Kyoto, Japan; 10pp. [PDF, 294KB]

(2007) Yokoyama Shoichi & Kennendai Shigehiro: Error correcting system for analysis of Japanese patent sentences. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.24-27. [PDF, 73KB]

(2007) Andreas Zollmann, Ashish Venugopal, Matthias Paulik, & Stephan Vogel: The syntax augmented MT (SAMT) system for the shared task in the 2007 ACL workshop on statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 216-219 [PDF, 163KB]

(2006) Giuseppe Attardi: Experiments with a multilanguage non-projective dependency parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.166-170. [PDF, 104KB]

(2006) Sabine Buchholz & Erwin Marsi: CoNLL-X shared task on multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.149-164. [PDF, 158KB]

(2006) Aoife Cahill: [review of] Harry Bunt, John Carroll, Giorgio Satta (eds.), New developments in parsing technology. Machine Translation 20 (2),2006; pp.139-142. [see publication]

(2006) Michael Carl & Ecaterina Rascu: A dictionary lookup strategy for translating of discontinuous phrases. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.49-57 [PDF, 197KB]

(2006) Boxing Chen, Mauro Cettolo, & Marcello Federico: Reordering rules for phrase-based statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 182-189 [PDF, 319KB]

(2006) Yuchang Cheng, Masayuki Asahara & Yuji Matsumoto: Multi-lingual dependency parsing at NAIST.  CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.191-195. [PDF, 192KB]

(2006) Simon Corston-Oliver & Anthony Aue: Dependency parsing with reference to Slovene, Spanish and Swedish. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.196-200. [PDF, 286KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

(2006) Brooke Cowan, Ivona Kučerová, & Michael Collins: A discriminative model for tree-to-tree translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 232-241. [PDF, 338KB]

(2006) John DeNero, Dan Gillick, James Zhang, & Dan Klein: Why generative phrase models underperform surface heuristics.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 31-38 [PDF, 393KB]

(2006) Yuan Ding & Martha Palmer: Better learning and decoding for syntax based SMT using PSDIG. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.37-45 [PDF, 391KB]

(2006) Anas El Isbihani, Shahram Khadivi, Oliver Bender, & Hermann Ney: Morpho-syntactic Arabic preprocessing for Arabic-to-English statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 15-22 [PDF, 156KB]

(2006) Saša Hasan, Oliver Bender, & Hermann Ney: Reranking translation hypotheses using structural properties.  EACL-2006: Proceedings of the Workshop on Learning Structured Informationa in Natural Language Applications, April 3, 2006, Trento, Italy; pp.41-48. [PDF, 100KB]

(2006) Mark Hopkins & Jonas Kuhn: A framework for incorporating alignment information in parsing. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.9-16 [PDF, 311 KB]

(2006) Bryant Huang & Kevin Knight: Relabeling syntax trees to improve syntax-based machine translation quality.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 240-247 [PDF, 747KB]

(2006) Liang Huang: Efficient algorithms for richer formalisms: parsing and machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 223-226 [PDF, 87KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Richard Johansson & Pierre Nugues: Investigating multilingual dependency parsing. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.206-210. [PDF, 75KB]

(2006) Mamoru Komachi, Yuji Matsumoto & Masaaki Nagata: Phrase reordering for statistical machine translation based on predicate-argument structure. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 77-82 [PDF, 139KB]

(2006) Philippe Langlais & Fabrizio Gotti: Phrase-based SMT in shallow tree-phrases.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 39-46 [PDF, 190KB]

(2006) Yang Liu, Qun Liu, & Shouxun Lin: Tree-to-string alignment template for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.609-616. [PDF, 392KB]

(2006) Mohamed Maamouri, Ann Bies, & Seth Kulick: Diacritization: a challenge to Arabic treebank annotation and parsing.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.35-47 [PDF, 223KB]

(2006) Ryan McDonald, Kevin Lerman, & Fernando Pereira: Multilingual dependency analysis with a two-stage discriminative parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.216-220. [PDF, 79KB]

(2006) Daniel Marcu, Wei Wang, Abdessamad Echihabi, & Kevin Knight: SPMT: statistical machine translation with syntactified target language phrases. EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 44-52. [PDF, 164KB]

(2006) Bart Mellebeek, Karolina Owczarzak, Declan Groves, Josef Van Genabith, & Andy Way: A syntactic skeleton for statistical machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.195-202

(2006) Arul Menezes, Kristina Toutanova, & Chris Quirk: Microsoft Research treelet translation system: NAACL 2006 Europarl evaluation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 158-161 [PDF, 676KB]

(2006) Slim Mesfar: Standard Arabic formalization and linguistic platform for its analysis. The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.84-94. [PDF, ]

(2006) Thai Phuong Nguyen & Akira Shimazu: Improving phrase-based statistical machine translation with morphosyntactic transformation [abstract]. Machine Translation 20 (3),2006; pp.147-166.

(2006) Maja Popović, Adrià de Gispert, Deepa Gupta, Patrik Lambert, Hermann Ney, José B. Mariño, Marcello Federico, & Rafael Banchs: Morpho-syntactic information for automatic error analysis of statistical machine translation output.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 1-6 [PDF, 306KB]

(2006) Chris Quirk & Arul Menezes: Do we need phrases? Challenging the conventional wisdom in statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 9-16 [PDF, 97KB]

(2006) Chris Quirk & Simon Corston-Oliver: The impact of parse quality on syntactically-informed statistical machine translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 62-69. [PDF, 203KB]

(2006) Owen Rambow, Bonnie Dorr, David Farwell, Rebecca Green, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Keith J.Miller, Teruko Mitamura, Florence Reeder, & Advaith Siddharthan: Parallel syntactic annotation of multiple languages.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.559-564 [PDF, 286KB]

(2006) Sebastian Riedel, Ruket Ҫakıcı, & Ivan Meza-Ruiz: Multi-lingual dependency parsing with incremental integer linear programming. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.226-230. [PDF, 72KB]

(2006) J.A.Sánchez & J.M.Benedí: Stochastic inversion transduction grammars for obtaining word phrases for phrase-based statistical machine translation. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 130-133 [PDF, 91KB]

(2006) J. A. Sánchez & J. M. Benedí: Obtaining word phrases with stochastic inversion translation grammars for phrase-based statistical machine translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.179-186 [PDF, 129KB]

(2006) Violeta Seretan & Eric Wehrli: Accurate collocation extraction using a multilingual parser. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.953-960. [PDF, 168KB]

(2006) Svetlana Sheremetyeva: Handling low translatability in machine translation of long sentences. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.105-114 [PDF, 745KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Daniel Stein, Jan Bungeroth, & Hermann Ney: Morpho-syntax based statistical methods for automatic sign language translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.169-177 [PDF, 206KB]

(2006) Martin Volk: How bad is the problem of PP-attachment? A comparison of English, German and Swedish. Proceedings of the Third ACL-SIGSEM Workshop on Prepositions, Trento, Italy, April 2006; pp.81-88. [PDF, 133KB]

(2006) Taro Watanabe, Hajime Tsukada, & Hideki Isozaki: Left-to-right target generation for hierarchical phrase-based translation.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.777-784. [PDF, 224KB]

(2006) Eric Wehrli: TwicPen: hand-held scanner and translation software for non-native readers. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Interactive Presentation Sessions, Sydney, July 2006; pp.61-64. [PDF, 229KB]

(2006) Benjamin Wellington, Joseph Turian, Chris Pike & I.Dan Melamed: Scalable purely-discriminative training for word and tree transducers. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.251-260 [PDF, 176KB]

(2006) Benjamin Wellington, Sonja Waxmonsky, & I.Dan Melamed: Empirical lower bounds on the complexity of translational equivalence. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.977-984. [PDF, 121KB]

(2006) Yuk Wah Wong & Raymond J. Mooney: Learning for semantic parsing with statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 439-446 [PDF, 205KB]

(2006) Xiaohong Wu, Sylviane Cardey, & Peter Greenfield: Realization of the Chinese BA-construction in an English-Chinese machine translation system. Coling-ACL 2006: Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, Sydney, 22-23 July 2006; pp.79-86. [PDF, 423KB]

(2006) Hao Zhang & Daniel Gildea: Efficient search for inversion transduction grammar.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 224-231. [PDF, 172KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

(2006) Andreas Zollmann, Ashish Venugopal, Stephan Vogel, & Alex Waibel: The CMU-UKA syntax augmented machine translation system for IWSLT-06. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 138-144 [PDF, 218KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

 (2005) Andrea Burbank, Marine Carpuat, Stephen Clark, Markus Dreyer, Pamela Fox, Declan Groves, Keith Hall, Mary Hearne, I. Dan Melamed, Yihai Shen, Andy Way, Ben Wellington, & Dekai Wu: Final report of the 2005 Language Enginering Workshop on statistical machine translation by parsing, Johns Hopkins University, Center for Speech and Language Processing, November 12, 2005. 67pp. [PDF, 645KB]

(2005) David Chiang, Adam Lopez, Nitin Madnani, Christof Monz, Philip Resnik, & Michael Subotin: The Hiero machine translation system: extensions, evaluation and analysis. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 779-786. [PDF, 227KB]

(2005) Michael Collins, Philipp Koehn, & Ivona Kučerová: Clause restructuring for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 531-540. [PDF, 129KB]

(2005) Steve DeNeefe, Kevin Knight, & Hayward H. Chan: Interactively exploring a machine translation model. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 97-100. [PDF, 123KB]

(2005) Kuntal Dey & Pushpak Bhattacharyya: Universal Networking Language based analysis and generation for Bengali case structure constructs. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.215-229 [abstract, PDF, 88KB]

(2005) Adrià de Gispert: Phrase linguistic classification and generalization for improving statistical machine translation. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 67-72. [PDF, 90KB]

(2005) Kenji Imamura, Hideo Okuma, & Eiichiro Sumita: Practical approach to syntax-based statistical machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.267-274. [PDF, 165KB]

(2005) Yookyung Kim, Jun Huang, Youssef Billawala, Demitrios Master, & Farzad Ehsani: Sehda S2MT: incorporation of syntax into statistical translation system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 6pp. [PDF, 116KB]

(2005) Philipp Koehn: Statistical machine translation, lecture 5: syntax-based models. ESSLLI-2005: 17th European Summer School in Logic, Language and Information, Heriot-Watt University, Edinburgh, Scotland, 8-19 August 2005; 7pp. [PDF of PPT presentation, 70KB]

(2005) Christina Lioma & Iadh Ounis: Deploying part-of-speech patterns to enhance statistical phrase-based machine translation resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 163-166. [PDF, 118KB]

(2005) Ding Liu & Daniel Gildea: Syntactic features for evaluation of machine translation. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-32. [PDF, 158KB]

(2005) Sudip Kumar Naskar & Sivaji Bandyopadhyay: A phrasal EBMT system for translating English to Bengali. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.372-379. [PDF, 263KB]

(2005) Patrick Paroubek: Evaluation of text analysis core technologies. Two successful examples: evaluating POS taggers and parsers for French.  ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 35pp. [PDF from PPT, 288KB]

(2005) Manny Rayner, Nikos Chatzichrisafis, Pierrette Bouillon, Yukie Nakao, Hitoshi Isahara, Kyoko Kanzaki, Beth Ann Hockey, Marianne Santaholma, & Marianne Starlander: Japanese speech understanding using grammar specialization. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 26-27. [PDF, 54KB]

(2005) Yohsuke Sakao, Takahiro Ikeda, Kenji Satoh & Susumu Akamine: Japanese language analaysis for syntactic tree mining to extract characteristic contents. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.339-345. [PDF, 159KB]

(2005) R. Mahesh K. Sinha & Anil Thakur: Divergence patterns in machine translation between Hindi and English. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.346-353. [PDF, 288KB]

(2005) Evgeny Matusov, Stephan Kanthak, & Hermann Ney: Efficient statistical machine translation with constrained reordering. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 181-188 [PDF, 110KB]

(2005) Maja Popovic & Hermann Ney: Exploiting phrasal lexica and additional morpho-syntactic language resources for statistical machine translation with scarce training data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 212-218. [PDF, 65KB]

(2005) Gábor Prószéky: An approach to machine translation via the rule-to-rule hypothesis. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 219-225. [PDF, 147KB]

(2005) Dekai Wu: Statistical machine translation part II: tree-based SMT. [Tutorial at] IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; p.276. [PDF, 33KB]

Syntactic transfer

(2009) Ondřej Bojar: Bad news, NLP hacking and feature fishing. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.59-61 [PDF, 178KB]

(2009) Ekaterina Ershova: Syntactic problems in French-Russian machine translation of periodicals.  ISMTCL: International Symposium on Data and Sense Mining, Machine Translation and Controlled Languages, and their application to emergencies and safety critical domains, July 1-3, 2009, Centre Tesnière, University of Franche-Comté, Besançon, France (Presses universitaires de Franche-Comté, 2009); pp.100-103 [abstract]

(2009) Daniel Galron, Sergio Penkale, Andy Way, & I.Dan Melamed: Accuracy-based scoring for DOT: towards direct error minimization for data-oriented translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.371-380. [PDF, 164KB]

(2009) Declan Groves & Dag Schmidtke: Identification and analysis of post-editing patterns for MT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 429-436. [PDF, 304KB]

(2009) Nizar Habash & Jun Hu: Improving Arabic-Chinese statistical machine translation using English as pivot language. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.173-181. [PDF, 265KB]

(2009) Jan Hajič: Tree-based machine translation using syntax and semantics. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 23pp. [PDF of PPT, 543KB]

(2009) Liang Huang, Hao Zhang, David Gildea & Kevin Knight: Binarization of synchronous context-free grammars. Computational Linguistics 35 (4), pp. 559-595 [PDF, 837KB]

(2009) Hongfei Jiang, Sheng Li, Muyun Yang, & Tiejun Zhao: A study of translation rule classification for syntax-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.45-50. [PDF, 466KB]

(2009) Marcin Junczys-Dowmunt: It’s all about the trees – towards a hybrid syntax-based MT system. Proceedings of the International Multiconference on Computer Science and Information Technology, Mrągowo, Poland, 12-14 October 2009; pp.219-226. [PDF, 125KB]

(2009) Jeremy G.Kahn, Matthew Snover, & Mari Ostendorf: Expected dependency pair match: predicting translation quality with expected syntactic structure [abstract]. Machine Translation 23 (2/3), September 2009; pp.169-179.

(2009) Zhifei Li: Training and inference methods over translation forests [abstract]. CWMT 2009: the 5th China Workshop on Machine Translation, Nanjing, China, October 16-17, 2009; 1p. [PDF, 14KB]

(2009) Yang Liu, Yajuan Lü & Qun Liu: Improving tree-to-tree translation with packed forests.  [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.558-566. [PDF, 151KB]

(2009) Martin Popel & Zdeněk Žabokrtský: Improving English-Czech tectogrammatical MT. Prague Bulletin of Mathematical Linguistics, no.92, December 2009; pp.115-134 [PDF, 265KB]

(2009) Felipe Sánchez-Martínez & Mikel L.Forcada: Inferring shallow-transfer machine translation rules from small parallel corpora. Journal of Artificial Intelligence Research 34, pp. 605-635. [PDF, 706KB]

(2009) Vincent Vandeghinste & Scott Martens: Top-down transfer in example-based MT. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.69-76. [PDF, 359KB]

(2009) István Varga & Soichi Yokoyama: Transfer rule generation for a Japanese-Hungarian machine translation system. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.356-362. [PDF, 152KB]

(2009) Billy Wong & Chunyu Kit: ATEC: automatic evaluation of machine translation via word choice and word order [abstract]. Machine Translation 23 (2/3), September 2009; pp.141-151.

 (2008) Ondřej Bojar: Exploiting linguistic data in machine translation. Doctoral thesis, Charles University, Faculty of Mathematics and Physics, Prague 2008; 135pp. [PDF, 1045KB]

(2008) Ondřej Bojar: Problems of deep syntactic English-to-Czech MT. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 33 slides. [PDF, 248KB]

(2008) Ondřej Bojar & Jan Hajič: Phrase-based and deep syntactic English-to-Czech statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.143-146. [PDF, 94KB]

(2008) Ondřej Bojar & Adam Lopez: Tree-based translation. [tutorial handout at] Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 7pp. [PDF, 248KB]

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

(2008) Colin Cherry: Cohesive phrase-based decoding for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 72-80. [PDF, 179KB]

(2008) Christopher Dyer, Smaranda Muresan, & Philip Resnik: Generalizing word lattice translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1012-1020 . [PDF, 201KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Greg Hanneman, Edmund Huber, Abhaya Agarwal, Vamshi Ambati, Alok Parlikar, Erik Peterson, & Alon Lavie: Statistical transfer systems for French-English and German-English machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.163-166. [PDF, 72KB]

(2008) Alon Lavie, Alok Parlikar, & Vamshi Ambati: Syntax-driven learning of sub-sentential translation equivalents and translation rules from parsed parallel corpora. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.87-95. [PDF, 1610KB]

(2008) Ding Liu & Daniel Gildea: Improved tree-to-string transducer for machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.62-69. [PDF, 208KB]

(2008) David Mareček, Zdeněk Žabokrtský, & Václav Novák: Automatic alignment of Czech and English deep syntactic dependency trees. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.104-113. [PDF, 586KB]

(2008) Arul Menezes & Chris Quirk: Syntactic models for structural word insertion and deletion.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.735-744. [PDF, 178KB]

(2008) Haitao Mi, Liang Huang, & Qun Liu: Forest-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 192-199. [PDF, 185KB]

(2008) Haitao Mi & Liang Huang: Forest-based translation rule extraction.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.206-214. [PDF, 170KB]

(2008) Thai Phuong Nguyen, Akira Shimazu, Tu-Bao Ho, Minh Le Nguyen, & Vinh Van Nguyen: A tree-to-string phrase-based model for statistical machine translation. CoNLL: proceedings of the  Twelfth Conference on Computational Natural Language Learning, 16-17 August 2008, Manchester, UK; pp.143-150. [PDF, 392KB]

(2008) Attila Novák, László Tihanyi, & Gábor Prószéky: The MetaMorpho translation system. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.111-114.  [PDF, 169KB]

(2008) Michael Subotin: Generalizing local translation models. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.28-36. [PDF, 148KB]

(2008) Christoph Tillmann: A rule-driven dynamic programming decoder for statistical MT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.37-45. [PDF, 152KB]

(2008) Zdeněk Žabokrtský, Jan Ptáček, & Petr Pajas: TectoMT: highly modular MT system with tectogrammatics used as transfer layer. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.167-170. [PDF, 136KB]

(2007) Eckard Bick: Dan2eng: wide-coverage Danish-English machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.37-43 [PDF, 621KB]

(2007) Rens Bod: Unsupervised syntax-based machine translation: the contribution of discontiguous phrases. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.51-56 [PDF, 87KB]

(2007) Yvette Graham, Deirdre Hogan, & Josef van Genabith: Automatic evaluation of generation and parsing for machine translation with automatically acquired transfer rules. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.5-12 [PDF, 1796KB]

(2007) Jan Hajič: Tree-based translation with tectogrammatical representation. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 32pp. [PDF of PPT presentation, 695KB]

(2007) Mary Hearne, John Tinsley, Ventsislav Zhechev, & Andy Way: Capturing translational divergences with a statistical tree-to-tree aligner. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.85-94 [PDF, 394KB]; presentation [PDF, 462KB]

(2007) Mark Hopkins & Jonas Kuhn: Machine translation as tree labeling. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.41-48 [PDF, 214KB]

(2007) Krzysztof Jassem & Tomasz Kowalski: Machine translation using scarce bilingual corpora. TASK Quarterly 11, no.1-2, 21-33. [PDF, 216KB]

(2007) Yang Liu, Yun Huang, Qun Liu, & Shouxun Lin: Forest-to-string statistical translation rules. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 704-711 [PDF, 198KB]

(2007) Arul Menezes & Chris Quirk: Using dependency order templates to improve generality in translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 1-8 [PDF, 153KB]

(2007) Felipe Sánchez-Martínez & Mikel L.Forcada: Automatic induction of shallow-transfer rules for open-source machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.181-190 [PDF, 336KB]; poster [PDF, 29KB]

(2007) Jiadong Sun, Tiejun Zhao, & Huashen Liang: Meta-structure transformation model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 64-71 [PDF, 172KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2007) John Tinsley, Ventsislav Zhechev, Mary Hearne, & Andy Way: Robust language pair-independent sub-tree alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.467-474 [PDF, 1310KB]

(2007) Vincent Vandeghinste, Peter Dirix, & Ineke Schuurman: The effect of a few rules on a data-driven MT system. METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 154KB]

(2007) Vincent Vandeghinste: Removing the distinction between a translation memory, a bilingual dictionary and a parallel corpus. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 21pp. [PDF, 90KB]

(2007) Min Zhang, Hongfei Jiang, Ai Ti Aw, Jun Sun, Sheng Li, & Chew Lim Tan: A tree-to-tree alignment-based model for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.535-542 [PDF, 243KB]

(2006) Yaser Al-Onaizan & Kishore Papineni: Distortion models for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.529-536. [PDF, 135KB]

(2006) Brooke Cowan, Ivona Kučerová, & Michael Collins: A discriminative model for tree-to-tree translation.  EMNLP-2006: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, Sydney, Australia, July 2006; pp. 232-241. [PDF, 338KB]

(2006) Lea Cyrus: Building a resource for studying translation shifts.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.697-702 [PDF, 358KB]

(2006) Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, & Ignacio Thayer: Scalable inference and training of context-rich syntatic translation models.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.961-968. [PDF, 438KB]

(2006) Mary Hearne & Andy Way: Disambiguation strategies for data-oriented translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.59-68 [PDF, 164KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Liang Huang, Kevin Knight, & Aravind Joshi: A syntax-directed translator with extended domain of locality. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.1-8. [PDF, 120KB]

(2006) Krzysztof Jassem & Kowalski Tomasz: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Philippe Langlais & Fabrizio Gotti: Phrase-based SMT in shallow tree-phrases.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 39-46 [PDF, 190KB]

(2006) Yang Liu, Qun Liu, & Shouxun Lin: Tree-to-string alignment template for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.609-616. [PDF, 392KB]

(2006) Zhanyi Liu, Haifeng Wang, & Hua Wu: Example-based machine translation on tree-string correspondences and statistical generation [abstract]. Machine Translation 20 (1),2006; pp.25-41.

(2006) Igor Mel’čuk & Leo Wanner: Syntactic mismatches in machine translation [abstract]. Machine Translation 20 (2),2006; pp.81-138.

(2006) Thai Phuong Nguyen & Akira Shimazu: Improving phrase-based statistical machine translation with morpho-syntactic analysis and transformation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.138-147 [PDF, 282KB]

(2006) Karolina Owczarzak, Bart Mellebeek, Declan Groves, Josef Van Genabith & Andy Way: Wrapper syntax for example-based machine translation.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.148-155 [PDF, 248KB]

(2006) Stefan Riezler & John T. Maxwell III: Grammatical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 248-255 [PDF, 299KB]

(2006) Koichiro Ryu, Shigeki Matsubara, & Yasuyoshi Inagaki: Simultaneous English-Japanese spoken language translation based on incremental dependency parsing and transfer. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.683-690. [PDF, 420KB]

(2006) David A. Smith & Jason Eisner: Quasi-synchronous grammars: alignment by soft projection of syntactic dependencies.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 23-30 [PDF, 207KB]

(2006) Mihaela Vela & Silvia Hansen-Schirra: The use of multi-level annotation and alignment for the translator. Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 19pp. [PDF, 187KB]

(2005) Antonio M.Corbi-Bellot, Mikel L. Forcada, Sergio Ortíz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, & Kepa Sarasola: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 79-86. [PDF, 130KB]

(2005) Yuan Ding & Martha Palmer: Machine translation using probabilistic synchronous dependency insertion grammars.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 541-548. [PDF, 369KB]

(2005) Heidi J. Fox: Dependency-based statistical machine translation.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 91-96. [PDF, 101KB]

(2005) Martin Jansche: Treebank transfer. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.74-82. [PDF, 120KB]

 (2005) Chris Quirk, Arul Menezes, & Colin Cherry: Dependency treelet translation: syntactically informed phrasal SMT.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 271-279. [PDF, 118KB]

(2005) R. Mahesh K.Sinha & Anil Thakur: Translation divergence in English-Hindi MT. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 245-254. [PDF, 93KB]

(2005) Dekai Wu: Recognizing paraphrases and textual entailment using inversion transduction grammars.  ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 25-30. [PDF, 113KB]

Tagging (see also Segmentation)

(2009) Han-Bin Chen, Jian-Cheng Wu & Jason S.Chang: Learning bilingual linguistic reordering model for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.254-262. [PDF, 333KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Nikesh Garera, Chris Callison-Burch & David Yarowsky: Improving translation lexicon induction from monolingual corpora via dependency contexts and part-of-speech equivalences. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.129-137. [PDF, 450KB]

(2009) Roland Meyer: Tagging Old Russian texts via modern translations – applications and resources for historical MT. EAMT-2009 Workshop Machine Translation for Historical Languages, 13 May 2009, Barcelona; slides [PDF of PPT, 142KB]

(2009) Jan Niehues & Muntsin Kolss: A POS-based model for long-range reorderings in SMT. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.206-214. [PDF, 200KB]

(2009) Davide Picca, Alfio Massimiliano Gliozzo, & Simone Campora: Bridging languages by supersense entity tagging. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.136-142. [PDF, 159KB]

(2009) Maja Popović & Hermann Ney: Syntax-oriented evaluation measures for machine translation output.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.29-32. [PDF, 278KB]

(2009) Maja Popović, David Vilar, Daniel Stein, Evgeny Matusov & Hermann Ney: The RWTH machine translation system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.66-69. [PDF, 210KB]

(2009) Zaid Md Abdul Wahab Sheikh & Felipe Sánchez-Martínez: A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortíz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 67-74. [PDF, 227KB]

(2009) Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, & Regina Barzilay: Adding more languages improves unsupervised multilingual part-of-speech tagging: a Bayesian non-parametric approach. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.83-91. [PDF, 599KB]

(2009) Sara Stymne: A comparison of merging strategies for translation of German compounds. Proceedings of the EACL 2009 Student Research Workshop, Athens, Greece, 2 April 2009; pp.61-69. [PDF, 111KB]

(2008) A.Allauzen & H.Bonneau-Maynard: Training and evaluation of POS taggers on the French MULTITAG corpus. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 113KB]

(2008) Sankaran Baskaran, Kalika Bali, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Girish Nath Jha, Rajendran S, Saravanan K, Sobha L, & Subbarao K.V: Designing a common POS-tagset framework for Indian languages. IJCNLP 2008: Sixth Workshop on Asian Language Resources, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.89-92. [PDF, 77KB]

(2008) Helena M. Caseli, Maria das Graças V.Nunes, & Mikel L. Forcada: From free shallow monolingual resources to machine translation systems: easing the task. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 41-48. [PDF, 454KB]

(2008) Mauro Cettolo, Marcello Federico, Daniele Pighin, & Nicola Bertoldi: Shallow-syntax phrase-based translation: joint versus factored string-to-chunk models. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 56-64. [PDF, 176KB]

(2008) Colin Cherry & Chris Quirk: Discriminative, syntactic language modeling through latent SVMs. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.65-74. [PDF, 609KB]

(2008) Hieu Hoang: Short range re-ordering using POS tags. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 24 slides. [PDF, 66KB]

(2008) Shasha Liao: Combining source and target language information for name tagging of machine translation output. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 19-24. [PDF, 98KB]

(2008) Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz, & Mikel L.Forcada: Using target-language information to train part-of-speech taggers for machine translation [abstract]. Machine Translation 22 (1/2), March-June 2008; pp.29-66.

(2008) Germán Sanchis & Joan A. Sanchis: Vocabulary extension via PoS information for SMT. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 63-70. [PDF, 379KB]

(2008) Thamar Solorio & Yang Liu: Part-of-speech tagging for English-Spanish code-switched text. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1051-1060. [PDF, 129KB]

(2008) Drahomíra (Johanka) Spoustová: Combining statistical and rule-based approaches to morphological tagging of Czech texts. Prague Bulletin of Mathematical Linguistics, no.89, June 2008; pp.23-40. [PDF, 174KB]

(2007) Alexandra Birch, Miles Osborne, & Philipp Koehn: CCG supertags in factored statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 9-16 [PDF, 175KB]

(2007) D. Déchelotte, H.Schwenk, H.Bonneau-Maynard, A.Allauzen, & G.Adda: A state-of-the-art statistical machine translation system based on Moses. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.127-133 [PDF, 142KB]

(2007) Olivier Gouirand: A probabilistic approach to linguistic analysis in machine translation output evaluation. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.46-54 [PDF, 2182KB]

(2007) Deepa Gupta, Mauro Cettolo, & Marcello Federico: POS-based reordering models for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp. 207-213 [PDF, 152KB]

(2007) Nizar Habash & Owen Rambow: Arabic diacritization through full morphological tagging. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.53-56 [PDF, 99KB]

(2007) Kay Rottmann & Stephan Vogel: Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180 [PDF, 3725KB]; presentation [PDF, 1129KB]

(2007) Ruhi Sarikaya, Yonggang Deng, & Yuqing Gao: Context dependent word modeling for statistical machine translation using part-of-speech tags. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2821-2824; abstract [PDF, 23KB]

(2006) Josep M.Crego & José B.Mariño: Integration of POStag-based source reordering into SMT decoding by an extended search graph. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.29-36 [PDF, 287KB]

(2006) Nizar Habash & Fatiha Sadat: Arabic preprocessing schemes for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 49-52 [PDF, 72KB]

(2006) Jirka Hana, Anna Feldman, Chris Brew, & Luiz Amaral: Tagging Portuguese with a Spanish tagger using cognates. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 253KB]

(2006) Sylwia Ozdowska: Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.53-60 [PDF, 366 KB]

(2006) Maja Popović & Hermann Ney: POS-based reorderings for statistical machine translation. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1278-1283 [PDF, 255KB]

(2005) Necip Fazil Ayan, Bonnie J.Dorr, & Christof Monz: Alignment link projection using transformation-based learning.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 185-192. [PDF, 132KB]

(2005) Antonio M.Corbi-Bellot, Mikel L. Forcada, Sergio Ortíz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Iñaki Alegria, Aingeru Mayor, & Kepa Sarasola: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 79-86. [PDF, 130KB]

(2005) Elliott Franco Drábek & David Yarowsky: Induction of fine-grained part-of-speech taggers via classifier combination and crosslingual projection.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 49-56. [PDF, 109KB]

(2005) Nizar Habash & Owen Rambow: Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 573-580. [PDF, 107KB]

(2005) Adam Lopez & Philip Resnik: Pattern visualization for machine translation output.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 12-13. [PDF, 160KB]

(2005) Patrick Paroubek: Evaluation of text analysis core technologies. Two successful examples: evaluating POS taggers and parsers for French.  ELRA-HLT Evaluation Workshop, Malta, 1-2 December 2005; 35pp. [PDF from PPT, 288KB]

(2005) Maja Popović, David Vilar, Hermann Ney, Slobodan Jovičić, & Zoran Šarić: Augmenting a small parallel text with morpho-syntactic language resources for Serbian-English statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 41-48. [PDF, 211KB]

(2005) Thamar Solorio: Exploiting named entity taggers in a second language. ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp.25-30.[PDF, 70KB]

Word alignment

(2009) Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, & Jason S.Chang: Acquiring translation equivalences of multiword expressions by normalized correlation frequencies. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.478-486. [PDF, 371KB]

(2009) Boxing Chen, Min Zhang, Haizhou Li & Aiti Aw: A comparative study of hypothesis alignment and its improvement for machine translation system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.941-948. [PDF, 119KB]

(2009) Fabien Cromières & Sadao Kurohashi: An alignment algorithm using belief propagation and a structure-based distortion model. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.166-174. [PDF, 221KB]

(2009) Yonggang Deng & Bowen Zhou: Optimizing word alignment combination for phrase table training. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.229-232. [PDF, 132KB]

(2009) Jinhua Du, Yanjun Ma & Andy Way: Source-side context-informed hypothesis alignment for combining outputs from machine translation systems. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.230-27. [PDF, 277KB]

(2009) Jakob Elming, Nizar Habash, & Josep M.Crego: Combination of statistical word alignments based on multiple preprocessing schemes.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.93-110.

(2009) Christian Federmann, Silke Theison, Andreas Eisele, Hans Uszkoreit, Yu Chen, Michael Jellinghaus, & Sabine Hunsicker: Translation combination using factored word substitution.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.70-74. [PDF, 100KB]

(2009) Alexander Fraser, Renjing Wang, & Hinrich Schütze: Rich bitext projection features for parse reranking. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.282-290. [PDF, 132KB]

(2009) João Graça, Kuzman Ganchev & Ben Taskar: PostCAT – posterior constrained alignment toolkit. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.27-36. [PDF, 134KB]

(2009) Francisco Guzman, Qin Gao & Stephan Vogel: Reassessment of the role of phrase extraction in PBSMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 49-56. [PDF, 131KB]

(2009) Barry Haddow: Word-based models and the EM algorithm. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.35-41 [PDF, 167KB]

(2009) Aria Haghighi, John Blitzer, John DeNero & Dan Klein: Better word alignments with supervised ITG models. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.306-314. [PDF, 301KB]

(2009) Ulf Hermjakob: Improved word alignment with statistics and linguistic heuristics. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.229-237. [PDF, 107KB]

(2009) Maria Holmqvist, Sara Stymne, Jody Foo, & Lars Ahrenberg: Improving alignment for SMT by reordering and augmenting the training corpus.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.120-124. [PDF, 102KB]

(2009) Gumwon Hong, Seung-Wook Lee & Hae-Chang Rim: Bridging morpho-syntactic gap between source and target sentences for English-Korean statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.233-236. [PDF, 178KB]

(2009) Fei Huang: Confidence measure for word alignment. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.932-940. [PDF, 727KB]

(2009) Hamidreza Kobdani, Alexander Fraser, & Hinrich Schütze: Word alignment by thresholded two-dimensional normalization. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.260-267. [PDF, 164KB]

(2009) Patrik Lambert, Yanjun Ma, Sylwia Ozdowska & Andy Way: Tracking relevant alignment characteristics for machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.268-275. [PDF, 443KB]

(2009) Yang Liu, Tian Xia, Xinyan Xiao, & Qun Liu: Weighted alignment matrices for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1017-1026. [PDF, 164KB]

(2009) Yanjun Ma, Patrik Lambert & Andy Way: Tuning syntactically enhanced word alignment for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.250-257. [PDF, 488KB]

(2009) Yanjun Ma & Andy Way: Bilingually motivated domain-adapted word segmentation for statistical machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.549-557. [PDF, 325KB]

(2009) Svetlin Nakov, Preslav Nakov, & Elena Paskaleva: Unsupervised extraction of false friends from parallel bi-texts using the web as a corpus.  [RANLP 2009] International conference: Recent Advances in Natural Language Processing. Proceedings ed. Galia Angelova, Kalina Bontcheva, Ruslan Mitkov, Nicolas Nicolov, Nikolai Nikolov, Borovets, Bulgaria, 14-16 September 2009; pp. 292-298. [PDF, 235KB]

(2009) Tsuyoshi Okita: Data cleaning for word alignment. [ACL-IJCNLP-2009] Proceedings of the ACL-IJCNLP Student Research Workshop, Suntec, Singapore, 4 August 2009; pp.72-80. [PDF, 566KB]

(2009) Markus Saers & Dekai Wu: Improving phrase-based translation via word alignments from stochastic inversion transduction grammars.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.28-36. [PDF, 554KB]

(2009) Sandor Szedmak, Esther Galbrun, Craig Saunders, & Yizhao Ni: Large scale, maximum margin regression based, structural learning approach to phrase translations. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 34 slides. [PDF of PPT, 276KB]

(2009) Sina Zarrieß & Jonas Kuhn: Exploiting translational correspondences for pattern-independent MWE identification.  [ACL-IJCNLP-2009] Proceedings of the 2009 Workshop on Multiword Expressions, ACL-IJCNLP 2009, Suntec, Singapore, 6 August 2009; pp.23-30. [PDF, 132KB]

(2009) Yuqi Zhang, Evgeny Matusov, & Hermann Ney: Are unaligned words important for machine translation? EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.226-233. [PDF, 388KB]

(2008) Vamshi Ambati & Alon Lavie: Improving syntax driven translation models by re-structuring divergent and non-isomorphic parse tree structures. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.235-244 [PDF, 734KB]

(2008) Nguyen Bach, Qin Gao, & Stephan Vogel: Improving word alignment with language model based confidence scores. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.151-154. [PDF, 607KB]

(2008) Brett W.Bader & Peter A.Chew: Enhancing multilingual latent semantic analysis with term alignment information.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.49-56. [PDF, 295KB]

(2008) Ming-Hong Bai, Keh-Jiann Chen, & Jason S.Chang: Improving word alignment by adjusting Chinese word segmentation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.249-256. [PDF, 560KB]

(2008) Josep M. Crego & Nizar Habash: Using shallow syntax information to improve word alignment and reordering for SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.53-61. [PDF, 200KB]

(2008) Béatrice Daille & Emmanuel Morin: A effective compositional model for lexical alignment. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp 95-102. [PDF, 401KB]

(2008) John DeNero, Alexandre Bouchard-Côté, & Dan Klein: Sampling alignment structure under a Bayesian translation model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.314-323. [PDF, 356KB]

(2008) Chris Dyer, Alex Mont, Aaron Cordova, & Jimmy Lin: SMT model building with MapReduce. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 91 slides. [PDF, 1850KB]

(2008) Victoria Fossum & Kevin Knight: Using bilingual Chinese-English word alignments to resolve PP-attachment ambiguity in English. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.245-253 [PDF, 595KB]

(2008) Victoria Fossum, Kevin Knight, & Steven Abney: Using syntax to improve word alignment precision for syntax-based machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.44-52. [PDF, 150KB]

(2008) Alex Fraser: Improved word alignments for statistical machine translation. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 28 slides. [PDF, 174KB]

(2008) Kuzman Ganchev, João V. Graça, & Ben Taskar: Better alignments = better translations? ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 763-770. [PDF, 685KB]

(2008) Qin Gao & Stephan Vogel: Parallel implementations of word alignment tool. ACL-08 HLT: Software Engineering, Testing, and Quality Assurance for Natural Language Processing, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp.49-57. [PDF, 340KB]

(2008) Ulrich Germann: Yawat: yet another word alignment tool. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the demo session, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 20-23. [PDF, 360KB]

(2008) Jesús González-Rubio, Germán Sanchis-Trilles, Alfons Juan, & Francisco Casacuberta: A novel alignment model inspired on IBM model 1. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.47-56. [PDF, 525KB]

(2008) João Graça, Joana Paulo Pardal, Luisa Coheur, & Diamantino Caseiro: Building a golden collection of parallel multi-language word alignments. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 301KB]

(2008) Le An Ha, Gabriela Fernandez, Ruslan Mitkov, & Gloria Corpas: Mutual bilingual terminology extraction. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 87KB]

(2008) Saša Hasan, Juri Ganitkevitch, Hermann Ney, & Jesús Andrés-Ferrer:  Triplet lexicon models for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.372-381. [PDF, 181KB]

(2008) Maria Holmqvist, Lars Ahrenberg, & Sara Stymne: High-precision word alignment with parallel phrases. SLTC 2008: Second Swedish Language Technology Conference, November 20-21, 2008, Stockholm; pp.45-46. [PDF, 286KB]

(2008) Katsunori Kotani, Takehiko Yoshimi, Takeshi Kutsumi, Ichiko Sata, & Hitoshi Isahara: A method of automatically evaluating machine translations using a word-alignment based classifier. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 11-18. [PDF, 259KB]

(2008) Patrik Lambert & Rafael E.Banchs: Word association models and search strategies for discriminative word alignment. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.97-103. [PDF, 633KB]

(2008) Jin-Ji Li, Dong-Il Kim, & Jong-Hyeok Lee: Annotation guidelines for Chinese-Korean word alignment.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 195KB]

(2008) Dekang Lin, Shaojun Zhao, Benjamin Van Durme, & Marius Paşca: Mining parenthetical translations from the web by word alignment. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 994-1002. [PDF, 419KB]

(2008) Yanjun Ma, John Tinsley, Hany Hassan, Jinhua Du, & Andy Way: Exploiting alignment techniques in MATREX: the DCU machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp. 26-33. [PDF, 198KB]; presentation [PDF, 559KB]

(2008) Yanjun Ma, Sylwia Ozdowska, Yanli Sun, & Andy Way: Improving word alignment using syntactic dependencies. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.69-77. [PDF, 181KB]

(2008) Jan Niehues & Stephan Vogel: Discriminative word alignment via alignment matrix modelling. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.18-25. [PDF, 576KB]

(2008) Vassilina Nikoulina & Marc Dymetman: Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60. [PDF, 98KB]

(2008) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Phrase-level alignment generation using a smoothed loglinear phrase-based statistical alignment model. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.160-169. [PDF, 536KB]

(2008) Sylwia Ozdowska: Cross-corpus evaluation of word alignment.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 63KB]

(2008) Diana Santos & Alberto Simões: Portuguese-English word alignment: some experiments. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 68KB]

(2008) Jörg Tiedemann: Synchronizing translated movie subtitles. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 171KB]

(2008) Takashi Tsunakawa, Naoaki Okazaki, & Jun’ichi Tsujii: Building a bilingual lexicon using phrase-based statistical machine translation via a pivot language. Coling 2008:  22nd International Conference on Computational Linguistics, Posters and demonstrations, 18-22 August 2008, Manchester UK; pp.127-130. [PDF, 115KB]

(2008) Takashi Tsunakawa, Naoaki Okazaki, & Jun’ichi Tsujii: Building bilingual lexicons using lexical translation probabilities via pivot languages. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 66KB]

(2008) Tom Vanallemeersch & Cornelia Wermuth: Linguistics-based word alignment for medical translators. Journal of Specialised Translation 9 (January 2008); pp.20-38. [PDF, 224KB]

(2008) Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201. [PDF, 614KB]

(2008) Hao Zhang, Daniel Gildea, & David Chiang: Extracting synchronous grammar rules from word-level alignments in linear time. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1081-1088. [PDF, 181KB]

(2008) Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma, & Hitoshi Isahara: Word alignment annotation in a Japanese-Chinese parallel corpus.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 195KB]

(2008) Ming Zhou, Bo Wang, Shujie Liu, Mu Li, Dongdong Zhang, & Tiejun Zhao: Diagnostic evaluation of machine translation systems using automatically constructed check-points. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1121-1128. [PDF, 231KB]

(2007) Wen-Han Chao & Zhou-Jun Li: Incorporating constituent structure constraint into discriminative word alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.97-103 [PDF, 112KB]

(2007) Paul C. Davis, Zhuli Xie, & Kevin Small: All links are not the same: evaluating word alignments for statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.119-126 [PDF, 122KB]

(2007) John DeNero & Dan Klein: Tailoring word alignments to syntactic machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 17-24 [PDF, 415KB]

(2007) Yonggang Deng & Yuqing Gao: Guiding statistical word alignment models with prior knowledge. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 1-8 [PDF, 245KB]

(2007) Jakob Elming & Nizar Habash: Combination of statistical word alignments based on multiple preprocessing schemes. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.25-28 [PDF, 123KB]

(2007) Alexander Fraser & Daniel Marcu: Measuring word alignment quality for statistical machine translation. Computational Linguistics 33 (3), pp. 293-303. [PDF, 163KB]

(2007) Alexander Fraser & Daniel Marcu: Getting the structure right for word alignment: LEAF. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 51-60. [PDF, 669KB]

(2007) Ulrich Germann: Two tools for creating and visualizing sub-sentential alignments of parallel text. ACL 2007: proceedings of the Linguistic Annotation Workshop, Prague, Czech Republic, 28-29 June 2007; pp.121-124 [PDF, 247KB]

(2007) Xiaodong He: Using word dependent transition models in HMM based word alignment for statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 80-87 [PDF, 139KB]

(2007) Philipp Koehn & Hieu Hoang: Factored translation models. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 868-876. [PDF, 225KB]

(2007) Shankar Kumar, Franz Och, & Wolfgang Macherey: Improving word alignment with bridge languages. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 42-50. [PDF, 183KB]

(2007) Caroline Lavecchia, Kamel Smaïli, & David Langlois: Building a bilingual dictionary from movie subtitles based on inter-lingual triggers. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 19pp. [PDF, 139KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Alignment-guided chunking. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.114-121 [PDF, 357KB]; presentation [PDF, 377KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Bootstrapping word alignment via word packing. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 304-311 [PDF, 198KB]

(2007) Jonathan May & Kevin Knight: Syntactic re-alignment models for machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 360-368. [PDF, 180KB]

(2007) Kemal Oflazer & İlknur Durgar El-Kahlout: Exploring different representational units in English-to-Turkish statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 25-32 [PDF, 447KB]

(2007) Dengjun Ren, Hua Wu, & Haifeng Wang: Improving statistical word alignment with various clues. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.391-397 [PDF, 167KB]

(2007) Sriram Venkatapathy & Aravind K.Joshi: Discriminative word alignment by learning the alignment structure and syntactic divergence between a language pair. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.49-56 [PDF, 172KB]

(2007) Hua Wu & Haifeng Wang: Comparative study of word alignment heuristics and phrase-based SMT. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.507-514 [PDF, 168KB]

(2007) Ruiqiang Zhang & Eiichiro Sumita: Boosting statistical machine translation by lemmatization and linear interpolation. ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 181-184 [PDF, 91KB]

(2006) Necip Fazil Ayan & Bonnie J. Dorr: A maximum entropy approach to combining word alignments.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 96-103 [PDF, 325KB]

(2006) Necip Fazil Ayan & Bonnie J. Dorr: Going beyond AER: an extensive analysis of word alignments and their impact on MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.9-16. [PDF, 274KB]

(2006)Alexandra Birch, Chris Callison-Burch, Miles Osborne, & Philipp Koehn: Constraining the phrase-based, joint probability statistical translation model.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 154-157 [PDF, 134KB]

(2006) Ondřej Bojar & Magdalena Prokopová: Czech-English word alignment. LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1236-1239 [PDF, 270KB]

(2006) Helena M.Caseli, Maria das Graças V.Nunes, & Mikel L.Forcada: Automatic induction of bilingual resources from aligned parallel corpora: application to shallow-transfer machine translation [abstract]. Machine Translation 20 (4),2006; pp.227-245.

(2006) Niladri Chatterjee & Saumya Agrawal: Word alignment in English-Hindi parallel corpus using recency-vector approach: some studies. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.649-656. [PDF, 391KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Colin Cherry & Dekang Lin: A comparison of syntactically motivated word alignment spaces. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.145-152 [PDF, 150KB]

(2006) Hal Daumé III & Daniel Marcu: Induction of word and phrase alignments for automatic document summarization. Computational Linguistics 31 (4), pp. 505-530. [PDF, 507KB]

(2006) Alexander Fraser & Daniel Marcu: Semi-supervised training for statistical word alignment. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.769-776. [PDF, 200KB]

(2006) Ivana Kruijff-Korbayová, Klára Chvátalová, & Oana Postolache: Annotation guidelines for Czech-English word alignment.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1256-1261 [PDF, 608KB]

(2006) Simon Lacoste-Julien, Ben Taskar, Dan Klein, & Michael I. Jordan: Word alignment via quadratic assignment.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 112-119 [PDF, 177KB]

(2006) Percy Liang, Ben Taskar, & Dan Klein: Alignment by agreement.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 104-111 [PDF, 150KB]

(2006) Yang Liu, Qun Liu, & Shouxun Lin: Tree-to-string alignment template for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.609-616. [PDF, 392KB]

(2006) Adam Lopez & Philip Resnik: Word-based alignment, phrase-based translation: what’s the link?  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.90-99 [PDF, 183KB]

(2006) Robert C.Moore, Wen-tau Yih, & Andreas Bode: Improved discriminative bilingual word alignment. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.513-520. [PDF, 121KB]

(2006) Sylwia Ozdowska: Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.53-60 [PDF, 366 KB]

(2006) Ari Rappoport & Tsahi Levent-Levi: Induction of cross-language affix and letter sequence correspondence. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 332 KB]

(2006) Luis Rodríguez, Ismael García-Varea, & José A. Gámez: Searching for alignments in SMT: a novel approach based on an estimation of distribution algorithm. HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 47-54 [PDF, 478KB]

(2006) Bettina Schrader: ATLAS – a new text alignment architecture. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.715-722. [PDF, 139KB]

(2006) Bettina Schrader: Non-probabilistic alignment of rare German and English nominal expressions.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1274-1277 [PDF, 251KB]

(2006) Dan Tufiş, Radu Ion, Alexandru Ceauşu, & Dan Ştefănescu: Improved lexical alignment by combining multiple reified alignments. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.153-160 [PDF, 195KB]

(2006) Lonneke van der Plas & Jörg Tiedemann: Finding synonyms using automatic word alignment and measures of distributional similarity.  Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.866-873. [PDF, 223KB]

(2006) Sriram Venkatapathy & Aravind K.Joshi: Using information about multi-word expressions for the word-alignment task. Coling-ACL 2006: Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, Sydney, July 2006; pp.20-27 [PDF, 120KB]

(2006) Begoña Villada Moirón & Jörg Tiedemann: Identifying idiomatic expressions using word-alignment. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.33-40 [PDF, 291KB]

(2006) Haifeng Wang, Hua Wu, & Zhanyi Liu: Word alignment for languages with scarce resources using bilingual corpora of other language pairs. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.874-881. [PDF, 155KB]

(2006) Hua Wu, Haifeng Wang, & Zhanyi Liu: Boosting statistical word alignment using labeled and unlabeled data. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.913-920. [PDF, 97KB]

(2006) Hao Zhang & Daniel Gildea: Inducing word alignments with bilexical synchronous trees. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.953-960. [PDF, 156KB]

(2006) Bing Zhao & Eric P. Xing: BiTAM: bilingual topic admixture models for word alignment. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.969-976. [PDF, 574KB]

(2005) Alison Alvarez, Lori Levin, Robert Frederking, Erik Peterson & Jeff Good: Semi-automated elicitation corpus generation . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.388-395. [PDF, 131KB]

(2005) Niraj Aswani & Robert Gaizauskas: Aligning words in English-Hindi parallel corpora. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.115-118. [PDF, 106KB]

(2005) Niraj Aswani & Robert Gaizauskas: A hybrid approach to align sentences and words in English-Hindi parallel corpora. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.57-64. [PDF, 160KB]

(2005) Necip Fazil Ayan, Bonnie J. Dorr, & Christof Monz: NeurAlign: combining word alignments using neural networks.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 65-72. [PDF, 139KB]

(2005) Necip Fazil Ayan, Bonnie J.Dorr, & Christof Monz: Alignment link projection using transformation-based learning.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 185-192. [PDF, 132KB]

(2005) Helena M. Caseli, Maria G.V. Nunes, & Mikel L. Forcada: LIHLA: shared task system description. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp.111-114. [PDF, 68KB]

(2005) Josep M. Crego, Marta R. Costa-jussà, José B. Mariño, & José A. R. Fonollosa: Ngram-based versus phrase-based statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2005], 24-25 October, 2005, Pittsburgh, PA, USA; 8pp. [PDF, 175KB]

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Elliott Franco Drábek & David Yarowsky: Induction of fine-grained part-of-speech taggers via classifier combination and crosslingual projection.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 49-56. [PDF, 109KB]

(2005) Hiroshi Echizen-ya, Kenji Araki, & Yoshio Momouchi: Automatic acquisition of bilingual rules for extraction of bilingual word pairs from parallel corpora. ACL-SIGLEX-2005: Workshop on Deep Lexical Acquisition, University of Michigan, Ann Arbor, 30 June 2005; pp. 87-96.  [PDF, 726KB]

(2005) Marcello Federico & Nicola Bertoldi: A word-to-phrase statistical translation model. ACM Transactions on Speech and Language Processing 2 (2), pp.1-24. [abstract, PDF, 13KB]

(2005) Abraham Ittycheriah & Salim Roukos: A maximum entropy word aligner for Arabic-English machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 327KB]

(2005) Hideki Kashioka: Word alignment viewer for long sentences. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.427-431. [PDF, 256KB]

(2005) Philipp Koehn: Statistical machine translation, lecture 3: word alignment and phrase models. ESSLLI-2005: 17th European Summer School in Logic, Language and Information, Heriot-Watt University, Edinburgh, Scotland, 8-19 August 2005; 8pp. [PDF of PPT presentation, 121KB]

(2005) Grzegorz Kondrak: Cognates and word alignment in bitexts. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.305-312. [PDF, 179KB]

(2005) Jonas Kuhn: Parsing word-aligned parallel corpora in a grammar induction context. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 17-25. [PDF, 150KB]

(2005) Philippe Langlais, Fabrizio Gotti, & Guihong Cao: NUKTI: English-Inuktitut word alignment system description.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 75-78. [PDF, 72KB]

(2005) Yang Liu, Qun Liu, & Shouxun Lin: Log-linear models for word alignment. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 459-466. [PDF, 466KB]

(2005) Adam Lopez & Philip Resnik: Improved HMM alignment models for languages with scarce resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 83-86. [PDF, 103KB]

(2005) Joel Martin, Rada Mihalcea, & Ted Pedersen: Word alignment for languages with scarce resources. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 65-74. [PDF, 170KB]

(2005) Evgeny Matusov, Stephan Kanthak, & Hermann Ney: Efficient statistical machine translation with constrained reordering. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 181-188 [PDF, 110KB]

(2005) Robert C. Moore: Association-based bilingual word alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 1-8. [PDF, 102KB]

(2005) Robert C. Moore: A discriminative framework for bilingual word alignment.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 81-88. [PDF, 118KB]

(2005) Carol Nichols & Rebecca Hwa: Word alignment and cross-lingual resource acquisition. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 69-72. [PDF, 331KB]

(2005) Sylwia Ozdowska: Using bilingual dependencies to align words in English/French parallel corpora.  ACL-2005: Student Research Workshop, University of Michigan, Ann Arbor, June 2005; pp. 127-132. [PDF, 89KB]

(2005) Charles Schafer & Elliott Franco Drábek: Models for Inuktitut-English word alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 79-82. [PDF, 183KB]

(2005) Sayori Shimohata: Finding translation candidates from patent corpus MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Patent Translation; pp.50-54. [PDF, 290KB]

(2005) Michel Simard, Nicola Cancedda, Bruno Cavestro, Marc Dymetman, Eric Gaussier, Cyril Goutte, Kenji Yamada, Philippe Langlais, & Arne Mauser: Translating with non-contiguous phrases. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 755-762. [PDF, 180KB]

(2005) Ben Taskar, Simon Lacoste-Julien, & Dan Klein: A discriminative matching approach to word alignment. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 73-80. [PDF, 902KB]

(2005) Dan Tufiş, Radu Ion, Alexandru Ceauşu & Dan Ştefănescu: Combined word alignments. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 107-110. [PDF, 248KB]

(2005) Wu Hua, Wang Haifeng, Liu Zhanyi & Tang Kai: Improving translation memory with word alignment information . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.364-371. [PDF, 474KB]

(2005) Wu Hua & Wang Haifeng: Boosting statistical word alignment. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.313-320. [PDF, 393KB]

(2005) Wu Hua, Wang Haifeng, & Liu Zhanyi: Alignment model adaptation for domain-specific word alignment.  ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 467-474. [PDF, 291KB]

(2005) Virach Sornlertlamvanich, Canasai Kruengkrai, Shisanu Tongchin, Prapass Srichaivattana, & Hitoshi Isahara: Term-based ontology alignment. In: Jesús Cardeñosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.138-144 [abstract, PDF, 70KB]

(2005) Yujie Zhang, Qun Liu, Qing Ma, & Hitoshi Isahara: A multi-aligner for Japanese-Chinese parallel corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.133-140. [PDF, 420KB]

(2005) Bing Zhao, Eric P.Xing, & Alex Waibel: Bilingual word spectral clustering for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 25-32. [PDF, 467KB]

(2005) Bing Zhao, Niyu Ge, & Kishore Papineni: Inner-outer bracket models for word alignment using hidden blocks. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 177-184. [PDF, 592KB]

Word sense disambiguation

 (2009) Marianna Apidianaki: Data-driven semantic analysis for multilingual WSD and lexical selection in translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.77-85. [PDF, 364KB]

(2009) Jinho D.Choi, Martha Palmer, & Nianwen Xue: Using parallel Propbanks to enhance word-alignments. ACL-IJCNLP 2009: Third Linguistic Annotation Workshop (LAW III), Proceedings of the workshop, 6-7 August 2009, Suntec, Singapore; pp.121-124. [PDF, 139KB]

(2009) David Farwell & Lluís Padró: FreeLing: from a multilingual open-source analyzer suite to an EBMT platform. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.37-43. [PDF, 372KB]; presentation [PDF of PPT, 331KB]

(2009) Mitesh M.Khapra, Sapan Shah, Piyush Kedia, & Pushpak Bhattacharyya: Projecting parameters for multilingual word sense disambiguation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.459-467. [PDF, 670KB]

(2009) Els Lefever & Veronique Hoste: SemEval-2010 Task 3: cross-lingual word sense disambiguation. NAACL-HLT-2009 (SEW-2009): Semantic Evaluations: Recent Achievements and Future Directions, Proceedings of the workshop, June 4, 2009, Boulder, Colorado; pp.76-81. [PDF, 146KB]

(2009) Chakaveh Saedi, Mehrnoush Shamsfard, & Yasaman Motazedi: Automatic translation between English and Persian texts. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 1001KB]

(2008) Eneko Agirre & Aitor Soroa: Using the multilingual central repository for graph-based word sense disambiguation. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 114KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation using parallel corpora and matching  schemes.  EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.6-11. [PDF, 616KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation approach based on naïve Bayesian classifier. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 20-22 October 2008; pp.331-338. [PDF, 318KB]

(2008) Marianna Apidianaki: Translation-oriented word sense induction based on parallel corpora. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 232KB]

(2008) Atelach Alemu Argaw: Word sense discrimination using context vector similarity. SLTC 2008: Second Swedish Language Technology Conference, November 20-21, 2008, Stockholm; pp.65-66. [PDF, 322KB]

(2008) Kevin Gimpel & Noah A. Smith: Rich source-side context for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.9-17. [PDF, 211KB]

(2008) Violaine Prince & Jacques Chauche: Building a bilingual representation of the Roget thesaurus for French to English machine translation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 142KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Eneko Agirre, Bernardo Magnini, Oier Lopez de Lacalle, Arantxa Otegi, German Rigau, & Piek Vossen: SemEval-2007 task 01: evaluating WSD on cross-language information retrieval. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.1-6 [PDF, 79KB]

(2007) Marine Carpuat & Dekai Wu: Improving statistical machine translation using word sense disambiguation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 61-72. [PDF, 439KB]

(2007) Yee Seng Chan, Hwee Tou Ng, & David Chiang: Word sense disambiguation improves statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 33-40 [PDF, 516KB]

(2007) Peng Jin, Yunfang Wu, & Shiwen Yu: SemEval-2007 task 5: multilingual Chinese-English lexical sample. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.19-23 [PDF, 107KB]

(2007) Peng Yuan Liu, TieJun Zhao, & Mu Yun Yang: HIT-WSD: using search engine for multilingual Chinese-English lexical sample task. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.160-172 [PDF,113KB]

(2007) Hwe Tou Ng & Yee Seng Chan: SemEval-2007 task 11: English lexical sample task via English-Chinese parallel text. ACL 2007: proceedings of the 4th International  Workshop on Semantic Evaluations (SemEval-2007), Prague, Czech Republic, 23-24 June 2007; pp.54-58 [PDF, 84KB]

(2007) Alla Rozovskaya & Richard Sproat: Multilingual word sense disambiguation: a comparative cross-linguistic study. ACL 2007: Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, June 29, 2007, Prague, Czech Republic; pp.82-87. [PDF, 122KB]

(2007) Lucia Specia, Mark Stevenson, & Maria das Graças V.Nunes: Learning expressive models for word sense disambiguation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 41-48 [PDF, 99KB]

(2006) Simona Balbi & Michelangelo Misuraca: Rotated canonical correlation analysis for multilingual corpora. JADT 2006: 8es Journées internationals d’Analyse statistique des Données Textuelles, 19-21 avril 2006, Besançon, France; pp.99-106. [PDF, 216KB]

(2006) Marine Carpuat, Yihai Shen, Xiaofeng Yu, & Dekai Wu: Toward integrating word sense and entity disambiguation into statistical machine translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 37-44  [PDF, 330KB]

(2006) Irina Matveeva & Gina-Anne Levow: Computing term translation probabilities with generalized latent semantic analysis. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Posters and demonstrations, Trento, Italy, April 5-6, 2006; pp.151-154 [PDF, 90KB]

(2006) Márton Miháltz & Gábor Pohl: Exploiting parallel corpora for supervised word sense disambiguation in English-Hungarian machine translation.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1294-1297 [PDF, 339KB]

(2006) Reinhard Rapp & Carlos Martin Vide: Example-based machine translation using a dictionary of word pairs.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.1268-1273 [PDF, 352KB]

(2006) Lucia Specia: A hybrid relational approach for WSD – first results. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Student Research Workshop, Sydney, July 2006; pp.55-60. [PDF, 75KB]

(2006) Lucia Specia, Maria das Graças Volpe Nunes, Mark Stevenson, & Gabriela Castelo Branco Ribeiro: Multilingual versus monolingual WSD. EACL-2006: Proceedings of the Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together, April 4, 2006, Trento, Italy; pp.33-40. [PDF, 116KB]

(2006) Lucia Specia, Maria das Graças Volpe Nunes, & Mark Stevenson: Translation context sensitive WSD. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.227-232 [PDF, 206KB]

(2006) Xinglong Wang & David Martinez: Word sense disambiguation using automatically translated sense examples. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Cross-Language Knowledge Induction Workshop, Trento, Italy, April 3, 2006; pp.45-52 [PDF, 272 KB]

(2005) Ralf Brown: Context-sensitive retrieval for example-based translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.9-15. [PDF, 273KB]

(2005) Clara Cabezas & Philip Resnick: Using WSD techniques for lexical selection in statistical machine translation. College Park, ND: Institute for Advanced Computer Studies, University of Maryland, July 2005 (CS-TR-4736, LAMP-TR-124, UMIACS-TR-2005-42) 12pp. [PDF, 240KB]

(2005) Marine Carpuat & Dekai Wu: Evaluating the word sense disambiguation performance of statistical machine translation. IJCNLP-05: Second International Joint Conference on Natural Language Processing, 11-13 October 2005, Jeju Island, Republic of Korea; pp.120-125. [PDF, 93KB]

(2005) Marine Carpuat & Dekai Wu: Word sense disambiguation vs. statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 387-394. [PDF, 78KB]

(2005) Youjin Chung & Jong-Hyeok Lee: Practical word-sense disambiguation using co-occurring concept codes [abstract]. Machine Translation 19 (1),2005; pp.59-82.

(2005) Márton Miháltz: Towards a hybrid approach to word-sense disambiguation in machine translation. International workshop: Modern approaches in translation technologies, Borovets, Bulgaria, 24 September 2005; 4pp. [PDF, 145KB]

(2005) Constantin Orasan, Ted Marshall, Robert Clark, Le An Ha, & Ruslan Mitkov: Building a WSD module within an MT system to enable interactive resolution in the user's source language. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 205-211. [PDF, 67KB]

(2005) Siddharth Patwardhan, Satanjeev Banerjee & Ted Pedersen: SenseRelate::TargetWord – a generalized framework for word sense disambiguation. ACL-2005: Interactive Poster and Demonstration Sessions, University of Michigan, Ann Arbor, June 2005; pp. 73-76. [PDF, 54KB]

(2005) David Vickrey, Luke Biewald, Marc Teyssier, & Daphne Koller: Word-sense disambiguation for machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 771-778. [PDF, 149KB]

(2005) Xinglong Wang & John Carroll: Word sense disambiguation using sense examples automatically acquired from a second language. HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 684-691. [PDF, 311KB]