Machine Translation Archive

Index to statistical and computational aspects

Publications 2005-2009

For other periods go to: Publications since 2010; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

To return to home page click here

Bayesian methods

(2009) Adriŕ de Gispert, Sami Virpioja, Mikko Kurimo, & William Byrne: Minimum Bayes risk combination of translation hypotheses from alternative morphological decompositions.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.73-76. [PDF, 72KB]

(2009) John DeNero, David Chiang, & Kevin Knight: Fast consensus decoding over translation forests. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.567-575. [PDF, 650KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, & Regina Barzilay: Adding more languages improves unsupervised multilingual part-of-speech tagging: a Bayesian non-parametric approach. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.83-91. [PDF, 599KB]

(2009) Zhifei Li, Jason Eisner, & Sanjeev Khudanpur: Variational decoding for statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.593-601. [PDF, 217KB]

(2008) John DeNero, Alexandre Bouchard-Côté, & Dan Klein: Sampling alignment structure under a Bayesian translation model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.314-323. [PDF, 356KB]

(2008) Farag Ahmed & Andreas Nürnberger: Arabic/English word translation disambiguation approach based on naďve Bayesian classifier. Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 20-22 October 2008; pp.331-338. [PDF, 318KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Machery: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2008) Jia Xu, Jianfeng Gao, Kristina Toutanova, & Hermann Ney: Bayesian semi-supervised Chinese word segmentation for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1017-1024. [PDF, 137KB

(2007) Nicola Ehling, Richard Zens, & Hermann Ney: Minimum Bayes risk decoding for BLEU.  ACL 2007: proceedings of demo and poster sessions, Prague, Czech Republic, June 2007; pp. 101-104 [PDF, 334KB]

(2006) Simon Corston-Oliver, Anthony Aue, Kevin Duh & Eric Ringger: Multilingual dependency parsing using Bayes point machines.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 160-167 [PDF, 178KB]

Bootstrapping

(2007) Paula Estrella, Olivier Hamon, & Andrei Popescu-Belis: How much data is needed for reliable MT evaluation? Using bootstrapping to study human and automatic metrics. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.167-174 [PDF, 143KB]

(2007) Paula Estrella, Andrei Popescu-Belis, & Maghi King: A new method for the study of correlations between MT evaluation metrics. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.55-64 [PDF, 949KB]; presentation [PDF, 95KB]

(2007) Yanjun Ma, Nicolas Stroppa, & Andy Way: Bootstrapping word alignment via word packing. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 304-311 [PDF, 198KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Bootstrapping a stochastic transducer for Arabic-English transliteration extraction. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 864-871 [PDF, 189KB]

(2007) Yik-Cheung Tam, Ian Lane, & Tanja Schultz: Bilingual-LSA based LM adaptation for spoken language translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 520-527 [PDF, 162KB]

(2005) Philippe Langlais, Simona Gandrabur, Thomas Leplus, & Guy Lapalme: The long-term forecast for weather bulletin translation [abstract]. Machine Translation 19 (1), 2005; pp.83-112.

Chart parsing

(2009) Martin Čmejrek, Bowen Zhou, & Bing Xiang: Enriching SCFG rules directly from efficient bilingual chart parsing. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 136-143. [PDF, 315KB]; presentation [PDF of PPT, 982KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) Zhifei Li & Jason Eisner: First- and second-order expectation semirings with applications to minimum-risk training on translation forests. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.40-51. [PDF, 406KB]

(2009) Yang Liu, Yajuan Lü & Qun Liu: Improving tree-to-tree translation with packed forests.  [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.558-566. [PDF, 151KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, Aiti Aw, Chew Lim Tan: Forest-based tree sequence to string  translation model. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.172-180. [PDF, 254KB]

(2008) Haitao Mi, Liang Huang, & Qun Liu: Forest-based translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 192-199. [PDF, 185KB]

(2008) Haitao Mi & Liang Huang: Forest-based translation rule extraction.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.206-214. [PDF, 170KB]

(2007) Liang Huang & David Chiang: Forest rescoring: faster decoding with integrated language models. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 144-151 [PDF, 215KB]

(2007) Yang Liu, Yun Huang, Qun Liu, & Shouxun Lin: Forest-to-string statistical translation rules. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 704-711 [PDF, 198KB]

(2006) Andreas Zollmann & Ashish Venugopal: Syntax augmented machine translation via chart parsing.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 138-141 [PDF, 99KB]

Cloud computing

(2009) Sven Christian Andrä & Jörg Schütz: Cloud computing in GILT ecosystems and evolution. Translating and the Computer 31, 19-20 November 2009, London; 10pp. [PDF, 177KB]

(2009) Sven Christian Andrä & Jörg Schütz: MT Bazaar: translation ecosystems in the cloud. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 395-402. [PDF, 144KB]

(2009) Lamine Aouad: Mapping localisation workflows on the cloud. Translating and the Computer 31, 19-20 November 2009, London; 12pp. [PDF, 303KB]

Computational complexity (see also Modularity)

(2009) Anders Sřgaard: On the complexity of alignment problems in two synchronous grammar formalisms.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.60-68. [PDF, 161KB]

(2008) Yu Chen, Andreas Eisele, & Martin Kay: Improving statistical machine translation efficiency by triangulation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 122KB]

(2008) Chris Dyer, Alex Mont, Aaron Cordova, & Jimmy Lin: SMT model building with MapReduce. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 91 slides. [PDF, 1850KB]

(2008) Gregor Leusch, Evgeny Matusov, & Hermann Ney: Complexity of finding the BLEU-optimal hypothesis in a confusion network.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.839-847. [PDF, 255KB]

(2008) Adam Lopez: Tera-scale translation models via pattern matching. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.505-512. [PDF, 86KB]

(2008) Adam Lopez: Translation by pattern matching. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 109 slides. [PDF, 174KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Marco Turchi, Tijl de Bie, & Nello Cristianini: Learning performance of a machine translation system: a statistical and computational analysis. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.35-43. [PDF, 108KB]

(2009) Marco Turchi, Tijl de Bie, & Nelo Cristianini: Learning to translate: a statistical and computational analysis. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 32 slides. [PDF of PPT, 336KB]

(2007) Marcello Federico & Mauro Cettolo: Efficient handling of n-gram language models for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 88-95 [PDF, 361KB]

(2006) Alexandra Birch, Chris Callison-Burch & Miles Osborne: Constraining the phrase-based, joint probability statistical translation model. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.10-18 [PDF, 244KB]

(2006) Alexandra Birch, Chris Callison-Burch, Miles Osborne, & Philipp Koehn: Constraining the phrase-based, joint probability statistical translation model.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 154-157 [PDF, 134KB]

(2006) Raghavendra Udupa U. & Hemanta K.Maji: Computational complexity of statistical machine translation. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.25-32 [PDF, 142KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

Conditional random fields

(2009) Eiji Aramaki & Takeshi Abekawa: Fast decoding and easy implementation: transliteration as sequential labeling. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.65-68. [PDF, 413KB]

(2009) Asif Ekbal & Sivajii Banyopadhyay: Voted NER system using appropriate unlabeled data. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.202-210. [PDF, 98KB]

(2009) Spence Green, Conal Sathi & Christopher D.Manning: NP subject detection in verb-initial Arabic clauses. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp.  [PDF, 729KB]

(2009) Matti Kääriäinen: Sinuhe – statistical machine translation using a globally trained conditional exponential family translation model. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1027-1036. [PDF, 159KB]

(2009) Mitesh M.Khapra & Pushpak Bhattacharyya: Improving transliteration accuracy using word-origin detection and lexicon lookup. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.84-87. [PDF, 114KB]

(2009) Sravana Reddy & Sonjia Waxmonsky: Substring-based transliteration with conditional random fields. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.92-95. [PDF, 90KB]

(2009) Praneeth Shishtla, Surya Ganesh V, Sethuramalingam Subramaniam, & Vasudeva Varma: A language-independent transliteration schema using character aligned models at NEWS 2009. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.40-43. [PDF, 114KB]

(2009) Dong Yang, Paul Dixon, Yi-Cheng Pan, Tasuku Oonishi, Masanobu Nakamura & Sadaoki Furui: Combining a two-step conditional random field and a joint source channel model for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.72-75. [PDF, 110KB]

Confusion networks (see also Combination systems, Lattices and networks, Multi-engine systems)

(2009) Boxing Chen, Min Zhang, Haizhou Li & Aiti Aw: A comparative study of hypothesis alignment and its improvement for machine translation system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.941-948. [PDF, 119KB]

(2009) Yang Feng, Yang Liu, Haitao Mi, Qun Liu, & Yajuan Lü: Lattice-based system combination for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1105-1113. [PDF, 161KB]

(2009) Xiaodong He & Kristina Toutanova: Joint optimization for machine translation system combination. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1202-1211. [PDF, 337KB]

(2009) Gregor Leusch, Evgeny Matusov, & Hermann Ney: The RWTH system combination system for WMT 2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.51-55. [PDF, 173KB]

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Zhifei Li & Sanjeev Khudanpur: Efficient extraction of oracle-best translations from hypergraphs.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.9-12. [PDF, 112KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Holger Schwenk, Loďc Barrault, Yannick Estčve, & Patrik Lambert: LIUM’s statistical machine translation system for IWSLT 2009.  IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 65-70. [PDF, 344KB]; poster [PDF, 117KB]

(2009) Yong Zhao & Xiaodong He: Using n-gram based features for machine translation system combination.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.205-208. [PDF, 357KB]

(2008) Necip Fazil Ayan, Jing Zheng, & Wen Wang: Improving alignments for better confusion networks for combining machine translation systems. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.33-40. [PDF, 228KB]

(2008) Boxing Chen, Min Zhang, Aiti Aw, & Haizhou Li: Regenerating hypotheses for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.105-112. [PDF, 99KB]

(2008) Damianos Karakos, Jason Eisner, Sanjeev Khudanpur, & Markus Dreyer: Machine translation system combination using ITG-based alignments. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 81-84. [PDF, 65KB]

(2008) Gregor Leusch, Evgeny Matusov, & Hermann Ney: Complexity of finding the BLEU-optimal hypothesis in a confusion network.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.839-847. [PDF, 255KB]

(2008) Antti-Veikko I. Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.183-186. [PDF, 61KB]

(2007) Nicola Bertoldi: Spoken language translation through confusion network decoding. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 32pp. [PDF of PPT presentation, 499KB]

(2007) Roldano Cattoni, Nicola Bertoldi, & Marcello Federico: Punctuating confusion networks for speech translation. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2453-2456; abstract [PDF, 23KB]

(2007) Daniele Falavigna, Nicola Bertoldi, Fabio Brugnara, Roldano Cattoni, Mauro Cettolo, Boxing Chen, Marcello Federico, Diego Giuliani, Roberto Gretter, Deepa Gupta, & Dino Seppi: The IRST English-Spanish translation system for European Parliament speeches. Interspeech 2007: 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007; pp.2833-2836; abstract [PDF, 23KB]

Customization and implementation

(2009) Julia Aymerich & Hermes Camelo: The machine translation maturity model at PAHO. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 403-409. [PDF, 44KB]

(2009) Donald A.DePalma: The business case for machine translation. [Tutorial at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26, 2009, Ottawa, Ontario, Canada; 27pp. [PDF of PPT presentation, 481KB]

(2009) Raymond Flournoy & Christine Duran: Machine translation and document localization at Adobe: from pilot to production. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 425-428. [PDF, 123KB]

(2009) Young-Pyo Kim: KIPO’s MT activity & IP5 mutual MT project. MT Summit XII: Third Workshop on Patent Translation, August 30, 2009, Ottawa, Ontario, Canada; 19pp. [PDF of PPT presentation, 1766KB]

(2009) Patrick Paul, Samuel Larkin, Ulrich Germann, Eric Joanis, & Roland Kuhn: PortageLive: delivering machine translation technology via virtualization. [Paper for technology showcase at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; 7pp.. [PDF, 77KB]

(2009) Johann Roturier: Controlled language for MT in action. Translingual Europe 2009, May 13-14, Prague, Czech Republic; 11pp. [PDF of PPT, 162KB]

(2009) Chris Wendt & Will Lewis: Pushing the quality of a customized SMT system using shared training data. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 450-456. [PDF of PPT presentation, 250KB]

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2008) Reginald Hobbs, Jamal Laoudi, & Clare R.Voss: MTriage: web-enabled software for the creation, machine translation, and annotation of smart documents.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 433KB]

(2008) Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Akira Terada, & Yasumori Abe: Application of resource-based machine translation to real business scenes. LREC 2008: 6th Language Reso urces and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 65KB]

(2008) Masaki Itagaki & Takako Aikawa: Post-MT term swapper: supplementing a statistical machine translation system with a user dictionary.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 5pp. [PDF, 352KB]

(2008) Mark Seligman & Mike Dillinger: Rapid portability among domains in an interactive spoken language translation system. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.40-47. [PDF, 129KB]

(2007) Jen Doyon: How to successfully integrate MT. Tutorial at MT Summit XI, 10 September 2007, Copenhagen, Denmark; abstract, 1p. [PDF, 88KB]

(2007) Hans-Udo Stadler & Ursula Peter-Spörndli: The quest for machine translation quality at CLS Communication. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.435-442 [PDF, 115KB]

(2006) Statistical machine translation gets real: the LanguageWeaver profile. Global by Design, December 2006; pp. 8-19 [PDF, 2115KB]

(2006) Donald A. DePalma: Evaluating MT prior to deployment.  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; presentation on User Track [PDF of PPT presentation, 226KB]

(2005) Mats Attnäs, Pierre Senellart, & Jean Senellart: Integration of SYSTRAN MT systems in an open workflow. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.211-218. [PDF, 1951KB]

(2005) Chengqing Zong & Mark Seligman: Toward practical spoken language translation [abstract]. Machine Translation 19 (2), 2005; pp.113-137.

Dynamic programming

(2009) Sittichai Jiampojamarn, Aditya Bhargava, Qing Dou, Kenneth Dwyer, & Grzegorz Kondrak: DirecTL: a language-independent approach to transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.28-31. [PDF, 128KB]

(2009) Felipe Sánchez-Martínez, Mikel L. Forcada, & Andy Way: Hybrid rule-based – example-based MT: feeding Apertium with sub-sentential translation units. Proceedings of the 3rd International Workshop on Example-Based Machine Translation, 12-13 November 2009, Dublin City University, Dublin, Ireland, ed. Mikel L. Forcada [and] Andy Way; pp.11-18. [PDF, 150KB]; presentation [PDF of PPT, 192KB]

(2008) Christoph Tillmann: A rule-driven dynamic programming decoder for statistical MT. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.37-45. [PDF, 152KB]

(2008) Richard Zens & Hermann Ney: Improvements in dynamic programming beam search for phrase-based statistical machine translation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.195-205 [PDF, 229KB]; presentation [PDF, 186KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2006) Jorge Civera & Alfons Juan: Mixtures of IBM Model 2. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.159-167 [PDF, 142KB]

(2006) Liang Huang, Kevin Knight & Aravind Joshi: Statistical syntax-directed translation with extended domain of locality. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.66-73 [PDF, 162KB]

(2006) Liang Huang, Kevin Knight, & Aravind Joshi: A syntax-directed translator with extended domain of locality. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.1-8. [PDF, 120KB]

(2006) Christoph Tillmann: Efficient dynamic programming search algorithms for phrase-based SMT. HLT-NAACL-2006: Computationally Hard Problems and Joint Inferenec in Speech and Language Processing. Proceedings of the workshop, 9 June 2006, New York City, NY, USA; pp.9-16. [PDF, 183KB]

(2005) Liang Huang, Hao Zhang, & Daniel Gildea: Machine translation as lexicalized parsing with hooks. IWPT 2005: Proceedings of the Ninth International Workshop on Parsing Technologies, Vancouver, October 2005; pp.65-73. [PDF, 130KB]

File interchange and exchange

(2008) Francis Bond, Seiji Okura, Yuji Yamamoto, Toshiki Murata, Kiyotaka Uchimoto, Michael Kato, Miwako Shimazu, & Tsugiyoshi Suzuki: Sharing user dictionaries across multiple systems with UTX-S (AAMT Sharing/Standardization Working Group).  AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp. 304-313. [PDF of PPT presentation, 180KB]

Finite-state methods (see also Statistical analysis)

(2009) Sergio Barrachina, Oliver Bender, Francisco Casacuberta, Jorge Civera, Elsa Cubel, Shahram Khadivi, Antonio Lagarda, Hermann Ney, Jesús Tomás, Enrique Vidal, & Juan-Miguel Vilar: Statistical approaches to computer-assisted translation. Computational Linguistics 35 (1), pp.3-28. [PDF, 252KB]

(2009) Michael Gasser: Semitic morphological analysis and generation using finite state transducers with feature structures. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.309-317. [PDF, 217KB]

(2009) Abu Zaher Md. Faridee & Francis M.Tyers: Development of a morphological analyser for Bengali. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 43-50. [PDF, 351KB]

(2009) Mikel L.Forcada, Francis M.Tyers, & Gema Ramírez-Sánchez: The Apertium machine translation platform: five years on. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 3-10. [PDF, 253KB]

(2009) Jorge González & Francisco Casacuberta: GREAT: a finite-state machine translation toolkit implementing a grammatical inference approach for transducer inference (GIATI). Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference, Athens, Greece, 30 March 2009; pp.24-32. [PDF, 125KB]

(2009) Gonzalo Iglesias, Adriŕ de Gispert, Eduardo R.Banga & William Byrne: Hierarchical phrase-based translation with weighted finite state transducers. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.433-441. [PDF, 170KB]

(2009) Martin Jansche & Richard Sproat: Named entity transcription with pair n-gram models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.32-35. [PDF, 113KB]

(2009) Selçuk Köprü & Jude Miller: A unification based approach to the morphological analysis and generation of Arabic. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 615KB]

(2009) Krister Lindén: Guessers for finite-state transducer lexicons [abstract]. CICLING 2009: 10th International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City, Mexico, March 1-7, 2009; 1p. [PDF, 103KB]

(2009) Abbas Malik, Laurent Besacier, Christian Boitet, & Pushpak Bhattacharyya: A hybrid model for Urdu Hindi transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.177-185. [PDF, 377KB]

(2009) Peter Nabende: Transliteration system using pair HMM with weighted FSTs. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.100-103. [PDF, 127KB]

(2009) Balakrishnan Vardarajan & Delip Rao: Î-extension Hidden Markov Models and weighted transducers for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.120-123. [PDF, 129KB]

(2008) Graeme Blackwood, Adriŕ de Gispert, Jamie Brunning, & William Byrne: Large-scale statistical machine translation with weighted finite state transducers. FSMNLP 2008: Seventh International Workshop on Finite-State Methods and Natural Language Processing, Ispra, Italy, 11-12 September 2009; 12pp. [PDF, 361KB]

(2008) Graeme Blackwood, Adriŕ de Gispert, Jamie Brunning, & William Byrne: European language translation with weighted finite string transducers: the CUED MT system for the 2008 ACL workshop on SMT. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.131-134. [PDF, 77KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Christopher Dyer, Smaranda Muresan, & Philip Resnik: Generalizing word lattice translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1012-1020 . [PDF, 201KB]

(2008) Jorge González & Francisco Casacuberta: A finite-state framework for log-linear models in machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.41-46. [PDF, 514KB]

(2008) Jorge González & Francisco Casacuberta: Linguistic categorisation in machine translation using stochastic finite state transducers. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 55-62. [PDF, 321KB]

(2008) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Hindi Urdu machine transliteration using finite-state transducers.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.537-544. [PDF,774KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Macherey: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2007) Vicente Alabau, Alberto Sanchis, & Francisco Casacuberta: Using word posterior probabilities in lattice translation. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 94KB]; presentation [PDF, 141KB]

(2007) Jacek Gintrowicz & Krzysztof Jassem: Using regular expressions in translation memories. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.2, October 15-17, 2007, Wisla, Poland; pp.87-92. [PDF, 200KB]

(2007) Alicia Pérez, Víctor Guijarrubia, Raquel Justo, M. Inés Torres, & Francisco Casacuberta: A comparison of linguistically and statistically enhanced models for speech-to-speech machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 1234KB]; presentation [PDF, 2546KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: Speech-input multi-target machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 56-63 [PDF, 208KB]

(2007) Alicia Pérez, M.Inés Torres, M.Teresa González, & Francisco Casacuberta: An integrated architecture for speech-input multi-target machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.133-136 [PDF, 130KB]

(2007) Tarek Sherif & Grzegorz Kondrak: Substring-based transliteration. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 944-951 [PDF, 152KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

(2006) Mohammed A. Attia: An ambiguity-controlled morphological analyzer for modern standard Arabic modelling finite state networks.  The Challenge of Arabic for NLP/MT. International conference at the British Computer Society, London, 23 October 2006; pp.48-67. [PDF, 290KB]

(2006) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Finite-state transducer-based statistical machine translation using joint probabilities. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 16-22 [PDF,179 KB]

(2006) Jorge Civera, Antonio L. Lagarda, Elsa Cubel, Francisco Casacuberta, Enrique Vidal, Juan M. Vilar, & Sergio Barrachina: A computer-assisted translation tool based on finite-state technology. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.33-40 [PDF, 139KB]

(2006) Shahram Khadivi, Richard Zens, & Hermann Ney: Integration of speech to computer-assisted translation using finite-state automata. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.467-474. [PDF, 348KB]

(2006) José B.Marińo, Rafael E.Banchs, Josep M.Crego, Adriŕ de Gispert, Patrik Lambert, José A.R.Fonollosa, & Marta R.Costa-jussŕ: N-gram-based machine translation.  Computational Linguistics 32 (4), pp. 528-549. [PDF, 277KB]

(2006) Charles Schafer: Novel probabilistic finite-state transducers for cognate and transliteration modeling. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.203-212 [PDF, 250KB]

(2005) Stephan Kanthak, David Vilar, Evgeny Matusov, Richard Zens, & Hermann Ney: Novel reordering approaches in phrase-based statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 167-174. [PDF, 172KB]

(2005) Shankar Kumar & William Byrne: Local phrase reordering models for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 161-168. [PDF, 193KB]

Graphs

(2009) Beate Dorow, Florian Laws, Lukas Michelbacher, Christian Scheible, & Jason Utt: A graph-theoretic algorithm for automatic extension of translation lexicons. Proceedings of the EACL 2009 Workshop on GEMS: Geometrical Models of Natural Language Semantics, Athens, Greece, 31 March 2009; pp.91-95. [PDF, 81KB]

(2008) Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jorge Civera, Francisco Casacuberta, Enrique Vidal, & Hieu Hoang: Improving interactive machine translation via mouse actions. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.485-494. [PDF, 148KB]

(2005) Richard Zens & Hermann Ney: Word graphs for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 191-198. [PDF, 178KB]

Hidden Markov models

 (2009) Jesús Andrés-Ferrer & Alfons Juan: A phrase-based hidden semi-Markov approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.168-175. [PDF, 410KB]

(2009) Jamie Brunning, Adriŕ de Gispert & William Byrne: Context-dependent alignment models for statistical machine translation.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.110-118. [PDF, 226KB]

(2009) Chi-Ho Li, Xiaodong He, Yupeng Liu, & Ning Xi: Incremental HMM alignment for MT system combination. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.949-957. [PDF, 559KB]

(2009) Peter Nabende: Transliteration system using pair HMM with weighted FSTs. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.100-103. [PDF, 127KB]

(2009) Zaid Md Abdul Wahab Sheikh & Felipe Sánchez-Martínez: A trigram part-of-speech tagger for the Apertium free/open-source machine translation platform. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortíz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 67-74. [PDF, 227KB]

(2009) Balakrishnan Vardarajan & Delip Rao: Î-extension Hidden Markov Models and weighted transducers for machine transliteration. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.120-123. [PDF, 129KB]

(2009) Zdeněk Žabokrtský & Martin Popel: Hidden Markov tree model in dependency-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.145-148. [PDF, 154KB]

(2009) Yilu Zhou: Maximum n-gram HMM-based name transliteration: experiment in NEWS 2009 on English-Chinese corpus. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.128-131. [PDF, 244KB]

(2008) Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, & Robert Moore: Indirect HMM-based hypothesis alignment for combining outputs from machine translation systems.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.98-107. [PDF, 280KB]

(2007) Jesús Andrés-Ferrer & Alfons Juan-Císcar: A phrase-based hidden Markov model approach to machine translation.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 6pp. [PDF, 76KB]

(2007) John DeNero & Dan Klein: Tailoring word alignments to syntactic machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 17-24 [PDF, 415KB]

(2007) Xiaodong He: Using word dependent transition models in HMM based word alignment for statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 80-87 [PDF, 139KB]

(2007) Bing Zhao, Nguyen Bach, Ian Lane & Stephan Vogel: A log-linear block transliteration model based on bi-stream HMMs. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.364-371 [PDF, 453KB]

(2005) Yonggang Deng & William Byrne: HMM word and phrase alignment for statistical machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 169-176. [PDF, 189KB]

(2005) Adam Lopez & Philip Resnik: Improved HMM alignment models for languages with scarce resources.  ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 83-86. [PDF, 103KB]

(2005) Bing Zhao, Eric P.Xing, & Alex Waibel: Bilingual word spectral clustering for statistical machine translation. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 25-32. [PDF, 467KB]

Information theory and MT

(2006) Paul McNamee & James Mayfield: Translation of multiword expressions using parallel suffix arrays.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.100-109 [PDF, 733KB]

Lattices and networks

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Yang Feng, Yang Liu, Haitao Mi, Qun Liu, & Yajuan Lü: Lattice-based system combination for statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1105-1113. [PDF, 161KB]

(2009) Kevin Gimpel & Noah A.Smith: Feature-rich translation by quasi-synchronous lattice parsing. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.219-228. [PDF, 329KB]

(2009) Gonzalo Iglesias, Adriŕ de Gispert, Eduardo R.Banga & William Byrne: Hierarchical phrase-based translation with weighted finite state transducers. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.433-441. [PDF, 170KB]

(2009) Maxim Khalilov, José A.R.Fonollosa, & Mark Dras: Compiling hierarchical word reordering and decoding in phrase-based statistical machine translation.  Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.78-86. [PDF, 1161KB]

(2009) Selçuk Köprü & Adnan Yazıcı: Lattice parsing to integrate speech recognition and rule-based machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.469-477. [PDF, 185KB]

(2009) Shankar Kumar, Wolfgang Macherey, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Zhifei Li & Jason Eisner: First- and second-order expectation semirings with applications to minimum-risk training on translation forests. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.40-51. [PDF, 406KB]

(2009) Yanjun Ma, Tsuyoshi Okita, Özlem Çetinoğlu, Jinhua Du, & Andy Way: Low-resource machine translation using MaTrEx: the DCU machine translation system for IWSLT 2009. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 29-36. [PDF, 305KB]; presentation [PDF of PPT, 466KB]

(2009) Peter Nabende: Evaluation of dynamic Bayesian network models for entitiy name transliteration. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; pp.99-100. [PDF, 151KB]

(2009) Josh Schroeder, Trevor Cohn, & Philipp Koehn: Word lattices for multi-source translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.719-727. [PDF, 214KB]

(2008) Steve DeNeefe, Ulf Hermjakob & Kevin Knight: Overcoming vocabulary sparsity in MT using lattices. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.89-96. [PDF, 492KB]

(2008) Christopher Dyer, Smaranda Muresan, & Philip Resnik: Generalizing word lattice translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 1012-1020 . [PDF, 201KB]

(2008) Wolfgang Macherey, Franz Josef Och, Ignacio Thayer, & Jakob Uszkoreit: Lattice-based minimum error rate training for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.725-734. [PDF, 304KB]

(2008) Alexandre Patry & Philippe Langlais: MISTRAL: a statistical machine translation decoder for speech recognition lattices.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 265KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Macherey: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2007) Vicente Alabau, Alberto Sanchis, & Francisco Casacuberta: Improving speech-to-speech translation using word posterior probabilities. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.9-14 [PDF, 87KB]

(2007) Vicente Alabau, Alberto Sanchis, & Francisco Casacuberta: Using word posterior probabilities in lattice translation. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 6pp. [PDF, 94KB]; presentation [PDF, 141KB]

(2007) L. Besacier, A.Mahdhaoui, & V-B.Le: The LIG Arabic/English speech translation system at IWSLT07. IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 5pp. [PDF, 305KB]; presentation [PDF, 313KB]

(2007) Alexandre Patry, Philippe Langlais, & Frédéric Béchet: MISTRAL: a lattice translation system for IWSLT 2007.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 5pp. [PDF, 120KB]; presentation [PDF, 491KB]

(2007) Antti-Veikko I. Rosti, Spyros Matsoukas, & Richard Schwartz: Improved word-level system combination for machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 312-319 [PDF, 135KB]

(2007) Kay Rottmann & Stephan Vogel: Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180 [PDF, 3725KB]; presentation [PDF, 1129KB]

(2007) Holger Schwenk, Marta R.Costa-jussŕ, & José A.R.Fonollosa: Smooth bilingual n-gram translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 218KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Improved chunk-level reordering for statistical machine translation.  IWSLT 2007: International Workshop on Spoken Language Translation, 15-16 October 2007, Trento, Italy. 8pp. [PDF, 255KB]; presentation [PDF, 1644KB]

(2007) Yuqi Zhang, Richard Zens, & Hermann Ney: Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8 [PDF, 165KB]

(2006) Evgeny Matusov, Nicola Ueffing, & Hermann Ney: Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Trento, Italy, April 3-7, 2006; pp.33-40 [PDF, 91KB]

Learning and training

(2009) proceedings of Statistical Multilingual Analysis for Retrieval and Translation. Workshop at EACL 2009, Barcelona, Spain, 13 May 2009.

(2009) Andrei Alexandrescu & Katrin Kirchhoff: Graph-based learning for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.119-127. [PDF, 148KB]

(2009) Vamshi Ambati & Jaime Carbonell: Proactive learning for building machine translation systems for minority languages. NAACL-HLT-2009: Active Learning for Natural Language Processing (ALNLP-09), Proceedings of the workshop, June 5, 2009, Boulder, Colorado; pp.58-61. [PDF, 101KB]

(2009) Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez, & Philipp Koehn: Monte Carlo inference and maximization for phrase-based translation. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.102-110. [PDF, 167KB]

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) Delphine Bernhard & Iryna Gurevych: Combining lexical semantic resources with question & answer archives for translation-based answer finding. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.728-736. [PDF, 145KB]

(2009) Nicola Bertoldi, Barry Haddow, and Jean-Baptiste Fouet: Improved minimum error rate training in Moses. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.7-16. [PDF, 194KB] [presentation at MT Marathon 2009, 120KB]

(2009) Ming-Wei Chang, Dan Goldwasser, Dan Roth, & Yuancheng Tu: Unsupervised constraint driven learning for transliteration discovery. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.299-307. [PDF, 580KB]

(2009) Yonggang Deng & Bowen Zhou: Optimizing word alignment combination for phrase table training. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.229-232. [PDF, 132KB]

(2009) Thomas Deselaers, Saša Hasan, Oliver Bender, & Hermann Ney: A deep learning approach to machine transliteration.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.233-241. [PDF, 201KB]

(2009) George Foster & Roland Kuhn: Stabilizing minimum error rate training. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.242-249. [PDF, 126KB]

(2009) Jorge González & Francisco Casacuberta: GREAT: a finite-state machine translation toolkit implementing a grammatical inference approach for transducer inference (GIATI). Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference, Athens, Greece, 30 March 2009; pp.24-32. [PDF, 125KB]

(2009) Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. Cambridge, Mass.: The MIT Press, 2009; xii, 316pp.

(2009) Yvette Graham & Josef van Genabith: An open source rule induction tool for transfer-based SMT. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.37-46. [PDF, 321KB]

(2009) Gholamreza Haffari & Anoop Sarkar: Active learning for multlingual statistical machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.181-189. [PDF, 243KB]

(2009) Gholamreza Haffari, Maxim Roy, & Anoop Sarkar: Active learning for statistical phrase-based machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.415-423. [PDF, 190KB]

(2009) Xiwu Han, Hanzhang Li, & Tiejun Zhao: Train the machine with what it can learn – corpus selection for SMT. [ACL-IJCNLP-2009] Proceedings of the 2nd Workshop on Building and Using Comparable Corpora, Suntec, Singapore, 6 August 2009; pp.27-33. [PDF, 184KB]

(2009) Katsuhiko Hayashi, Taro Watanabe, Hajime Tsukada, & Hideki Isozaki: Structural support vector machines for log-linear approach in statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 144-151. [PDF, 352KB]; presentation [PDF of PPT, 220KB]

(2009) Yifan He & Andy Way: Learning labeled dependencies in machine translation evaluation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.44-51. [PDF, 426KB]

(2009) Long Jiang, Shiquan Yang, Ming Zhou, Xiaohua Liu, & Qingsheng Zhu: Mining bilingual data from the web with adaptively learnt patterns. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.870-878. [PDF, 306KB]

(2009) Amit Kirschenbaum & Shuly Wintner: Lightly supervised transliteration for machine translation.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.433-441. [PDF, 147KB]

(2009) Philipp Koehn: Discriminative training and factored translation models. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.69-81 [PDF, 75 slides, 530KB]

(2009) Philippe Langlais, François Yvon & Pierre Zweigenbaum: Improvements in analogical learning: application to translating multi-terms of the medical domain.  EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.487-495. [PDF, 172KB]

(2009) Zhifei Li: Training and inference methods over translation forests [abstract]. CWMT 2009: the 5th China Workshop on Machine Translation, Nanjing, China, October 16-17, 2009; 1p. [PDF, 14KB]

(2009) Zhifei Li & Jason Eisner: First- and second-order expectation semirings with applications to minimum-risk training on translation forests. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.40-51. [PDF, 406KB]

(2009) Spyros Matsoukas, Antti-Veikko I.Rosti, & Bing Zhang: Discriminative corpus weight estimation for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.708-717. [PDF, 116KB]

(2009) Mausam, Stephen Soderland, Oren Etzioni, Daniel S.Weld, Michael Skinner & Jeff Bilmes: Compiling a massive, multilingual dictionary via probabilistic inference. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.262-270. [PDF, 704KB}

(2009) Behrang Mohit, Frank Liberato, & Rebecca Hwa: Language model adaptation for difficult to translate phrases. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.160-167. [PDF, 343KB]

(2009) Sylwia Ozdowska & Andy Way: Optimal bilingual data for French-English PB-SMT. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.96-103. [PDF, 224KB]

(2009) Adam Pauls, John DeNero, & Dan Klein: Consensus training for consensus decoding in machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1418-1427. [PDF, 502KB]

(2009) Kuniko Saito & Kenji Imamura: Tag confidence measure for semi-automatically updating. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.168-176. [PDF, 204KB]

(2009) Khalil Sima’an: Well-behaved nonparametric statistical learning for parsing and translation [abstract]. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; p.65. [PDF, 139KB]

(2009) Sebastian Stüker, Laurent Besacier, & Alex Waibel: Human translations guided language discovery for ASR systems. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Roy Tromble & Jason Eisner: Learning linear ordering problems for better translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1007-1016. [PDF, 178KB]

(2009) Francis M.Tyers: Rule-based augmentation of training data in Breton-French statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.213-217. [PDF , 227KB]

(2009) Xiaojun Wan: Co-training for cross-lingual sentiment classification. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.235-243. [PDF, 273KB]

(2009) Chris Wendt & Will Lewis: Pushing the quality of a customized SMT system using shared training data. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 450-456. [PDF of PPT presentation, 250KB]

(2009) Omar F. Zaidan: Z-MERT: a fully configurable open source tool for minimum error rate training of machine translation systems. Prague Bulletin of Mathematical Linguistics 91, 2009; pp.79-88. [PDF, 263KB]

(2009) Dmitry Zelenko: Combining MDL transliteration training with discriminative modeling. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.116-119. [PDF, 92KB]

(2009) Yuejie Zhang, Yang Wang, & Xiangyang Xue: English-Chinese bi-directional OOV translation based on web mining and supervised learning. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.129-132. [PDF, 325KB]

(2008) Joshua S.Albrecht & Rebecca Hwa: Regression for machine translation evaluation at the sentence level [abstract].  Machine Translation 22 (1/2), March-June 2008; pp.1-27.

(2008) Sivaji Bandyopadhyay: Multilingual named entity recognition. IJCNLP 2008: Workshop on NER for South and South East Asian Languages, Proceedings of the workshop, 12 January 2008, Hyderabad, India; pp.3-4. [PDF, 37KB]

(2008) Nicola Bertoldi: A tutorial on the IRSTLM library. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 34 slides. [PDF, 401KB]

(2008) Phil Blunsom, Trevor Cohn, & Miles Osborne: A discriminative latent variable model for statistical machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 200-208. [PDF, 210KB]

(2008) Phil Blunsom & Miles Osborne: Probabilistic inference for machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.215-223. [PDF, 283KB]

(2008) Francis Bond, Eric Nichols, Darren Scott Appling, & Michael Paul: Improving statistical machine translation by paraphrasing the training data. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.150-157 [PDF, 132KB]; presentation [PDF,189KB]

(2008) Nicola Cancedda: Shaping research from user requirements, and other exotic things. [Abstract of] Invited talk at: EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; p.4. [PDF, 449KB]

(2008) Michael Carl: Using log-linear models for selecting best machine translation output. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 255KB]

(2008) Daniel Cer, Daniel Jurafsky, & Christopher D. Manning: Regularization and search for minimum error rate training. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.26-34. [PDF, 712KB]

(2008) Boxing Chen, Min Zhang, Aiti Aw, & Haizhou Li: Exploiting n-best hypotheses for SMT self-enhancement. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 157-160. [PDF, 96KB]

(2008) Yu Chen, Andreas Eisele, & Martin Kay: Improving statistical machine translation efficiency by triangulation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 122KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) Jonathan H. Clark, Robert Frederking, & Lori Levin: Toward active learning in data selection: automatic discovery of language features during elicitation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 141KB]

(2008) Vincent Claveau: Automatic translation of biomedical terms by supervised machine learning.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 202KB]

(2008) Marta R.Costa-jussŕ, José A.R.Fonollosa, & Enric Monte: Using reordering in statistical machine translation based on recursive alignment block classification.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 96KB]

(2008) Kevin Duh & Katrin Kirchhoff: Beyond log-linear models: boosted minimum error rate training for n-best re-ranking. ACL 2008 HLT Short Papers, June 2008, Columbus, Ohio; pp.37-40. [PDF, 102KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Mitigation of data sparsity in classifier-based translation. Coling 2008: Proceedings of the Workshop on Speech Processing for Safety Critical Translation and Pervasive Applications, 23 August 2008, Manchester, UK; pp.1-4. [PDF, 138KB]

(2008) Emil Ettelaie, Panayiotis G.Georgiou, & Shrikanth S.Narayanan: Towards unsupervised training of the classifier-based speech translator. Interspeech 2008: 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008; pp.2739-2742; abstract [PDF, 58KB]

(2008)  Nizar Habash & Hayden Metsky: Automatic learning of morphological variations for handling out-of-vocabulary terms in Urdu-English machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.107-116. [PDF, 551KB]

(2008) Takeshi Ito, Tomoyosi Akiba, & Katunobu Itou: Effect of the topic dependent translation models for patent translation – experiment at NTCIR-7. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 425-429. [PDF, 365KB]

(2008) Jin-Shea Kuo & Haizhou Li: Multi-view co-training of transliteration model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.373-380. [PDF, 465KB]

(2008) Jin-Shea Kuo, Haizhou Li, & Chih-Lung Lin: Mining transliterations from web query results: an incremental approach. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.16-23. [PDF, 400KB]

(2008) Antonio L.Lagarda & Francisco Casacuberta: Applying boosting to statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.88-96. [PDF, 631KB]

(2008) Caroline Lavecchia, David Langlois, & Kamel Smaili: Phrase-based machine translation based on simulated annealing. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 96KB]

(2008) Zhifei Li & Sanjeev Khudanpur: Large-scale discriminative n-gram language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.133-142. [PDF, 643KB]

(2008) Wolfgang Macherey, Franz Josef Och, Ignacio Thayer, & Jakob Uszkoreit: Lattice-based minimum error rate training for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.725-734. [PDF, 304KB]

(2008) Robert C.Moore & Chris Quirk: Random restarts in minimum error rate training for statistical machine translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.585-592. [PDF, 552KB]

(2008) Jin’ichi Murakami, Masato Tokuhisa, & Satoru Ikehara: Statistical machine translation with long phrase table and without long parallel sentences. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 454-461. [PDF, 538KB]

(2008) Jin’ichi Murakami, Masato Tokuhisa, & Satoru Ikehara: Statistical machine translation without long parallel sentences for training data. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.132-137 [PDF, 108KB]

(2008) Jong-Hoon Oh & Hitoshi Isahara: Hypothesis selection in machine transliteration: a web mining approach. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.233-240. [PDF, 461KB]

(2008) Holger Schwenk: Investigations on large-scale lightly-supervised training for statistical machine translation. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.182-189 [PDF, 141KB]; presentation [PDF, 182KB]

(2008) Holger Schwenk: Smoothing and data selection in large SMT systems. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 30 slides. [PDF, 276KB]

(2008) Holger Schwenk & Philipp Koehn: Large and diverse language models for statistical machine translation. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.661-666. [PDF, 364KB]

(2008) Lei Shi & Ming Zhou: Improved sentence alignment on parallel web pages using a stochastic tree alignment model.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.505-513. [PDF, 483KB]

(2008) Matthew Snover, Bonnie Dorr, & Richard Schwartz: Language and translation model adaptation using comparable corpora. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.857-866. [PDF, 132KB]

(2008) Anders Sřgaard: Learning context-sensitive synchronous rules. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.170-175. [PDF, 525KB]

(2008) Suqi Sun, Yin Chen & Jufeng Li: A re-examination on features in regression based approach to automatic MT evaluation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 25-30. [PDF, 234KB]

(2008) Marco Turchi, Tijl de Bie, & Nello Cristianini: Learning performance of a machine translation system: a statistical and computational analysis. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.35-43. [PDF, 108KB]

(2008) Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201. [PDF, 614KB]

(2008) M.S.Vijaya, G.Shivapratap, V.Dhanalakshmi, V.P. Ajith, & K.P.Soman: Sequence labeling approach for English to Tamil transliteration using memory-based learning.  ICON-2008: 6th International Conference on Natural Language Processing, CDAC Pune, India, 20-22 December 2008; 7pp. [PDF, 195KB]

(2008) Keiji Yasuda, Ruiqiang Zhang, Hirofumi Yamamoto, & Eiichiro Sumita: Method of selecting training data to build a compact and efficient translation model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.655-660. [PDF, 447KB]

(2008) Bing Zhao & Yaser Al-Onaizan: Generalizing local and non-local word-reordering patterns for syntax-based machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581. [PDF, 538KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Joshua S. Albrecht & Rebecca Hwa: Regression for sentence-level MT evaluation with pseudo references. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 296-303 [PDF, 98KB]

(2007) Joshua S. Albrecht & Rebecca Hwa: A re-examination of machine learning approaches for sentence-level MT evaluation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 880-887 [PDF, 126KB]

(2007) Abhishek Arun: Discriminative training for phrase-based machine translation. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 31pp. [PDF of PPT presentation, 544KB]

(2007) Abhishek Arun & Philipp Koehn: Online learning methods for discriminative training of phrase based statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.15-20 [PDF, 133KB]

(2007) Thorsten Brants, Ashok C.Popat, Peng Xu, Franz J.Och, & Jeffrey Dean: Large language models in machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 858-857. [PDF, 297KB]

(2007) Colin Cherry & Dekang Lin: Inversion transduction grammar for joint phrasal translation modeling. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.17-24 [PDF, 178KB]

(2007) Ariadna Font Llitjós, Jaime Carbonell, & Alon Lavie: Improving transfer-based MT systems with automatic refinements. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.183-190 [PDF, 339KB]

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

(2007) Pascale Fung, Zhaojun Wu, Yongsheng Yang, & Dekai Wu: Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.75-84 [PDF, 477KB]

(2007) Jesús Giménez & Lluís Mŕrquez: Context-aware discriminative phrase selection for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 159-166 [PDF, 1046KB]

(2007) Philippe Langlais & Alexandre Patry: Translating unknown words using analogical learning. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 877-886. [PDF, 155KB]

(2007) John Lee, Ming Zhou, & Xiaohua Liu: Detection of non-native sentences using machine-translated training data. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.93-96 [PDF, 188KB]

(2007) Ding Liu & Daniel Gildea: Source-language features and maximum correlation training for machine translation evaluation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.41-48 [PDF, 201KB]

(2007) Yajuan Lü, Jin Huang & Qun Liu: Improving statistical machine translation performance by training data selection and optimization. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 343-350. [PDF, 235KB]

(2007) Robert C.Moore & Chris Quirk: An iteratively-trained segmentation-free phrase translation model for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 112-119 [PDF, 137KB]

(2007) Patrick Nguyen, Milind Mahajan, & Xiaodong He: Training non-parametric features for statistical machine translation. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 72-79 [PDF, 169KB]

(2007) Lucia Specia, Mark Stevenson, & Maria das Graças V.Nunes: Learning expressive models for word sense disambiguation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 41-48 [PDF, 99KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2007) Nicola Ueffing, Gholamreza Haffari, & Anoop Sarker: Semi-supervised model adaptation for statistical machine translation [abstract]. Machine Translation 21 (2), June 2007; pp.77-94.

(2007) Nicola Ueffing, Gholamreza Haffari, & Anoop Sarkar: Transductive learning for statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 25-32 [PDF, 309KB]

(2007) Vincent Vandeghinste: Removing the distinction between a translation memory, a bilingual dictionary and a parallel corpus. Translating and the Computer 29. Proceedings of the twenty-ninth international conference on Translating and the Computer, 29-30 November 2007 (London: Aslib, 2007); 21pp. [PDF, 90KB]

(2007) Sriram Venkatapathy & Aravind K.Joshi: Discriminative word alignment by learning the alignment structure and syntactic divergence between a language pair. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.49-56 [PDF, 172KB]

(2007) Zhuoran Wang, John Shawe-Taylor & Sandor Szedmak: Kernel regression based machine translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; Companion volume, pp.185-188 [PDF, 134KB]

(2007) Taro Watanabe, Jun Suzuki, Hajime Tsukada, & Hideki Isozaki: Online large-margin training for statistical machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 764-773. [PDF, 199KB]

(2007) Richard Zens, Saša Hasan, & Hermann Ney: A systematic comparison of training criteria for statistical machine translation. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 524-532. [PDF, 186KB]

(2006) Giuseppe Attardi: Experiments with a multilanguage non-projective dependency parser. CoNLL-X: Proceedings of the 10th Conference on Computational Natural Language Learning, New York City, June 2006; pp.166-170. [PDF, 104KB]

(2006) Jaime Carbonell, Steve Klein, David Miller, Michael Steinbaum, Tomer Grassiany, & Jochen Frei: Context-based machine translation. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.19-28 [PDF, 338KB]

(2006) Colin Cherry & Dekang Lin: Soft syntactic constraints for word alignment through discriminative training. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.105-112. [PDF, 153KB]

(2006) Alexander Fraser & Daniel Marcu: Semi-supervised training for statistical word alignment. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.769-776. [PDF, 200KB]

(2006) Pascale Fung & Benfeng Chen: Robust word sense translation by EM learning of frame semantics. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.239-246. [PDF, 223KB]

(2006) Michel Galley, Jonathan Graehl, Kevin Knight, Daniel Marcu, Steve DeNeefe, Wei Wang, & Ignacio Thayer: Scalable inference and training of context-rich syntatic translation models.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.961-968. [PDF, 438KB]

(2006) Rebecca Hwa, Carol Nichols & Khalil Sima’an: Corpus variations for translation lexicon induction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.74-81 [PDF, 178KB]

(2006) David Kauchak: Contributions to research on machine translation. PhD thesis, University of California, San Diego, 2006. xiv,92pp. [PDF, 491KB]

(2006) Jin-Shea Kuo, Haizhou Li, & Ying-Kuei Yang: Learning transliteration lexicons from the Web. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1129-1136. [PDF, 428KB]

(2006) Percy Liang, Alexandre Bouchard-Côté, Dan Klein, & Ben Taskar: An end-to-end discriminative approach to machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.761-768. [PDF, 136KB]

(2006) Nicolas Stroppa: [review of] Walter Daelemans and Antal van den Bosch, Memory-based language processing. Machine Translation 20 (2),2006; pp.143-145.  [see publication]

(2006) Christoph Tillmann & Tong Zhang: A discriminative global training algorithm for statistical MT. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.721-728. [PDF, 179KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

(2006) Nicola Ueffing: Using monolingual source-language data to improve MT performance. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 174-181 [PDF, 103KB]

(2006) Benjamin Wellington, Joseph Turian, Chris Pike & I.Dan Melamed: Scalable purely-discriminative training for word and tree transducers. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.251-260 [PDF, 176KB]

(2006) Hua Wu, Haifeng Wang, & Zhanyi Liu: Boosting statistical word alignment using labeled and unlabeled data. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.913-920. [PDF, 97KB]

(2005) Necip Fazil Ayan, Bonnie J.Dorr, & Christof Monz: Alignment link projection using transformation-based learning.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 185-192. [PDF, 132KB]

(2005) Ilyas Cicekli: Learning translation templates with type constraints MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.27-33. [PDF, 356KB]

(2005) Matthias Eck, Stephan Vogel, & Alex Waibel: Low cost portability for statistical machine translation based on n-gram coverage. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.227-234. [PDF, 199KB]

(2005) Hemali Majithia, Philip Rennart & Evelyne Tzoukermann: Rapid ramp-up for statistical machine translation: minimal training for maximal coverage . MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.438-444. [PDF, 160KB]

(2005) Seong-Bae Park, Jeong-Woo Son, & Yoon-Shik Tae: Augmentation of modality translation rules in Korean-to-English machine translation by rule learning. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.93-100. [PDF, 2045KB]

(2005) Michael Paul, Eiichiro Sumita, & Seiichi Yamamoto: A machine learning approach to hypotheses selection of greedy decoding for SMT MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Second Workshop on Example-Based Machine Translation; pp.117-124. [PDF, 424KB]

(2005) Hendra Setiawan, Haizhou Li, & Min Zhang: Learning phrase translation using level of detail approach. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.243-250. [PDF, 250KB]

(2005) Hirokazu Suzuki & Akira Kumano: Learning translations from monolingual corpora. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.33-40. [PDF, 439KB]

(2005) Christoph Tillmann & Tong Zhang: A localized prediction model for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 557-564. [PDF, 189KB]

(2005) Daniel J. Walker: The Open A.I. KitTM: general machine learning modules from statistical machine translation MT Summit X, Phuket, Thailand, September 16, 2005, Proceedings of Workshop on Open Source Machine Translation; pp.1-6. [PDF, 162KB]

(2005) Wu Hua & Wang Haifeng: Boosting statistical word alignment. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.313-320. [PDF, 393KB]

(2005) Ariadna Font Llitjós, Jaime G. Carbonell, & Alon Lavie: A framework for interactive and automatic refinement of transfer-based machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 87-96. [PDF, 468KB]

(2005) Ebba Gustavii: Target language preposition selection - an experiment with transformation based learning and aligned bilingual data. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 112-118. [PDF, 45KB]

(2005) Ashish Venugopal & Stephan Vogel: Considerations in maximum mutual information and minimum classification error training for statistical machine translation. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 271-279. [PDF, 89KB]

(2005) Bing Zhao & Alex Waibel: Learning a log-linear model with bilingual phrase-pair features for statistical machine translation. IJCNLP-05: Fourth SIGHAN  Workshop on Chinese Language Processing. Proceedings of the workshop, 14-15 October 2005, Jeju Island, Korea; pp. 79-86. [PDF, 1301KB]

Margin induced relaxed algorithm

(2009) Kedar Bellare, Koby Crammer, & Dayne Freitag: Loss-sensitive discriminative training of machine transliteration models. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Proceedings of the Student Research Workshop and Doctoral Consortium, Boulder, Colorado, June 1, 2009;  pp.61-65. [PDF, 117KB]

(2009) David Chiang, Kevin Knight, & Wei Wang: 11,001 new features for statistical machine translation. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.218-226. [PDF, 114KB]

(2009) Zuoran Wang, John Shawe-Taylor, & Sándor Szedmák: Large-margin structured prediction via linear programming. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 89 slides. [PDF of PPT, 950KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2007) Abhishek Arun & Philipp Koehn: Online learning methods for discriminative training of phrase based statistical machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.15-20 [PDF, 133KB]

Maximum entropy methods

(2009) Chris Dyer: Using maximum entropy model to build segmentation lattices for MT.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.406-414. [PDF, 173KB]

(2009) Asif Ekbal & Sivajii Banyopadhyay: Voted NER system using appropriate unlabeled data. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.202-210. [PDF, 98KB]

(2009) Vinh Van Nguyen, Akira Shimazu, Minh Le Nguyen & Thai Phuong Nguyen: Improving a lexicalized hierarchical reordering model using maximum entropy. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 89-96. [PDF, 172KB]

(2009) Sirvan Yahyaei & Christof Monz: Decoding by dynamic chunking for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.160-167. [PDF, 127KB]

(2008) David Burkett & Dan Klein: Two languages are better than one (for syntactic parsing).  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.877-886. [PDF, 405KB]

 (2008) Andrew Finch & Eiichiro Sumita: Dynamic model interpolation for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.208-215. [PDF, 428KB]

(2008) Zhongjun He, Qun Liu, & Shouxun Lin: Improving statistical machine translation using lexicalized rule selection.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.321-328. [PDF, 134KB]

(2008) Qun Liu, Zhongjun He, Yang Liu, & Shouxun Lin: Maximum entropy based rule selection model for syntax-based statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.89-97. [PDF, 916KB]

(2007) Abraham Ittycheriah & Salim Roukos: Direct translation model 2. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.57-64 [PDF, 238KB]

(2007) Long Jiang, Ming Zhou, Lee-Feng Chien & Cheng Niu: Named entity translation with web mining and transliteration. IJCAI-07: Twentieth International Joint conference on Artificial Intelligence, Hyderabad, India, 6-12 January 2007; pp.1629-1634. [PDF, 491KB]

(2007) Yang Ye, Karl-Michael Schneider, & Steven Abney: Aspect marker generation in English-to-Chinese machine translation. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.521-527 [PDF, 129KB]

(2006) Necip Fazil Ayan & Bonnie J. Dorr: A maximum entropy approach to combining word alignments.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 96-103 [PDF, 325KB]

(2006) Srinivas Bangalore, Stephan Kanthak, & Patrick Haffner: Finite-state transducer-based statistical machine translation using joint probabilities. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 16-22 [PDF,179 KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Extracting parallel sub-sentential fragments from non-parallel corpora. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.81-88. [PDF, 1598KB]

(2006) Dragos Stefan Munteanu & Daniel Marcu: Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics 31 (4), pp. 477-504 [PDF, 1060KB]

(2006) Deyi Xiong, Qun Liu, & Shouxun Lin: Maximum entropy based phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.521-528. [PDF, 420KB]

(2005) Abraham Ittycheriah & Salim Roukos: A maximum entropy word aligner for Arabic-English machine translation.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 33-40. [PDF, 327KB]

(2005) Erik Velldal & Stephan Oepen: Maximum entropy models for realization ranking. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.109-116. [PDF, 217KB]

Mixture modelling

(2008) Andrew Finch & Eiichiro Sumita: Dynamic model interpolation for statistical machine translation. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.208-215. [PDF, 428KB]

(2007) Jorge Civera & Alfons Juan: Domain adaptation in statistical machine translation with mixture modelling.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 177-180 [PDF, 94KB]

(2007) George Foster & Roland Kuhn: Mixture-model adaptation for SMT.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 128-135 [PDF, 380KB]

(2006) Jorge Civera & Alfons Juan: Mixtures of IBM Model 2. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.159-167 [PDF, 142KB]

(2006) Bing Zhao & Eric P. Xing: BiTAM: bilingual topic admixture models for word alignment. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.969-976. [PDF, 574KB]

Modularity (see also Customization and implementation)

(2009) Mikel L.Forcada, Francis M.Tyers, & Gema Ramírez-Sánchez: The Apertium machine translation platform: five years on. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 3-10. [PDF, 253KB]

 (2009) Adam Lopez: Translation as weighted deduction. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.532-540. [PDF, 128KB]

(2005) Mats Attnäs, Pierre Senellart, & Jean Senellart: Integration of SYSTRAN MT systems in an open workflow. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.211-218. [PDF, 1951KB]

Natural language understanding

(2009) Younčs Bahou, Lamia Hadrich Belguith, & Abdelmajid Ben Hamadou: Disfluency and out-of-vocabulary word processing in Arabic speech understanding. CAASL-3 – Third Workshop on Computational Approaches to Arabic Script-based Languages [at] MT Summit XII, August 26, 2009, Ottawa, Ontario, Canada; 8pp. [PDF, 825KB]

Neural networks

 (2006) Holger Schwenk, Marta R. Costa-jussŕ & José A. R. Fonollosa: Continuous space language models for the IWSLT 2006 task. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2006], November 27-28, 2006, Kyoto, Japan; pp. 166-173 [PDF, 165KB]

(2006) Holger Schwenk, Daniel Dechelotte, & Jean-Luc Gauvain: Continuous space language models for statistical machine translation. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.723-730. [PDF, 508KB]

(2005) Necip Fazil Ayan, Bonnie J. Dorr, & Christof Monz: NeurAlign: combining word alignments using neural networks.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; pp. 65-72. [PDF, 139KB]

(2005) Philippe Langlais, Thomas Leplus, Simona Gandrabur, & Guy Lapalme: From the real world to real words: the METEO case. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 166-175. [PDF, 102KB]

(2005) Philippe Langlais, Simona Gandrabur, Thomas Leplus, & Guy Lapalme: The long-term forecast for weather bulletin translation [abstract]. Machine Translation 19 (1), 2005; pp.83-112.

Perceptron

(2009) Dayne Freitag & Zhiqiang Wang: Name transliteration with bidirectional perceptron edit models. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.132-135. [PDF, 96KB]

(2009) Alexandre Patry & Philippe Langlais: Prediction of words in statistical machine translation using a multilayer perceptron. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 101-111. [PDF, 337KB]

(2007) Dayne Freitag & Shahram Khadivi: A sequence alignment model based on the averaged perceptron. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 238-247. [PDF, 180KB]

Predictive methods

 (2009) Alexandre Patry & Philippe Langlais: Prediction of words in statistical machine translation using a multilayer perceptron. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 101-111. [PDF, 337KB]

 (2009) Zuoran Wang, John Shawe-Taylor, & Sándor Szedmák: Large-margin structured prediction via linear programming. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 89 slides. [PDF of PPT, 950KB]

(2007) Christoph Tillmann & Tong Zhang: A block bigram prediction model for statistical machine translation. ACM Transactions on Speech and Language Processing 4 (3), art.6. [abstract, PDF, 12KB]

(2005) Christoph Tillmann & Tong Zhang: A localized prediction model for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 557-564. [PDF, 189KB]

Programming and algorithms [see also Computer equipment and accessories, Dynamic programming, Parallel processing]

(2009) Jesús Andrés-Ferrer & Alfons Juan: A phrase-based hidden semi-Markov approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.168-175. [PDF, 410KB]

(2009) Abhishek Arun, Chris Dyer, Barry Haddow, Phil Blunsom, Adam Lopez, & Philipp Koehn: Monte Carlo inference and maximization for phrase-based translation. CoNLL-2009. Proceedings of the Thirteenth Conference on Computational Natural Language Learning, June 4-5, 2009, Boulder, Colorado; pp.102-110. [PDF, 167KB]

(2009) Michael Auli, Adam Lopez, Hieu Hoang & Philipp Koehn: A systematic analysis of translation model search spaces. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.224-232. [PDF, 101KB]

(2009) Sander Canisius & Antal van den Bosch: A constraint satisfaction approach to machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.182-189. [PDF, 533KB]

(2009) Yu Chen, Martin Kay, & Andreas Eisele: Intersecting multilingual data for faster and better statistical translations.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.128-136. [PDF, 159KB]

(2009) Fabien Cromičres & Sadao Kurohashi: An alignment algorithm using belief propagation and a structure-based distortion model. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.166-174. [PDF, 221KB]

(2009) John DeNero, Adam Pauls, & Dan Klein: Asynchronous binarization for synchronous grammars. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.141-144. [PDF, 183KB]

(2009) John DeNero, David Chiang, & Kevin Knight: Fast consensus decoding over translation forests. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.567-575. [PDF, 650KB]

(2009) Beate Dorow, Florian Laws, Lukas Michelbacher, Christian Scheible, & Jason Utt: A graph-theoretic algorithm for automatic extension of translation lexicons. Proceedings of the EACL 2009 Workshop on GEMS: Geometrical Models of Natural Language Semantics, Athens, Greece, 31 March 2009; pp.91-95. [PDF, 81KB]

(2009) Andrew Finch & Eiichiro Sumita: Bidirectional phrase-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1124-1132. [PDF, 164KB]

(2009) Barry Haddow: Word-based models and the EM algorithm. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.35-41 [PDF, 167KB]

(2009) Hieu Hoang & Josh Schroeder: Moses installation and training run-through. Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; 18pp. [PDF, 74KB]

(2009) Gonzalo Iglesias, Adriŕ de Gispert, Eduardo R.Banga & William Byrne: Rule filtering by pattern for efficient hierarchical translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.380-388. [PDF, 109KB]

(2009) Matti Kääriäinen: Sinuhe – statistical machine translation using a globally trained conditional exponential family translation model. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1027-1036. [PDF, 159KB]

(2009) Shankar Kumar, Wolfgang Machery, Chris Dyer & Franz Och: Efficient minimum error rate training and minimum Bayes-risk decoding for translation hypergraphs and lattices. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.163-171. [PDF, 342KB]

(2009) Abby Levenberg & Miles Osborne: Stream-based randomized language models for SMT. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.756-764. [PDF, 179KB]

(2009) Zhifei Li & Sanjeev Khudanpur: Efficient extraction of oracle-best translations from hypergraphs.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.9-12. [PDF, 112KB]

(2009) Zhifei Li, Chris Callison-Burch, Chris Dyer, Juri Ganitkevitch, Sanjeev Khudanpur, Lane Schwartz, Wren N.G.Thornton, Jonathan Weese, & Omar F.Zaidan: Joshua: an open source toolkit for parsing-based machine translation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.135-139. [PDF, 109KB]

(2009) Adam Lopez: Translation as weighted deduction. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.532-540. [PDF, 128KB]

(2009) Pasquale Minervini: Apertium goes SOA: an efficient and scalable service based on the Apertium rule-based machine translation platform. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 59-65. [PDF, 236KB]

(2009) Robert C.Moore & Chris Quirk: Less is more: significance-based n-gram selection for smaller, better language models. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.746-755. [PDF, 564KB]

(2009) Patrick Paul, Samuel Larkin, Ulrich Germann, Eric Joanis, & Roland Kuhn: PortageLive: delivering machine translation technology via virtualization. [Paper for technology showcase at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; 7pp.. [PDF, 77KB]

(2009) Adam Pauls, John DeNero, & Dan Klein: Consensus training for consensus decoding in machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1418-1427. [PDF, 502KB]

(2009) Michael Pust & Kevin Knight: Faster MT decoding through pervasive laziness.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.141-144. [PDF, 108KB]

(2009) Sebastian Riedel & James Clarke: Revisiting optimal decoding for machine translation IBM Model 4. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.5-8. [PDF, 112KB]

(2009) Antti-Veikko I.Rosti, Bing Zhang, Spyros Matsoukas, & Richard Schwartz: Incremental hypothesis alignment with flexible matching for building confusion networks: BBN system description for WMT09 system combination task. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.61-65. [PDF, 115KB]

(2009) Markus Saers, Joakim Nivre, & Dekai Wu: Learning stochastic bracketing inversion transduction grammars with a cubic time biparsing algorithm. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 29-32. [PDF, 126KB]

(2009) Victor M.Sánchez-Cartagena & Juan Antonio Pérez-Ortiz: An open-source highly scalable web service architecture for the Apertium machine translation engine. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 51-58. [PDF, 240KB]

(2009) Anders Sřgaard & Dekai Wu: Emprical lower bounds on translation unit error rate for the full class of inversion transduction grammars. IWPT-09: Proceedings of the 11th International Conference on Parsing Technologies, 7-9 October 2009, Paris, France; pp. 33-36. [PDF, 87KB]

(2009) Nadi Tomeh, Nicola Cancedda, & Marc Dymetman: Complexity-based phrase-table filtering for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.144-151. [PDF, 215KB]

(2009) Nicola Ueffing, Gholamreza Haffari, & Anoop Sarkar: Semisupervised learning for machine translation.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.237-256.

(2009) David Vilar & Hermann Ney: On LM heuristics for the cube growing algorithm. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.242-249. [PDF, 459KB]

(2009) Zuoran Wang, John Shawe-Taylor, & Sándor Szedmák: Large-margin structured prediction via linear programming. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 89 slides. [PDF of PPT, 950KB]

(2009) Zhuoran Wang & John Shawe-Taylor: Kernel-based machine translation.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.169-184.

(2009) Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu, & Ming Zhou: Better synchronous binarization for machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.362-370. [PDF, 637KB]

(2009) Deyi Xiong, Min Zhang, Aiti Aw & Haizhou Li: Efficient beam thresholding for statistical machine translation. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 363-370. [PDF, 92KB]

(2009) Kenji Yamada & Ion Muslea: Reranking for large-scale statistical machine translation.  In: Cyril Goutte, Nicola Cancedda, Marc Dymetman, & George Foster (eds.) Learning machine translation. (Cambridge, Mass.: The MIT Press, 2009); pp.151-168.

(2009) Mei Yang & Jing Zheng: Toward smaller, faster, and better hierarchical phrase-based SMT. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.237-240. [PDF, 140KB]

(2009) Zdeněk Žabokrtský & Martin Popel: Hidden Markov tree model in dependency-based machine translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Short Papers, Suntec, Singapore, 4 August 2009; pp.145-148. [PDF, 154KB]

(2009) Zdeněk Žabokrtský: TectoMT: software framework for developing MT systems (and other NLP applications). Third Machine Translation Marathon, Prague, Czech Republic, 26-30 January 2009; pp.51-56 [PDF, 570KB]

(2009) Mikhail Zaslavskiy, Marc Dymetman, & Nicola Cancedda: Phrase-based statistical machine translation as a traveling salesman problem. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.333-341. [PDF, 331KB]

(2009) Hui Zhang, Min Zhang, Haizhou Li, & Chew Lim Tan: Fast translation rule matching for syntax-based statistical machine translation. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.1037-1045. [PDF, 262KB]

(2009) Bing Zhao & Shengyuan Chen: A simplex Armijo downhill algorithm for optimizing statistical machine translation decoding parameters.  NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.21-24. [PDF, 601KB]

(2009) Ali Zogheib: Genetic algorithm-based multi-word automatic language translation. (IIS-2009) Recent Advances in Intelligent Information Systems, ed. M.A.Klopotek, A. Przepiórkowski, S.T. Wierzchon, K. Trojanowski, Kraków, Poland, June 15-18, 2009; pp.751-760. [PDF, 141KB]

(2008) Yu Chen, Andreas Eisele, & Martin Kay: Improving statistical machine translation efficiency by triangulation.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 6pp. [PDF, 122KB]

(2008) David Chiang, Yuval Marton, & Philip Resnik: Online large-margin training of syntactic and structural translation features.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233. [PDF, 162KB]

(2008) David Chiang, Steve DeNeefe, Yee Seng Chan & Hwee Tou Ng: Decomposability of translation metrics for improved evaluation and efficient algorithms.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.610-619. [PDF, 316KB]

(2008) Markus Dreyer, Jason R.Smith, & Jason Eisner: Latent-variable modeling of string transductions with finite-state methods.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.1080-1089. [PDF, 513KB]

(2008) Kevin Duh & Katrin Kirchhoff: Beyond log-linear models: boosted minimum error rate training for n-best re-ranking. ACL 2008 HLT Short Papers, June 2008, Columbus, Ohio; pp.37-40. [PDF, 102KB]

(2008) Christopher Dyer, Aaron Cordova, Alex Mont, & Jimmy Lin: Fast, easy, and cheap: construction of statistical machine translation models with MapReduce. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.199-207. [PDF, 228KB]

(2008) Qin Gao & Stephan Vogel: Parallel implementations of word alignment tool. ACL-08 HLT: Software Engineering, Testing, and Quality Assurance for Natural Language Processing, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp.49-57. [PDF, 340KB]

(2008) Jorge González & Francisco Casacuberta: Linguistic categorisation in machine translation using stochastic finite state transducers. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 55-62. [PDF, 321KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Ahmed Hatem & Amin Nassar: Modified Dijstra-like search algorithm for English to Arabic machine translation system. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.66-71. [PDF, 562KB]

(2008) Hieu Hoang & Philipp Koehn: Design of the Moses decoder for statistical machine translation. ACL-08 HLT: Software Engineering, Testing, and Quality Assurance for Natural Language Processing, June 20, 2008, The Ohio State University, Columbus, Ohio, USA; pp.58-65. [PDF, 137KB]

(2008) Reginald L.Hobbs & Clare R.Voss: Designing and executing machine translation workflows through the Kepler framework. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.380-389.  [PDF of PPT presentation, 1204KB]

(2008) Antonio L.Lagarda & Francisco Casacuberta: Applying boosting to statistical machine translation. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.88-96. [PDF, 631KB]

(2008) Patrik Lambert & Rafael E.Banchs: Word association models and search strategies for discriminative word alignment. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.97-103. [PDF, 633KB]

(2008) Gregor Leusch, Evgeny Matusov, & Hermann Ney: Complexity of finding the BLEU-optimal hypothesis in a confusion network.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.839-847. [PDF, 255KB]

(2008) Zhifei Li & Sanjeev Khudanpur: Large-scale discriminative n-gram language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.133-142. [PDF, 643KB]

(2008) Kai Li & Yuejie Zhang: Efficient statistical machine translation algorithm based on IBM model 4. Proceedings of NTCIR-7 Workshop Meeting, December 16-19, 2008, Tokyo, Japan; pp. 450-453. [PDF, 394KB]

(2008) Zhifei Li & Sanjeev Khudanpur: A scalable decoder for parsing-based machine translation with equivalent language model state maintenance. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.10-18. [PDF, 569KB]

(2008) Wolfgang Machery, Franz Josef Och, Ignacio Thayer, & Jakob Uszkoreit: Lattice-based minimum error rate training for statistical machine translation.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.725-734. [PDF, 304KB]

(2008) Robert C.Moore & Chris Quirk: Random restarts in minimum error rate training for statistical machine translation.  Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.585-592. [PDF, 552KB]

(2008) Slav Petrov, Aria Haghighi, & Dan Klein: Coarse-fine syntactic machine translation using language projections.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.108-116. [PDF, 306KB]

(2008) Matt Post & Daniel Gildea: Parsers as language models for statistical machine translation. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.172-181. [PDF, 538KB]

(2008) Germán Sanchis-Trilles & Joan Andreu Sánchez: Using parsed corpora for estimating stochastic inversion transduction grammars.  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 3pp. [PDF, 48KB]

(2008) Libin Shen, Jinxi Xu, & Ralph Weischedel: A new string-to-dependency machine translation algorithm with a target dependency language model. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 577-585. [PDF, 147KB]

(2008) Anders Sřgaard: Learning context-sensitive synchronous rules. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.170-175. [PDF, 525KB]

(2008) Michael Subotin: Generalizing local translation models. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.28-36. [PDF, 148KB]

(2008) Roy W.Tromble, Shankar Kumar, Franz Och, & Wolfgang Machery: Lattice minimum Bayes-risk decoding for statistical machine translation. EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.620-629. [PDF, 285KB]

(2008) Jakob Uszkoreit & Thorsten Brants: Distributed word clustering for large scale class-based language modelling in machine translation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the conference, June 15-20, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 755-762. [PDF, 222KB]

(2008) Ashish Venugopal, Andreas Zollmann, Noah A. Smith, & Stephan Vogel: Wider pipelines: n-best alignments and parses in MT training. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.192-201. [PDF, 614KB]

(2008) Zhuoran Wang & John Shawe-Taylor: Kernel regression framework for machine translation: UCL system description for WMT 2008 shared translation task. ACL-08: HLT. Third Workshop on Statistical Machine Translation, Proceedings, June 19, 2008, The Ohio State University, Columbus, Ohio, USA (ACL WMT-08); pp.155-158. [PDF, 123KB]

(2008) Fai Wong & Kwok Kit Leung: The design of web based machine translation server based on grid infrastructure . The Fourth International Conference on Networked Computing and Advanced Information Management (NCM2008), Gyeongju, Korea, 2008; pp.713-718. [PDF, 226KB]

(2008) Keiji Yasuda, Ruiqiang Zhang, Hirofumi Yamamoto, & Eiichiro Sumita: Method of selecting training data to build a compact and efficient translation model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.655-660. [PDF, 447KB]

(2008) Hao Zhang, Daniel Gildea, & David Chiang: Extracting synchronous grammar rules from word-level alignments in linear time. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1081-1088. [PDF, 181KB]

(2008) Jingbo Zhu, Huizhen Wang, & Eduard Hovy: Learning a stopping criterion for active learning for word sense disambiguation and text classification. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.366-372. [PDF, 421KB]

(2007) Marcello Federico & Mauro Cettolo: Efficient handling of n-gram language models for statistical machine translation.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 88-95 [PDF, 361KB]

(2007) M. Federico: Efficient handling of n-gram language models for statistical machine translation. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 31pp. [PDF of PPT presentation, 474KB]

(2007) Liang Huang & David Chiang: Forest rescoring: faster decoding with integrated language models. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 144-151 [PDF, 215KB]

(2007) Kevin Knight: Capturing practical natural language transformations [abstract]. Machine Translation 21 (2), June 2007; pp.121-133.

(2007) Philippe Langlais, Alexandre Patry & Fabrizio Gotti: A greedy decoder for phrase-based statistical machine translation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.104-113 [PDF, 302KB]; presentation [PDF, 1120KB]

(2007) Adam Lopez: Hierarchical phrase-based translation with suffix arrays. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 976-985. [PDF, 313KB]

(2007) David Talbot & Miles Osborne: Randomised language modelling for statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 512-519 [PDF, 176KB]

(2007) David Talbot & Miles Osborne: Smooth bloom filter language models: saving space by flipping coins. First Machine Translation Marathon, Edinburgh, April 16-20, 2007; 45pp. [PDF of PPT presentation, 1394KB]

(2007) Ashish Venugopal, Andreas Zollmann, & Stephan Vogel: An efficient two-pass approach to synchronous-CFG driven statistical MT. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.500-507 [PDF, 279KB]

(2007) Richard Zens & Hermann Ney: Efficient phrase-table representation for machine translation with applications to online MT and speech translation. NAACL-HLT-2007 Human Language Technology: the conference of the North American Chapter of the Association for Computational Linguistics, 22-27 April 2007, Rochester, NY; pp.492-499 [PDF, 299KB]

(2007) Fai Wong : The parsing algorithm of translation corresponding tree (TCT) grammar . Third Asian-Pacific Congress on Computational Mechanics (APCOM'07) in conjunction with Eleventh International Conference on the Enhancement and Promotion of Computational Methods in Engineering and Science (EPMESC XI), December 3-6, 2007, Kyoto, Japan; 10pp. [PDF, 294KB]

(2007) Hao Zhang & Daniel Gildea: Factorization of synchronous context-free grammars in linear time. SSST, NAACL-HLT-2007 AMTA Workshop on  Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.25-32 [PDF, 151KB]

(2006) Srinivas Bangalore, Patrick Haffner, & Stephan Kanthak: Sequence classification for machine translation. Interspeech 2006: ICSLP Ninth International Conference on  Spoken Language Processing, Pittsburgh, PA, USA, September 17-21, 2006, paper 1722; abstract [PDF, 72KB]

(2006) Marcello Federico & Nicola Bertoldi: How many bits are needed to store probabilities for phrase-based translation?  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 94-101 [PDF, 204KB]

(2006) Liang Huang: Efficient algorithms for richer formalisms: parsing and machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 223-226 [PDF, 87KB]

(2006) Krzysztof Jassem & Kowalski Tomasz: An algorithm for extracting translation rules from scarce bilingual corpora. Proceedings of the International Multiconference on Computer Science and Information Technology, vol.1: XXII Autumn Meeting of Polish Information Processing Society, November 6-10, 2006, Wisla, Poland; pp.67-73. [PDF, 368KB]

(2006) Marian Olteanu, Pasin Suriyentrakorn, & Dan Moldovan: Language models and reranking for machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 150-153 [PDF, 115KB]

(2006) Daniel Ortiz Martínez, Ismael García Varea, & Francisco Casacuberta Nolla: Generalizing stack decoding algorithms for statistical machine translation.  HLT-NAACL 2006: Proceedings of the Workshop on Statistical Machine Translation, New York, NY, USA, June 2006; pp. 64-71 [PDF, 239KB]

(2006) Hao Zhang, Liang Huang, Daniel Gildea, & Kevin Knight: Synchronous binarization for machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 256-263 [PDF, 206KB]

(2005) Mathieu Lafourcade: Semantic analysis through ant algorithms, conceptual vectors and fuzzy UNL graphs. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.125-137 [abstract, PDF, 79KB]

(2005) Jorge Marques Pelizzoni & Maria das Graças Volpe Nunes: Flexibility, configurability and optimality in UNL deconversion via multiparadigm programming . In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute);  pp.175-194 [abstract, PDF, 13KB]

(2005) Ronaldo Martins, Ricardo Hasegawa & M.Graças V.Nunes: Hermeto: a NL-UNL enconverting environment. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.254-260 [abstract, PDF, 52KB]

Robustness

(2009) Jesús Giménez & Lluís Mŕrquez: On the robustness of syntactic and semantic features for automatic MT evaluation.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.250-258. [PDF, 171KB]

(2007) John Tinsley, Ventsislav Zhechev, Mary Hearne, & Andy Way: Robust language pair-independent sub-tree alignment. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.467-474 [PDF, 1310KB]

(2006) Pascale Fung & Benfeng Chen: Robust word sense translation by EM learning of frame semantics. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.239-246. [PDF, 223KB]

(2006) Xiaoyi Ma: Champollion: a robust parallel text sentence aligner.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.489-492 [PDF, 313KB]

Statistical analysis (see also Finite-state methods, Hidden Markov models)

(2009) Fabien Cromičres & Sadao Kurohashi: An alignment algorithm using belief propagation and a structure-based distortion model. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.166-174. [PDF, 221KB]

(2009) Loic Dugast, Jean Senellart, & Philipp Koehn: Statistical post editing and dictionary extraction: Systran/Edinburgh submissions for ACL-WMT2009.  Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.110-114. [PDF, 245KB]

(2009) Nizar Habash, Bonnie Dorr, & Christof Monz: Symbolic-to-statistical hybridization: extending generation-heavy machine translation [abstract]. Machine Translation 23 (1), February 2009; pp.23-63.

 (2009) Alon Halevy, Peter Norvig, & Fernando Pereira: The unreasonable effectiveness of data. IEEE Intelligent Systems, March-April 2009; pp.8-12. [PDF, 376KB]

(2009) Ulf Hermjakob: Improved word alignment with statistics and linguistic heuristics. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.229-237. [PDF, 107KB]

 (2009) A.-L.Lagarda, V.Alabau, F.Casacuberta, R.Silva, & E.Díaz-de-Liańo: Statistical post-editing of a rule-based machine translation system. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Short Papers, Boulder, Colorado, May 31 - June 5, 2009; pp.217-220. [PDF, 83KB]

(2009) Khalil Sima’an: Well-behaved nonparametric statistical learning for parsing and translation [abstract]. Benelearn 2009: proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, Tilburg, May 2009; p.65. [PDF, 139KB]

(2008) Stephan Busemann & Yajing Zhang: Identifying foreign person names in Chinese text. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 255KB]

(2008) Michael Carl, Maite Melero, Toni Badia, Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, & Olga Yannoutsou: METIS-II: low resource machine translation [abstract]. Machine Translation 22 (1/2), March-June 2008; pp.67-99.

(2008) Peter A.Chew, Brett W.Bader, & Ahmed Abdelali: Latent morpho-semantic analysis: multilingual information retrieval with character n-grams and mutual information. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.129-136. [PDF, 117KB]

(2008) A. Diaz de Ilarraza, G. Labaka, & K. Sarasola: Statistical post-editing: a valuable method in domain adaptation of RBMT systems for less-resourced languages. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 35-40. [PDF, 364KB]

(2008) Surya Ganesh, Sree Harsha, Prasad Pingali, & Vasudeva Varma: Statistical transliteration for cross language information retrieval using HMM alignment and CRF. IJCNLP 2008: 2nd International Workshop on Cross-Lingual Information Access (CLIA) Proceedings of the workshop, 11 January 2008, Hyderabad, India; pp.42-47. [PDF, 127KB]

(2008) Petra Gieselmann: Architecture of the Lucy translation system. Second Machine Translation Marathon, Wandlitz, Berlin, Germany, 12-17 May 2008; 28 slides. [PDF, 225KB]

(2008) Yvette Graham & Josef van Genabith: Packed rules for automatic transfer-rule induction. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.57-65. [PDF, 1052KB]

(2008) Chengguo Jin, Dong-Il Kim, Seung-Hoon Na, & Jong-Hyeok Lee: Automatic extraction of English-Chinese transliteration pairs using dynamic window and tokenizer. IJCNLP 2008: Sixth SIGHAN Workshop on Chinese Language Processing, Proceedings of the workshop, 11-12 January 2008, Hyderabad, India; pp.9-15. [PDF, 385KB]

(2008) Jin-Shea Kuo & Haizhou Li: Multi-view co-training of transliteration model. IJCNLP 2008: Third International Joint Conference on Natural Language Processing, January 7-12, 2008, Hyderabad, India; pp.373-380. [PDF, 465KB]

(2008) Rogelio Nazar, Leo Wanner, & Jorge Vivaldi: Two-step flow in bilingual lexicon extraction from unrelated corpora. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.140-149. [PDF, 537KB]

(2008) Vivek Kumar Rangarajan Sridhar, Shrikanth Narayanan, & Srinivas Bangalore: Enriching spoken language translation with dialog acts. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Short papers, June 16-17, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 225-228. [PDF, 383KB]

(2008) Sujith Ravi & Kevin Knight: Attacking decipherment problems optimally with low-order n-gram models.  EMNLP 2008: Proceedings of  the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.812-819. [PDF, 659KB]

(2008) J.D.Riding: Statistical glossing – language independent analysis in Bible translation. Translating and the Computer 30, 27-28 November 2008, London; 11pp. [PDF, 137KB]

(2008) Clare R.Voss, Matthew Aguirre, Jeffrey Micher, Richard Chang, Jamal Laoudi, & Reginald Hobbs: Boosting performance of weak MT engines automatically: using MT output to align segments & build statistical post-editors. EAMT 2008: 12th annual conference of the European Association for Machine Translation, September 22 & 23, 2008, Hamburg, Germany. Proceedings, ed. John Hutchins and Walther v.Hahn; pp.192-201. [PDF, 506KB]

(2007) Vicente Alabau, Alberto Sanchis, & Francisco Casacuberta: Improving speech-to-speech translation using word posterior probabilities. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.9-14 [PDF, 87KB]

(2007) Michael Carl & Paul Schmidt: A modular architecture for separating hypothesis formation from hypothesis evaluation in data-driven machine translation.  METIS-II Workshop: New Approaches to Machine Translation, Centre for Computational Linguistics, Katholieke Universiteit Leuven, Belgium, 11 January 2007; 8pp. [PDF, 155KB]

(2007) Michael Carl, Sandrine Garnier, & Paul Schmidt: Demonstration of the German to English METIS-II MT system. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.41-42 [PDF, 251KB]; poster [PDF, 276KB]

(2007) Michael Carl: METIS-II: the German to English MT system.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.65-72 [PDF, 166KB]

(2007) Loďc Dugast, Jean Senellart, & Philipp Koehn: Statistical post-editing on  SYSTRAN’s rule-based translation system.  ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 220-223 [PDF, 128KB]

(2007) Ehara Terumasa: Rule based machine translation combined with statistical post editor for Japanese to English patent translation. MT Summit XI Workshop on patent translation, 11 September 2007, Copenhagen, Denmark; pp.13-18. [PDF, 355KB]

(2007) Oren Etzioni, Kobi Reiter, Stephen Soderland, & Marcus Sammer: Lexical translation with application to image searching on the web. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.175-182 [PDF, 603KB]

(2007) Masaki Itagaki, Takako Aikawa, & Xiaodong He: Automatic validation of terminology translation consistency with statistical method.  MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.269-274 [PDF, 416KB]

(2007) Irene Langkilde-Geary: Declarative syntactic processing of natural language using concurrent constraint programming and probabilistic dependency modeling. MT Summit XI Workshop: Using corpora for natural language generation: language generation and machine translation (UCNLG+MT), 11 September 2007, Copenhagen, Denmark; pp.55-63 [PDF, 1969KB]

(2007) Stephan Oepen, Erik Velldal, Jan Tore Lřnning, Paul Meurer, Victoria Rosén, & Dan Flickinger: Towards hybrid quality-oriented machine translation – on linguistics and probabilities in MT. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.144-153 [PDF, 328KB]

(2007) Yves Scherrer: Adaptive string distance measures for bilingual dialect lexicon induction. ACL 2007: proceedings of the Student Research Worksho, Prague, Czech Republic, June 2007; pp. 55-60 [PDF, 122KB]

(2007) Michel Simard, Nicola Ueffing, Pierre Isabelle, & Roland Kuhn: Rule-based translation with statistical phrase-based post-editing. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 203-206 [PDF, 130KB]

(2007) A.Cüneyd Tantuğ, Eşref Adali, & Kemal Oflazer: A MT system from Turkmen to Turkish employing finite state and statistical methods. MT Summit XI, 10-14 September 2007, Copenhagen, Denmark. Proceedings; pp.459-465 [PDF, 399KB]

(2006) Nicola Cancedda: SMART: research directions. [Xerox, 2006]; 3pp. [PDF, 25KB]

(2006) Rashmi Gangadharaiah, Ralf Brown & Jaime Carbonell: Spectral clustering for example based machine translation.  HLT-NAACL 2006: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, New York, NY, USA, June 2006; pp. 41-44 [PDF, 63KB]

(2006) Saša Hasan, Shahram Khadivi, Richard Zens, & Hermann Ney: A flexible architecture for CAT applications. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.81-88 [PDF, 160KB]

(2006) Kevin Knight, Anish Nair, Nishit Rathod, & Kenji Yamada: Unsupervised analysis for decipherment problems. Coling-ACL 2006: Proceedings of the Coling/ACL 2006 Main Conference Poster Sessions, Sydney, July 2006; pp.499-506. [PDF, 200KB]

(2006) Paul McNamee & James Mayfield: Translation of multiword expressions using parallel suffix arrays.  AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.100-109 [PDF, 733KB]

(2006) Scott S.L.Piao, Guangfan Sun, Paul Rayson, & Qi Yuan: Automatic extraction of Chinese multiword expressions with a statistical tool. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp.17-24 [PDF, 396KB]

(2006) Brock Pytlik & David Yarowsky: Machine translation for languages lacking bitext via multilingual gloss transduction. AMTA 2006: Proceedings of the 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; pp.156-165 [PDF, 267KB]

(2006) Evan Ratliff: Me translate pretty one day. Wired 14 (12), 2006; pp.210-213. [PDF, 32KB]

(2006) Jean Senellart & John S. White: First strategies for integrating hybrid approaches into established systems (contribution to panel on “Hybrid machine translation”).  AMTA 2006: 7th Conference of the Association for Machine Translation in the Americas, “Visions for the Future of Machine Translation”, August 8-12, 2006, Cambridge, Massachusetts, USA; [PDF of PPT presentation, 68KB]

(2006) Radu Soricut & Daniel Marcu: Stochastic language generation using WIDL-expressions and its application in machine translation and summarization.  Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.1105-1112. [PDF, 233KB]

(2006) Daniel Stein, Jan Bungeroth, & Hermann Ney: Morpho-syntax based statistical methods for automatic sign language translation. EAMT-2006: 11th Annual Conference of the European Association for Machine Translation, June 19-20, 2006, Oslo, Norway. Proceedings; p.169-177 [PDF, 206KB]

(2005) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, & Naoto Kato: Probabilistic model for example-based machine translation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.219-226. [PDF, 250KB]

(2005) David Chiang: A hierarchical phrase-based model for statistical machine translation. ACL-2005: 43rd Annual meeting of the Association for Computational Linguistics, University of Michigan, Ann Arbor, 25-30 June 2005; pp. 263-270. [PDF, 203KB]

(2005) Antonio M.Corbi-Bellot, Mikel L. Forcada, Sergio Ortíz-Rojas, Juan Antonio Pérez-Ortiz, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Ińaki Alegria, Aingeru Mayor, & Kepa Sarasola: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 79-86. [PDF, 130KB]

(2005) Philippe Langlais, Thomas Leplus, Simona Gandrabur, & Guy Lapalme: From the real world to real words: the METEO case. 10th EAMT conference "Practical applications of machine translation", 30-31 May 2005, Budapest; pp. 166-175. [PDF, 102KB]

(2005) Liu Zhanyi, Wang Haifeng & Wu Hua: Example-based machine translation based on TSC and statistical generation. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.25-32. [PDF, 476KB]

(2005) Robert C. Moore: Association-based bilingual word alignment. ACL-2005: Workshop on Building and Using Parallel Texts – Data-driven machine translation and beyond, University of Michigan, Ann Arbor, 29-30 June 2005; pp. 1-8. [PDF, 102KB]

(2005) Stefan Riezler & John T.Maxwell III: On some pitfalls in automatic evaluation and significance testing for MT. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 57-64. [PDF, 115KB]

(2005) Karen Spärck Jones: Some points in a time. Computational Linguistics 31 (1), pp. 1-14. [PDF, 1825KB]

Support vector machines

(2009) Asif Ekbal & Sivajii Banyopadhyay: Voted NER system using appropriate unlabeled data. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.202-210. [PDF, 98KB]

(2009) Katsuhiko Hayashi, Taro Watanabe, Hajime Tsukada, & Hideki Isozaki: Structural support vector machines for log-linear approach in statistical machine translation. IWSLT 2009: Proceedings of the International Workshop on Spoken Language Translation, National Museum of Emerging Science and Innovation, Tokyo, Japan, December 1-2, 2009; pp. 144-151. [PDF, 352KB]; presentation [PDF of PPT, 220KB]

(2009) Yifan He & Andy Way: Learning labeled dependencies in machine translation evaluation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Mŕrquez and Harold Somers, 14-15 May 2009, Universitat Politčcnica de Catalunya, Barcelona, Spain; pp.44-51. [PDF, 426KB]

(2009) Jana Kravalová & Zdeněk Žabokrtský: Czech named entity corpus and SVM-based recognizer. [ACL-IJCNLP-2009] Proceedings of the 2009 Named Entities Workshop ACL-IJCNLP 2009, Suntec, Singapore, 7 August 2009; pp.194-201. [PDF, 140KB]

(2009) Sylvain Raybaud, David Langlois & Kamel Smaili: Efficient combination of confidence measures for machine translation. Interspeech 2009: 10th Annual Conference of the International Speech Communication Association, 6-10 September 2009, Brighton, UK; abstract [PDF]

(2009) Zuoran Wang, John Shawe-Taylor, & Sándor Szedmák: Large-margin structured prediction via linear programming. SMART Workshop at EACL 2009, Barcelona, Spain, 13 May 2009. 89 slides. [PDF of PPT, 950KB]

(2009) Min Zhang & Haizhou Li: Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.698-707. [PDF, 294KB]

(2008) Colin Cherry & Chris Quirk: Discriminative, syntactic language modeling through latent SVMs. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.65-74. [PDF, 609KB]

(2008) Katsunori Kotani, Takehiko Yoshimi, Takeshi Kutsumi, Ichiko Sata, & Hitoshi Isahara: A method of automatically evaluating machine translations using a word-alignment based classifier. MATMT 2008: Mixing Approaches to Machine Translation, Donostia-San Sebastian [Spain], February 14th 2008: Proceedings; pp. 11-18. [PDF, 259KB]

(2008) Yohei Morishita, Takehito Utsuro, & Mikio Yamamoto: Integrating a phrase-based SMT model and a bilingual lexicon for human in semi-automatic acquisition of technical term translation lexicon. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.153-162. [PDF, 988KB]

(2008) Katsuhito Sudoh, Taro Watanabe, Jun Suzuki, Hajime Tsukada, & Hideki Isozaki: NTT statistical machine translation system for IWSLT 2008. IWSLT 2008: Proceedings of the International Workshop on Spoken Language Translation, 20-21 October 2008, Hawaii, USA; pp.92-97 [PDF, 234KB]; presentation [PDF, 248KB]

(2008) Suqi Sun, Yin Chen & Jufeng Li: A re-examination on features in regression based approach to automatic MT evaluation. ACL-08: HLT. 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Proceedings of the student research workshop, June 16, 2008, The Ohio State University, Columbus, Ohio, USA; pp. 25-30. [PDF, 234KB]

(2008) Jiajun Zhang, Chengqing Zong, & Shoushan Li: Sentence type based reordering model for statistical machine translation. Coling 2008:  22nd International Conference on Computational Linguistics, Proceedings of the conference, 18-22 August 2008, Manchester UK; pp.1089-1096. [PDF, 151KB]

(2007) Eiji Aramaki, Takeshi Imai, Kengo Miyo, & Kazuhiko Ohe: Support vector machine based orthographic disambiguation. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.21-30 [PDF, 979KB]; poster [PDF, 403KB]

(2007) Yee Seng Chan, Hwee Tou Ng, & David Chiang: Word sense disambiguation improves statistical machine translation. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 33-40 [PDF, 516KB]

(2007) Roxana Girju: Improving the interpretation of noun phrases with cross-linguistic information. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 568-575 [PDF, 156KB]

(2007) Andrew Mutton, Mark Dras, Stephen Wan, & Robert Dale: GLEU: automatic evaluation of sentence-level fluency. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 344-351 [PDF, 157KB]

(2006) Alexandru Ceauşu, Dan Ştefănescu, & Dan Tufiş: Acquis Communautaire sentence alignment using support vector machines.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Proceedings, Genoa, Italy, 22-28 May 2006; pp.2134-2137 [PDF, 381KB]

(2006) Masatoshi Tsuchiya, Takao Shime, Toshihiro Takagi, Takehito Utsuro, Kiyotaka Uchimoto, Suguru Matsuyoshi, Satoshi Sato, & Seiichi Nakagawa: Chunking Japanese compound functional expressions by machine learning. EACL-2006: 11th Conference of the European Chapter of the Association for Computational Linguistics, Workshop on Multi-word expressions in a Multilingual Context, Trento, Italy, April 3, 2006; pp. 25-32 [PDF, 793KB]

(2005) Chris Brockett & William B.Dolan: Support vector machines for paraphrase identification and corpus construction. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 1-8. [PDF, 115KB]

(2005) William B.Dolan & Chris Brockett: Automatically constructing a corpus of sentential paraphrases. IJCNLP-05: Third International Workshop on Paraphrasing (IWP 2005). Proceedings of the workshop, 12 October 2005, Jeju Island, Korea; pp. 9-16. [PDF, 115KB]

(2005) Takeshi Kutsumi, Takehiko Yoshimi, Katsunori Kotani, Ichiko Sata, & Hitoshi Isahara: Selection of entries for a bilingual dictionary from aligned translation equivalents using support vector machines. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.11-16. [PDF, 253KB]

System development

(2009) Aingeru Mayor & Francis M.Tyers: Matxin: moving towards language independence. Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, 2-3 November 2009, Universitat d’Alacant, Alacant, Spain; ed. Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Francis M.Tyers; pp. 11-17. [PDF, 205KB]

(2008) Jesús Giménez & Luís Mŕrquez: Towards heterogeneous automatic MT error analysis. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 8pp. [PDF, 90KB]

(2007) Gregor Thurmair: Automatic evaluation in MT system production. MT Summit XI Workshop: Automatic procedures in MT evaluation, 11 September 2007, Copenhagen, Denmark, [Proceedings]; 28pp. [PDF of PPT presentation, 153KB]

(2005) Adam Lopez & Philip Resnik: Pattern visualization for machine translation output.  HLT-EMNLP-2005: Proceedings of Human Technology Conference and Conference on Empirical Methods in Natural Language Processing, Vancouver, October 2005; Demonstration abstracts, pp. 12-13. [PDF, 160KB]

Telecommunications

(2005) Carlos A.Estombelo-Montesco & Dilvan A.Moreira: UCL – Universal Communication Language. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.326-336 [abstract, PDF, 82KB]

Virtual reality

(2009) Ying Zhang & Nguyen Bach: Virtual Babel: towards context-aware machine translation in virtual worlds. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 387-394. [PDF, 290KB]

Virtualization

(2009) Patrick Paul, Samuel Larkin, Ulrich Germann, Eric Joanis, & Roland Kuhn: PortageLive: delivering machine translation technology via virtualization. [Paper for technology showcase at] MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; 7pp.. [PDF, 77KB]

XML

(2009) Terence Lewis: Notes on a live demonstration [of the Trasy Dutch-English translation software]. Translating and the Computer 31, 19-20 November 2009, London; 7pp. [PDF, 362KB]

(2009) Yasser Salem & Brian Nolan: Designing an XML lexicon architecture for Arabic machine translation based on role and reference grammar.  MEDAR 2009: 2nd International Conference on Arabic Language Resources & Tools, 22-23 April 2009, Cairo, Egypt; pp.221-229. [PDF, 691KB]

(2008) Piotr Bański & Radsław Moszczyński: Enhancing an English-Polish electronic dictionary for multiword expression research. LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 4pp. [PDF, 56KB]

(2008) Doaa Samy & Ana González-Ledesma: Pragmatic annotation of discourse markers in a multilingual parallel corpus (Arabic-Spanish-English).  LREC 2008: 6th Language Resources and Evaluation Conference, Marrakech, Morocco, 26-30 May 2008; 7pp. [PDF, 301KB]

(2006) Silvia Hansen-Schirra, Stella Neumann, & Mihaela Vela : Multi-dimensional annotation and alignment in an English-German translation corpus. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.35-42. [PDF, 77KB]

(2006) Christian Lieske, Felix Sasaki, & Yves Savourel: W3C internationalization tag set – a gentle introduction.  Translating and the Computer 28: proceedings of the Twenty-eighth International Conference on Translating and the Computer, 16-17 November 2006, London. (London: Aslib, 2006); 22pp. [PDF, 126KB]; presentation, 39 slides [PDF, 1406KB]

(2006) Martin Volk, Sofia Gustafson-Capková, Joakim Lundborg, Torsten Marek, Yvonne Samuelsson, & Frida Tidström: XML-based phrase alignment in parallel treebanks. EACL-2006: Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-dimensional Markup in Natural Language Processing, April 4, 2006, Trento, Italy; pp.93-96. [PDF, 103KB]

(2006) Andrzej Zydroń: xml:tm – a radical new approach to translating XML based documents.  LREC-2006: Fifth International Conference on Language Resources and Evaluation. Third International Workshop on Language Resources for Translation Work, Research & Training (LR4Trans-III), Genoa, Italy, 28 May 2006; pp.59-66. [PDF, 570KB]

(2005) Mats Attnäs, Pierre Senellart, & Jean Senellart: Integration of SYSTRAN MT systems in an open workflow. MT Summit X, Phuket, Thailand, September 13-15, 2005, Conference Proceedings: the tenth Machine Translation Summit; pp.211-218. [PDF, 1951KB]

(2005) Carlos A.Estombelo-Montesco & Dilvan A.Moreira: UCL – Universal Communication Language. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.326-336 [abstract, PDF, 82KB]

(2005) Najeh Hajlaoui & Christian Boitet: A “pivot” XML-based architecture for multilingual, multiversion documents: parallel monolingual documents aligned through a central correspondence descriptor and possible use of UNL. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.309-325 [abstract, PDF, 85KB]

(2005) Wang-Ju Tsai: A platform for experimenting with UNL. In: Jesús Cardeńosa, Alexander Gelbukh, Edmundo Tovar (eds.): Universal Networking Language: advances in theory and applications (Mexico City: National Polytechnic Institute); pp.261-267 [abstract, PDF, 15KB]

(2005) José Vega, Benoît Lamey, & Jacques Vergne: MyCatex: a language-independent term extractor. Translating and the Computer 27: proceedings of the Twenty-seventh International Conference on Translating and the Computer, 24-25 November 2005, London. (London: Aslib, 2005); 10pp. [PDF, 179KB]