Machine Translation Archive

Index of Languages

Languages E: publications since 2010

[For languages A to D click here; for languages F to L click here, for languages M to Z click here]

[for publications before 2010 click here; for publications 2000-2009 click here]

 

Egyptian Arabic

(2012) Nizar Habash, Ramy Eskander, & Abdelati Hawwari: A morphological analyzer for Egyptian Arabic. SIGMORPHON-2012. Proceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology, Montréal, Canada, June 7, 2012; pp.1-9. [PDF, 147KB]

(2012) Amira Shoukry & Ahmed Rafea: Preprocessing Egyptian dialect tweets for sentiment mining. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.47-56. [PDF, 1193KB]

Egyptian Arabic to English

(2013) Hassan Sajjad, Kareem Darwish, & Yonatan Belinkov: Translating dialectal Arabic to English. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.1-6. [PDF, 182KB]

English

(2015) Michael Heck, Quoc Truong Do, Sakriani Sakti, Graham Neubig, & Satoshi Nakamura: The NAIST English speech recognition system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.105-111. [PDF, 2.8MB]

(2015) Van Huy Nguyen, Quoc Bao Nguyen, Tat Thang Vu, & Chi Mai Luong: The IOIT English ASR system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.84-87. [PDF, 1.2MB]

(2015) Tran Huy Dat, Jonathan William Dennis, & Ng Wen Zheng Terence: The I2R ASR system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.88-92. [PDF, 1.8MB]

(2014) Ahmed El Kholy & Nizar Habash: Alignment symmetrization optimization targeting phrase pivot statistical machine translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.63-70. [PDF, 478KB]

(2014) Fabrizio Gotti, Philippe Langlais, & Atefeh Farzindar: Hashtag occurrences, layout and translation: a corpus-driven analysis of tweets published by the Canadian government.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2254-2261. [PDF, 486KB]

(2014) Homa B.Hashemi & Rebecca Hwa: A comparison of MT errors and ESL errors.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 82KB]

(2014) Adam Kilgarriff, Miloš Jakubíček, Vojtěch Kovář, Pavel Rychlý, Vít Baisa, & Lucia Kocincová: Extrinsic corpus evaluation with a collocation dictionary task. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.545-552. [PDF, 143KB]

(2013) Peter Bell, Fergus McInnes, Siva Reddy Gangireddy, Mark Sinclair, Alexandra Birch, & Steve Renals: The UEDIN English ASR system for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 173KB]

(2013) Alexandra Birch, Nadir Durrani, & Philipp Koehn: English SLT and MT system description for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 9pp. [PDF, 225KB]

(2013) Ido Dagan, Bernardo Magnini, Guenter Neumann, & Sebastian Pado: EXploring Customer Interactions through Textual EntailMENT. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.417. [PDF, 201KB]

(2013) Daniele Falavigna, Roberto Gretter, Fabio Brugnara, & Diego Giuliani: FBK @ IWSLT 2013 – ASR tracks. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 4pp. [PDF, 104KB]

(2013) Juri Ganitkevitch, Benjamin Van Durme, & Chris Callison-Burch: PPDB: the paraphrase database. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.758-764. [PDF, 217KB]

(2013) Chien-Lin Huang, Paul R.Dixon, Shigeki Matsuda, Youzheng Wu, Xugang Lu, Masahiro Saiko, & Chiori Hori: The NICT ASR system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 225KB]

(2013) David Kauchak: Improving text simplification language modeling using unsimplified text data. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1537-1546. [PDF, 302KB]

(2013) Kevin Kilgour, Christian Mohr, Michael Heck, Quoc Bao Nguyen, Van Huy Nguyen, Evgeniy Shin, Igor Tseyzer, Jonas Gehring, Markus Müller, Matthias Sperber, Sebastian Stüker & Alex Waibel: The 2013 KIT IWSLT speech-to-text systems for German and English. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 86KB]

(2013) Karl Pichotta & John DeNero: Identifying phrasal verbs using many bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.636-646. [PDF, 267KB]

(2013) Sakriani Sakti, Keigo Kubo, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: The NAIST English speech recognition system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 109KB]

(2013) Svitlana Volkova, Theresa Wilson, & David Yarowsky: Exploring sentiment in social media: bootstrapping subjectivity clues from multilingual Twitter streams.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.505-510. [PDF, 317KB]

(2013) Lingxiao Wang & Christian Boitet: Online production of HQ parallel corpora and permanent task-based evaluation of multiple MT systems: both can be obtained through iMAGs with no added cost. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 103-110. [PDF, 2109KB]

(2013) Yulan Yan, Chikara Hashimoto, Kentaro Torisawa, Takao Kawai, Jun’ichi Kazama, & Stijn de Saeger: Minimally supervised method for multilingual paraphrase extraction from definition sentences on the web. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.63-73. [PDF, 381KB]

(2012) Cristina Bosco, Manuela Sanguinetti, & Leonardo Lesmo: The Parallel-TUT: a multilingual and multiformat treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3982-3987. [PDF, 460KB]

(2012) Chen Chen & Vincent Ng: Combining the best of two worlds: a hybrid approach to multilingual coreference resolution. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.56-63. [PDF, 378KB]

(2012) D.Falavigna, R.Gretter, F.Brugnara, & D.Giuliani: FBK@IWSLT 2012 – ASR track. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.81-86. [PDF, 732KB]

(2012) Roger Granada, Lucelene Lopes, Carlos Ramisch, Cassia Trojahn, Renata Vieira, & Aline Villavicencio: A comparable corpus based on aligned multilingual ontologies. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.25-31. [PDF, 108KB]

(2012) Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, & Pawel Swietojanski: The UEDIN systems for the IWSLT 2012 evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.46-53. [PDF, 651KB]; presentation: 103 slides [PDF of PPT, 384KB]

(2012) Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, & Alex Waibel: The KIT-NAIST (contrastive) English ASR system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.91-95. [PDF, 637KB]

(2012) Baoli Li: Learning to model multilingual unrestricted coreference in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.129-135. [PDF, 130KB]

(2012) Jiří Maršík & Ondřej Bojar: TrTok: a fast and trainable tokenizer for natural languages. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.75-85. [PDF, 141KB]

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

(2012) Stephan Peitz, Saab Mansour, Markus Freitag, Minwei Feng, Matthias Huck, Joern Wuebker, Malte Nuhn, Markus Nußbaum-Thom, & Hermann Ney: The RWTH Aachen speech recognition and machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.69-76. [PDF, 623KB]; presentation, 19 slides [PDF of PPT, 165KB]

(2012) Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebatian Stüker, Sakriani Sakri, Graham Neubig, Tomoki Toda, Satoshi Nakamura, & Alex Waibel: The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.87-90. [PDF, 621KB]

 (2012) Sanja Štajner & Ruslan Mitkov: Using comparable corpora to track diachronic and synchronic changes in lexical density and lexical richness.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.88-97. [PDF, 383KB]

(2012) Hao Xiong & Qun Liu: ICT: system description for CoNLL-2012. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.71-75. [PDF, 557KB]

(2012) Desislava Zhekova, Sandra Kübler, Joshua Bonner, Marwa Ragheb, & Yu-Yin Hsu: UBIU for multilingual coreference resolution in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.88-94. [PDF, 113KB]

(2011) Kazuhiko Abe, Youzheng Wu, Chien-lin Huang, Paul R.Dixon, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: The NICT ASR system for IWSLT2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.28-33. [PDF, 266KB]

(2011) A.Ryan Aminzadeh, Tim Anderson, Ray Slyh, Brian Ore, Eric Hansen, Wade Shen, Jennifer Drexler, & Terry Gleason: The MIT-LL/AFRL IWSLT-2011 MT system. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.34-40. [PDF, 254KB]

 (2011) Elena Lloret & Manuel Palomar: Finding the best approach for multi-lingual text summarisation: a comparative analysis. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.194-201. [PDF, 386KB]

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

(2011) Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk, & Yannick Estève: LIUM’s systems for the IWSLT 2011 speech translation tasks. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.79-85. [PDF, 250 KB]

(2011) N.Ruiz, A.Bisazza, F.Brugnara, D.Falavigna, D.Giuliani, S.Jaber, R.Gretter, & M.Federico: FBK@IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.86-93. [PDF, 203KB]

 (2011) Anders Søgaard & Martin Haulrich: Sentence-level instance-weighting for graph-based and transition-based dependency parsing. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.43-47. [PDF,

(2011) Sebastian Stüker, Kevin Kilgour, Christian Saam, & Alex Waibel: The 2011 KIT English ASR system for the IWSLT evaluation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.94-97. [PDF, 198KB]

 (2011) Marco Turchi, Vanni Zavarella, & Hristo Tanev: Pattern learning for event extraction using monolingual statistical machine translation. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.371-377. [PDF, 148KB]

(2010) Marina Litvak, Hagay Lipman, Assaf Ben Gur, Mark Last, Slava Kisilevich, & Daniel Keim: Towards multi-lingual summarization: a comparative analysis of sentence extraction methods on English and Hebrew corpora. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.61-69. [PDF, 228KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

(2010) Horacio Saggion, Juan-Manuel Torres-Moreno, Iria da Cunha, Eric SanJuan, & Patricia Velázquez-Morales: Multilingual summarization evaluation without human models. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1059-1067. [PDF, 238KB]

English to Amharic

 (2012) Michael Gasser: Toward a rule-based system for English-Amharic translation. [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.41-46. [PDF, 319KB]

(2011) Michael Gasser: Towards synchronous extensible dependency grammar. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.3-10. [PDF, 265KB]

English to Arabic (see also English-Iraqi Arabic)

(2015) Nina Seemann, Fabienne Braune, & Andreas Maletti: A systematic evaluation of MBOT in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.200-214. [PDF, 585KB]

(2013) Mahmoud Azab, Houda Bouamor, Behrang Mohit, & Kemal Oflazer: Dudley North visits North London: learning when to transliterate to Arabic. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.439-444. [PDF, 148KB]

(2013) Mahmoud Ghoneim & Mona Diab: Multiword expressions in the context of statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1181-1187. [PDF, 120KB]

(2013) Than-Le Ha, Teresa Herrmann, Jan Niehues, Mohammed Mediani, Eunah Cho, Yuqi Zhang, Isabel Slawik, & Alex Waibel: The KIT translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 147KB]

(2013) Hassan Sajjad, Francisco Guzmán, Preslav Nakov, Ahmed Abdelali, Kenton Murray, Fahad Al Obaidli, & Stephan Vogel: QCRI at IWSLT 2013: experiments in Arabic-English and English-Arabic spoken language translation.  [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 176KB]

(2013) Mohammad Salameh, Colin Cherry, & Grzegorz Kondrak: Reversing morphological tokenization in English-to-Arabic SMT. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.47-53. [PDF, 161KB]

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

(2013) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Integrating morpho-syntactic features in English-Arabic statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.74-81. [PDF, 491KB]

(2012) Sameh Alansary: A formalized reference grammar for UNL-based machine translation between English and Arabic. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.33-42. [PDF, 352KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Ahmed El Kholy & Nizar Habash: Translate, predict or generate: modeling rich morphology in statistical machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.27-34. [PDF, 174KB]

(2012) Spence Green & John DeNero: A class-based agreement model for generating accurately inflected translations. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.22-30. [PDF, 284KB]

(2012) Najeh Hajlaoui & Andrei Popescu-Belis: Translating English discourse connectives into Arabic: a corpus-based analysis and an evaluation metric. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.1-8. [PDF, 731KB]

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2012) Giuliano Lancioni & Marco Boella: Idiomatic MWEs and machine translation, a retrieval and representation model: the AraMWE project. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.9-16.[PDF, 786KB]

(2012) Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel & Alexander Waibel: The KIT translation systems for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.38-45. [PDF, 640KB]; presentation, 23 slides [PDF of PPT, 469KB]

(2011) Abhijit Bhole, Goutham Tholpadi, & Raghavendra Udupa: Mining multi-word named entity equivalents from comparable corpora. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.65-72. [PDF, 820KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Ahmed El Kholy & Nizar Habash: Automatic error analysis for morphologically rich languages. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.225-232. [PDF, 219KB]

(2011) Olivier Hamon & Khalid Choukri: Evaluation methodology and results for English-to-Arabic MT.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.480-487. [PDF,

(2011) Alon Lavie & Hassan Al-Haj: The impact of Arabic morphological segmentation on broad-scale phrase-based SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 38 slides [PDF of PPT, 362KB]

 (2011) Yuval Marton, Ahmed El Kholy, & Nizar Habash: Filtering antonymous, trend-contrasting, and polarity-dissimilar distributional paraphrases for improving statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.237-249. [PDF, 177KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

(2010) Hassan Al-Haj & Alon Lavie: The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 72KB]

(2010) Mossab Al-Hunaity, Bente Maegaard, & Dorte Hansen: Using English as a pivot language to enhance Danish-Arabic statistical machine translation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 108-113. [PDF, 768KB]

(2010) Marine Carpuat & Mona Diab: Task-based evaluation of multiword expressions: a pilot study in statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.242-245. [PDF, 92KB]

(2010) Michael Denkowski, Hassan Al-Haj, & Alon Lavie: Turker-assisted paraphrasing for English-Arabic machine translation. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.66-70. [PDF, 237KB]

(2010) Michael Denkowski & Alon Lavie: Exploring normalization techniques for human judgments of machine translation adequacy collected using Amazon Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.57-61. [PDF, 248KB]

(2010) Ahmed El Kholy & Nizar Habash: Orthographic and morphological processing for English-Arabic statistical machine translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 147KB]

English to Basque

(2014) Nora Aranberri, Gorka Labaka, Arantza Diaz de Ilarraza, & Kepa Sarasola Comparison of post-editing productivity between professional translators and lay users. AMTA 2014: proceedings of the eleventh conference of the  Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.20-33. [PDF, 247KB]

(2013) Olatz Perez-de-Viñaspre, Maite Oronoz, Manex Agirrezabal & Mikel Lersundi: A finite state approach to translate SNOMED CT terms into Basque using medical prefixes and suffixes. Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, St Andrews, Scotalnd, 15-17 July 2013; pp.99-103. [PDF, 169KB]

(2010) Alberto Barrón-Cedeño, Paolo Rosso, Eneko Agirre, & Gorka Labaka: Plagiarism detection across distant language pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.37-45. [PDF, 274KB]

English to Bengali

(2014) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: Word alignment-based reorderingof source chunks in PB-SMT.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3565-3571. [PDF, 185KB]

(2013) Rajdeep Gupta, Santanu Pal, & Sivaji Bandyopadhyay: Improving MT system using extracted parallel fragments of text from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.69-76. [PDF, 436KB]

(2013) Santanu Pal, Sudio Kumar Naskar, & Sivaji Bandyopadhyay: MWE alignment in phrase based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.61-68. [PDF, 539KB]

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) Debasis Ganguly, Johannes Leveling, & Gareth J.F.Jones: Cross-lingual topical relevance models. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.927-942. [PDF, 610KB]

(2012) Utpal Garain, Arjun Das, David S.Doermann, Douglas D.Oard: Leveraging statistical transliteration for dictionary-based English-Bengali CLIR of OCR’d text. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.339-348. [PDF, 235KB]

(2012) Santanu Pal & Sivaji Bandyopadhyay: Bootstrapping method for chunk alignment in phrase based SMT.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 93-100. [PDF, 246KB]

(2011) Khan Md. Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Example-based machine translation for low-resource language using chunk-string templates. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.488-495. [PDF, 531KB]

(2011) Santanu Pal, Tanmoy Chakraborty, & Sivaji Bandyopadhyay: Handling multiword expressions in phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.215-224. [PDF, 161KB]

(2010) Md.Zahurul Islam, Jörg Tiedemann & Andreas Eisele: English to Bangla phrase-based machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 601KB]

English to Brazilian Portuguese

(2015) Prashant Mathur, Marcello Federico, Selçuk Köprü, Sharam Khadivi, & Hassan Sawaf: Topic adaptation for machine translation of e-commerce content. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.270-282. [PDF, 657KB]

(2011) Fabio Alves, Adrianna Pagano, & Igor L. da Silva: Modeling (un)packing of meaning in translation: insights from effortful text production. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.153-163. [PDF, 1064KB]

(2011) Daniel Emilio Beck: Syntax-based statistical machine translation using tree automata and tree transducers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.36-40. [PDF, 124KB]

(2010) Helena de Medeiros Caseli, Bruno Akio Sugiyama, & Junia Coutinho Anacleto: Using common sense to generate culturally contextualized machine translation. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, Los Angeles, CA, June 2010; pp.24-31. [PDF, 353KB]

English to Bulgarian

(2014) Krasimir Angelov: Bootstrapping open-source English-Bulgarian computational dictionary. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1018-1023. [PDF, 158KB]

 (2014) Valia Kordoni & Iliana Simova: Multiword expressions in machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1208-1211. [PDF, 110KB]

(2013) Iliana Simova & Valia Kordoni: Improving English-Bulgarian statistical machine translation by phrasal verb treatment. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.62-71. [PDF, 279KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2010) Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, & Chris Quirk: A discriminative lexicon model for complex morphology. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 374KB]

English to Chinese

(2015) Matthias Huck & Alexandra Birch: The Edinburgh machine translation systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 31-38. [PDF, 3.1MB]

(2015) Michaeel Kazi, Brian Thompson, Elizabeth Salesky, Timothy Anderson, Grant Erdmann, Eric Hansen, Brian Ore, Katherine Young, Jeremy Gwinnup, Michael Hutt, & Christina May: The MITLL-AFRL IWSLT 2015 MT system. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 23-30. [PDF, 2.8MB]

(2015) Nina Seemann, Fabienne Braune, & Andreas Maletti: A systematic evaluation of MBOT in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.200-214. [PDF, 585KB]

(2014) Ann Bies, Justin Mott, Seth Kulick, Jennifer Garland, & Colin Warner: Incorporating alternate translations into English Translation Treebank.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1863-1868. [PDF, 339KB]

(2014) Wan Yu Ho, Christine Kng, Shan Wang & Francis Bond: Identifying idioms in Chinese translations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.716-721. [PDF, 204KB]

(2014) Liangyou Li, Andy Way, & Qun Liu: A discriminative framework of integrating translation memory features into SMT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 249-260. [PDF, 638KNB]

(2014) Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, & Xiuhong Zhang: Not an interlingua, but close: comparison of English AMRs to Chinese and Czech. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1765-1772. [PDF, 417KB]

(2013) Lei Cui, Xilun Chen, Dongdong Zhang, Shujie Liu, Mu Li, & Ming Zhou: Multi-domain adaptation for SMT using multi-task learning. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1055-1065. [PDF, 302KB]

(2013) Than-Le Ha, Teresa Herrmann, Jan Niehues, Mohammed Mediani, Eunah Cho, Yuqi Zhang, Isabel Slawik, & Alex Waibel: The KIT translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 147KB]

(2013) An-Chang Hsieh, Hen-Hsen Huang, & Hsin-His Chen: Uses of monolingual in-domain corpora for cross-domain adaptation with hybrid MT approaches. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.117-122. [PDF, 275KB]

(2013) Fei Huang & Cezar Pendus: Generalized reordering rules for improved SMT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.387-392. [PDF, 113KB]

(2013) Georgios Kontonatsios, Ioannis Korkontzelos, Jun’ichi Tsujii, & Sophia Ananiadou: Using a random forest classifier to recognise translations of biomedical terms across languages.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.95-104. [PDF, 656KB]

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2013) Taesung Lee & Seung-won Hwang: Bootstrapping entity translation on weakly comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.631-640. [PDF, 624KB]

(2013) Tingting Li, Tiejun Zhao, Andrew Finch, & Chunyue Zhang: A tightly-coupled unsupervised clustering and bilingual alignment model for transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.393-398. [PDF, 698KB]

(2013) Chi-kiu Lo, Meriem Beloucif, & Dekai Wu: Improving machine translation into Chinese by tuning against Chinese MEANT. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 284KB]

(2013) Xingyuan Peng, Xiaoyin Fu, Wei Wei, Zhenbiao Chen, Wei Chen, & Bo Xu: The CASIA machine translation system for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 158KB]

(2013) Nan Yang, Shujie Liu, Mu Li, Ming Zhou, & Nenghai Yu: Word alignment modeling with context dependent deep neural network. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.166-175. [PDF, 554KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2013) Chao Zhang, Shiqi Zhao, & Haifeng Wang: Bootstrapping large-scale named entities using URL-text hybrid patterns. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.293-301. [PDF, 249KB]

(2012) Zeeshan Ahmed, Jie Jiang, Julie Carson-Berndsen, Peter Cahill, & Andy Way: Hierarchical phrase-based MT for phonetic representation-based speech translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 246KB]

(2012) Joseph Z.Chang, Jason S.Chang, & Jyh-Shing Roger Jang: Learning to find translations and transliterations on the web. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.130-134. [PDF, 270KB]

(2012) Han-Bin Chen, Hen-Hsen Huang, Hsin-His Chen, & Ching-Ting Tan: A simplification-translation-restoration framework for cross-domain SMT applications. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.545-560. [PDF, 744KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Wei He, Hua Wu, Haifeng Wang, & Ting Liu: Improve SMT quality with automatically extracted paraphrase rules. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.979-987. [PDF, 330KB]

 (2012) Hieu Hoang: Phrase-based translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 103 slides [PDF of PPT, 1227KB]

(2012) Maxim Khalilov & Rahzeb Choudury: Building English-Chinese and Chinese-English MT engines for the computer software domain. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.7-11. [PDF, 193KB]

(2012) Grzegorz Kondrak & Mohammad Salameh: Transliteration experiments on Chinese and Arabic. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.71-75. [PDF, 73KB]

(2012) Chan-Hung Kuo, Shih-Hung Liu, Mike Tian-Jian Jiang, Cheng-Wei Lee, & Wen-Lian Hsu: Cost-benefit analysis of two-stage conditional random fields based English-to-Chinese machine transliteration. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.76-80. [PDF, 121KB]

(2012) Chang Liu & Hwee Tou Ng: Character-level machine translation evaluation for languages with ambiguous word boundaries. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.921-929. [PDF, 196KB]

 (2012) Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel & Alexander Waibel: The KIT translation systems for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.38-45. [PDF, 640KB]; presentation, 23 slides [PDF of PPT, 469KB]

(2012) Hisami Suzuki & Jianfeng Gao: A unified approach to transliteration-based text input with online spelling correction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.609-618. [PDF, 432KB]

 (2012) Yunqing Xia, Guoyu Tang, Peng Jin, & Xia Yang: CLTC: a Chinese-English cross-lingual topic corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.532-537. [PDF, 955KB]

 (2012) Guangyou Zhou, Kang Liu, & Jun Zhao: Exploiting bilingual translation for question retrieval in community-based question answering. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3153-3170. [PDF, 352KB]

(2011) Han-Bin Chen, Hen-Hsen Huang, Jengwei Tjiu, Ching-Ting Tan, & Hsin-His Chen: Identification and translation of significant patterns for cross-domain SMT applications. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.277-284. [PDF, 137KB]

 (2011) Yu Chen, Rui Wang, & Yi Zhang: Statistical machine transliteration with multi-to-multi joint source channel model. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.101-105. [PDF, 194 KB]

(2011) Ruiji Fu, Bing Qin, & Ting Liu: Generating Chinese named entity data from a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.264-272. [PDF, 450KB]

 (2011) Wei He, Shiqi Zhao, Haifeng Wang, & Ting Liu: Enriching SMT training data via paraphrasing. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.803-810. [PDF, 157KB]

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

(2011) Yun Huang, Min Zhang, & Chew Lim Tan: Nonparametric Bayesian machine transliteration with asynchronous adaptor grammars. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.534-539. [PDF, 162KB]

(2011) Hideto Ikeda, Nguyen Thanh Hung, & Ze Zhong Li: Functional approach for patent translation. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.64-69. [PDF, 144KB]

(2011) Hongfei Jiang: Expanding the applications of MT in patent translation. Invited talk at [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.17-36. [PDF, 7898KB]

 (2011) Jie Jiang, Jinhua Du, & Andy Way: Incorporating source-language paraphrases into phrase-based SMT with confusion networks. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.31-40. [PDF, 1325KB]

(2011) Jie Jiang, Zeeshan Ahmed, Julie Carson-Berndsen, Peter Cahill, & Andy Way: Phonetic representation-based speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.81-88. [PDF, 320KB]

 (2011) Mike Tian-Jian Jiang, Chan-Hung Kuo, & Wen-Lai Hsu: English-toChinese machine transliteration using accessor variety features of source graphemes. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.86-90. [PDF, 282KB]

 (2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

 (2011) Oi Yee Kwong: English-Chinese personal name transliteration by syllable-based maximum matching. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.96-100. [PDF, 256KB]

(2011) Hyoung-Gyu Lee, Joo-Young Lee, Min-Jeong Kim, Hae-Chang Rim, Joong-Hwi Shin, & Young-Sook Hwang: Phrase segmentation model using collocation and translational entropy.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.198-205. [PDF, 203KB]

(2011) Chi-Ho Li, Shujie Liu, Chenguang Wang, & Ming Zhou: ENGtube: an integrated subtitle environment for ESL. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.373-377. [PDF, 360KB]

(2011) Maoxi Li, Chenqing Zong, & Hwee Tou Ng: Automatic evaluation of Chinese translation output: word-level or character-level? ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.159-164. [PDF, 151KB]

 (2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

(2011) Yanjun Ma, Yifan He, Andy Way, & Josef van Genabith: Consistent translation using discriminative learning: a trnslation memory-inspired approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1239-1248. [PDF, 193KB]

 (2011) Yuval Marton, Ahmed El Kholy, & Nizar Habash: Filtering antonymous, trend-contrasting, and polarity-dissimilar distributional paraphrases for improving statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.237-249. [PDF, 177KB]

(2011) Adam Meyers, Michiko Kosaka, Shasha Liao, & Nianwen Xue: Improving MT word alignment using aligned multi-stage parses. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.88-97. [PDF, 174KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

 (2011) Seung-Hoon Na & Hwee Tou Ng: Enriching document representation via translation for improved monolingual information retrieval. SIGIR July 24-28, 2011, Beijing, China; 10pp. [PDF, 543KB]

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

 (2011) Ying Qin & Guohua Chen: Forward-backward machine transliteration between English and Chinese based on combined CRFs. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.82-85. [PDF, 168KB]

(2011) Matthew Snover, Xiang Li, Wen-Pin Lin, Zheng Chen, Suzanne Tamang, Mingmin Ge, Adam Lee, Qi Li, Hao Li, Sam Anzaroot, & Heng Ji: Cross-lingual slot filling from comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.110-119. [PDF, 271KB]

(2011) Yanli Sun, Juan Li, & Yi Li: Deploying MT into a localisation workflow: pains and gains. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.239-243. [PDF, 83KB]

(2011) Hao Zhang, Licheng Fang, Peng Xu, & Xiaoyun Wu: Binarized forest to string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.835-845. [PDF, 212KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) Chen Yuncong & Pascale Fung: Unsupervised synthesis of multilingual Wikipedia articles. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.197-205. [PDF, 1040KB]

(2010) Hercules Dalianis, Hao-chun Xing, & Xin Zhang: Creating a reusable English-Chinese parallel corpus for bilingual dictionary construction. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1700-1705. [PDF, 409KB]

(2010) Jinhua Du, Jie Jiang, & Andy Way: Facilitating translation using source language paraphrase lattices.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.420-429. [PDF, 766KB]

(2010) Xiangyu Duan, Rafael E.Banchs, Jun Lang, Deyi Xiong, Aiti Aw, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.67-72. [PDF, 391KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Automatic determination of number of clusters for creating templates in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 582KB]; presentation: 37 slides [PDF, 496KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Monolingual distributional profiles for word substitution in machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.320-328. [PDF, 605KB]

(2010) Qin Gao & Stephan Vogel: Consensus versus expertise: a case study of word alignment with Mechanical Turk. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.30-34. [PDF, 389KB]

(2010) Ignacio Garcia: Does Google know better? Translators and machine translation. Translating and the Computer 32, 18-19 November 2010, London; presentation, 16 slides [PDF of PPT, 473KB]

(2010) Guillem Gascó, Vicent Alabau, Jesús Andrés-Ferrer, Jesús González-Rubio, Martha-Alicia Rocha, Germán Sanchis-Trilles, Francisco Casacuberta, Jorge González, & Joan-Andreu Sánchez: ITI-UPV system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.85-92. [PDF, 414KB]; presentation by Germán Sanchis-Trilles: 16 slides [PDF of PPT, 481KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation  system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Fei Huang & Bing Xiang: Feature-rich discriminative phrase rescoring for SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.492-500. [PDF, 260KB]

(2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

(2010) Maxim Khalilov & Khalil Sima’an: The ILLC-UvA SMT system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.197-203. [PDF, 1209KB]

 (2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

(2010) Lianhau Lee, Aiti Aw, Min Zhang, & Haizhou Li: EM-based hybrid model for bilingual terminology extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.639-646. [PDF, 114KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: The INESC-ID machine translation system for the IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.81-84. [PDF, 227KB]

(2010) Bin Lu, Tao Jiang, Kapo Chow, & Benjamin K. Tsou: Building a large English-Chinese parallel corpus from comparable patents and its experimental application to SMT.  [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.42-49. [PDF, 271KB]

 (2010) Luo Xuejuan: The course design of CAT for undergraduate English majors of Sun Yat-sen University [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 251-272.

(2010) Takashi Onishi, Masao Utiyama & Eiichiro Sumita: Paraphrase lattice for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.1-5. [PDF, 419KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

 (2010) Jocelyn Phillips, Carol Van Ess-Dykema, Timothy Allison & Laurie Gerber: Parallel corpus development at NVTC. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 7pp. [PDF, 173KB]; abstract

(2010) Hendra Setiawan, Chris Dyer, & Philip Resnik: Discriminative word alignment with a function word reordering model.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.534-544. [PDF, 200KB]

(2010) Lei Shi, Rada Mihalcea, & Mingjun Tian: Cross language text classification by model translation and semi-supervised learning. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1057-1067. [PDF, 525KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

(2010) Yanli Sun: Mining the correlation between human and automatic evaluation at sentence level. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1726-1730. [PDF, 619KB]

 (2010) Yanli Sun, Sharon O’Brien, Minako O’Hagan, & Fred Hollowood: A novel statistical pre-processing model for rule-based machine translation system.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 790KB]; presentation: 28 slides [PDF, 580KB]

(2010) Ling-Xiang Tang, Shlomo Geva, Andrew Trotman, & Yue Xu: A voting mechanism for named entity translation in English-Chinese question answering. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.43-51. [PDF, 554KB]

 (2010) Liang Tian, Fai Wong, & Sam Chao: An improvement of translation quality with adding key-words in parallel corpus. Ninth International Conference on Machine Learning and Cybernetics Qingdao, China, 11-14 July 2010; pp. 1273-1278. [PDF, 226KB]

(2010) Hao Xiong, Jun Xie, Hui Yu, Kai Liu, Wei Luo, Haitao Mi, Yang Liu, Yajuan Lü, & Qun Liu: The ICT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.73-79. [PDF, 510KB]

(2010) Yan Tian: YanFa: an online automatic scoring and intelligent feedback system of student English-Chinese translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Demonstrations volume; pp.25-28. [PDF, 227KB]

 (2010) Xiaojun Wan, Huiying Li, & Jianguo Xiao: Cross-language document summarization based on machine translation quality prediction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.834-843. [PDF, 318KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

(2010) Shumin Wu, Jinho D.Choi, & Martha Palmer: Detecting cross-lingual semantic similarity using parallel PropBanks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 274KB]

(2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

(2010) Li Zuo: Machine translation from English to Chinese: a study of Google’s performance with the UN documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

English to Croatian

(2014) Miquel Esplà-Gomis, Filip Klubička, Nikola Ljubešić, Sergio Ortiz-Rojas, Vassilis Papavassiliou, & Prokopis Prokopidis: Comparing two acquisition systems for automatically building an English-Croatian parallel corpus from multilingual websites.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1252-1258. [PDF, 158KB]

 (2014) Raphael Rubino, Antonio Toral, Nikola Ljubešić, & Gema Ramírez-Sánchez: Quality estimation for synthetic parallel data generation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1843-1849. [PDF, 248KB]

 (2014) Abu-MaTran: automatic building of machine translation. Project duration: January 2013 – December 2016. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.132. [PDF, 679KB]

(2012) Sanja Seljan, Tomislav Vičić, & Marija Brkić: BLEU evaluation of machine-translated English-Croatian legislation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2143-2148. [PDF, 332KB]

 (2011) Marija Brkić, Sanja Seljan, & Maja Matetić: Machine translation evaluation for Croatian-English and English-Croation language pairs. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.93-104. [PDF, 768KB]

English to Czech

(2015) Franck Burlot & François Yvon: Morphology-aware alignments for translation to and from a synthetic language. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 188-195. [PDF, 3.1MB]

(2015) Krzysztof Wolk & Krzysztof Marasek: PJAIT systems for the IWSLT 2015 evaluation campaign enhanced by comparable corpora. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.101-104. [PDF, 2.9MB]

(2014) Petra Barančíková: Parmesan: Meteor without paraphrases with paraphrased references. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.355-361. [PDF, 305KB]

(2014) Petra Barančiková, Rudolf Rosa, & Aleš Tamchyna: Improving evaluation of English-Czech MT through paraphrasing.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 127KB]

(2014) Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, & Daniel Zeman: Machine translation of medical texts in the Khresmoi project. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.221-228. [PDF, 513KB]

(2014) Yi Lu, Longyue Wang, Derek F.Wong, Lidia S.Chao, Yiming Wang, & Francisco Oliveira: Domain adaptation for medical text translation using web resources. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.233-238. [PDF, 485KB]

(2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

(2014) Aleš Tamchyna, Martin Popel, Rudolf Rosa, & Ondřej Bojar: CUNI in WMT14: Chimera still awaits Bellerophon. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.195-200. [PDF, 178KB]

(2014) Zdeňka Urešová, Ondřej Dušek, Jan Hajič, & Pavel Pecina: Multilingual test sets for machine translation of search queries for cross-lingual information retrieval in the medical domain. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3244-3247. [PDF, 116KB]

(2014) Longyue Wang, Yi Lu, Derek F.Wong, Lidia Chao, Yiming Wang, & Francisco Oliveira: Combining domain adaptation approaches for medical text translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.254-259. [PDF, 348KB]

(2014) Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, & Xiuhong Zhang: Not an interlingua, but close: comparison of English AMRs to Chinese and Czech. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1765-1772. [PDF, 417KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Ondřej Bojar, Rudolf Rosa & Aleš Tamchyna: Chimera – three heads for English-to-Czech translation.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.92-98. [PDF, 175KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Petra Galuščáková, Martin Popel & Ondřej Bojar: PhraseFix: statistical post-editing of TectoMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.141-147. [PDF, 157KB]

(2013) Nathan David Green & Zdeněk Žabokrtský: Improvements to syntax-based machine translation using ensemble dependency parsers. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 19-24. [PDF, 202KB]

(2013) Thomas Meyer & Lucie Poláková: Machine translation with many manually labeled discourse connectives. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.43-50. [PDF, 307KB]

(2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

(2013) Michal Novák, Zdeněk Žabokrtský & Anna Nedoluzhko: Two case studies on translating pronouns in a deep syntax framework. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1037-1041. [PDF, 348KB]

(2013) Rudolf Rosa, David Mareček, & Aleš Tamchyna: Deepfix: statistical post-editing of statistical machine translation using deep syntactic analysis. ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.172-179. [PDF, 178KB]

(2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2012) Eleftherios Avramidis, Marta R.Costa-jussà, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

 (2012) Amittai Axelrod, QingJun Li, & William D.Lewis: Applications of data selection via cross-entropy difference for real-world statistical machine translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.201-208. [PDF, 610KB]

(2012) Ondřej Bojar & Dekai Wu: Towards a predicate-argument evaluation for MT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.30-38. [PDF, 126KB]

(2012) Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, & Aleš Tamchyna: The joy of parallelism with CzEng 1.0.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3921-3928. [PDF, 460KB]

(2012) Ondřej Bojar, Bushra Jawaid, & Amir Kamran: Probes in a taxonomy of factored phrase-based models. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.253-260. [PDF, 134KB]

(2012) Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, & David Mareček: Formemes in English-Czech deep syntactic MT. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.267-274. [PDF, 316KB]

 (2012) Liane Guillou: Improving pronoun translation for statistical machine translation. [EACL 2012] Proceedings of the EACL Student Research Workshop, Avignon, France, 26 April 2012; pp. 1-10. [PDF, 373KB]

(2012) Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučiková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiři Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, & Zdeněk Žabokrtský: Announcing Prague Czech-English dependency treebank 2.0.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3153-3160. [PDF, 544KB]

(2012) Majid Razmara, Baskaran Sankaran, Ann Clifton, & Anoop Sarkar: Kriya –the SFU system for translation task at WMT-12. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.356-361. [PDF, 130KB]

(2012) Rudolf Rosa, David Mareček, & Ondřej Dušek: DEPFIX: a system for automatic correction of Czech MT outputs. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.362-368. [PDF, 114KB]

(2012) Rudolf Rosa, Ondřej Dušek, David Mareček, & Martin Popel: Using parallel features in parsing of machine-translated sentences for correction of grammatical errors. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.39-48. [PDF, 186KB]

(2012) Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, & Ondřej Bojar: Selecting data for English-to-Czech machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.374-381. [PDF, 110KB]

 (2011) Jan Berka, Martin Černý, & Ondřej Bojar: Quiz-based evaluation of machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.77-86. [PDF, 190KB]

 (2011) Ondřej Bojar: Analyzing error types in English-Czech machine translation. Prague Bulletin of Mathematical Linguistics 95, April 2011; pp.63-76. [PDF, 166KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

 (2011) Matouš Macháček & Ondřej Bojar: Approximating a deep-syntactic metric for MT evaluation and tuning. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.92-98. [PDF, 130KB]

 (2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

 (2011) Martin Popel, David Mareček, Nathan Green, & Zdeněk Žabokrtský: Influence of parser choice on dependency-based MT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.433-439. [PDF, 113KB]

(2011) Michael Subotin: An exponential translation model for target language morphology. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.230-238. [PDF, 188KB]

(2011) Daniel Zeman: Hierarchical phrase-based MT at the Charles University for the WMT 2011 shared task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.496-500. [PDF, 274KB]

  (2010) Ergun Biçici & S.Serdar Kozat: Adaptive model weighting and transductive regression for predicting best system combinations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 276-281. [PDF, 181KB]

  (2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

  (2010) Ondřej Bojar & Kamil Kos: 2010 failures in English-Czech phrase-based MT.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 60-66. [PDF, 132KB]

  (2010) Ondřej Bojar, Kamil Kos, & David Mareček: Tackling sparse data issue in machine translation evaluation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.86-91. [PDF, 132KB]

(2010) Ondřej Bojar, Adam Liška, & Zdeněk Žabokrtský: Evaluating utility of data sources in a large parallel Czech-English corpus CzEng 0.9. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.447-452. [PDF, 359KB]

(2010) Jinhua Du, Pavel Pecina, & Andy Way: An augmented three-pass system combination framework: DCU combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 290-295. [PDF, 550KB]

(2010) Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, & Chris Quirk: A discriminative lexicon model for complex morphology. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 374KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) David Mareček, Martin Popel, & Zdeněk Žabokrtský: Maximum entropy translation model in dependency-based MT framework. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 201-206. [PDF, 254KB]

 (2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Daniel Zeman: Hierarchical phrase-based MT at the Charles University for the WMT 2010 shared task.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 212-215. [PDF, 269KB]

English to Danish

(2012) Lene Offersgaard & Dorte Haltrup Hansen: SMT systems for less-resourced languages based on domain-specific data.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.75-80. [PDF, 380KB]

(2011) Michael Carl, Barbara Dragsted, Jakob Elming, Daniel Hardt, & Arnt Lykke Jakobsen: The process of post-editing: a pilot study. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.131-142. [PDF, 1078KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

English to Dari

(2011) Sankaranarayanan Ananthakrishnan, Rohit Prasad, & Prem Natarajan: On-line language model biasing for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.445-449. [PDF, 95KB]

 (2011) Sankaranarayanan Ananthakrishnan, Shiv Vitaladevuni, Rohit Prasad, & Prem Natarajan: Source error-projection for sample selection in phrase-based SMT for resource-poor languages. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.819-827. [PDF, 425KB]

English to Dutch

(2014) Joke Daems, Lieve Macken, & Sonia Vandepitte: On the origin of errors: a fine-grained analysis of MT and PE errors and their relationship. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.62-68. [PDF, 120KB]

(2014) Federico Gaspari, Antonio Toral, Sudip Kumar Naskar, Declan Groves, & Andy Way: Perception vs. reality: measuring machine translation post-editing productivity. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.60-72. [PDF, 160KB]

(2013) Joke Daems, Lieve Macken, & Sonia Vandepitte: Quality as the sum of its parts: a two-step approach for the identification of translation problems and translation quality assessment for HT and MT+PE.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 63-71. [PDF, 209KB]

(2013) Nathalie de Sutter & Joachim van den Bogaert: Quality or productivity? Let’s do both!  Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.381-390. [PDF of PPT, 824KB]

(2012) Lene Offersgaard & Dorte Haltrup Hansen: SMT systems for less-resourced languages based on domain-specific data.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.75-80. [PDF, 380KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

 (2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

(2011) PaCo-MT: parse and corpus-based machine translation. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.347. [PDF, 100KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Maxim Khalilov & Khalil Sima’an: A discriminative syntactic model for source permutation via tree transduction. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.92-100. [PDF, 760KB]

(2010) Maxim Khalilov & Khalil Sima’an: Source reordering using MaxEnt classifiers and supertags. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 605KB]

 (2010) Lieve Macken & Walter Daelemans: A chunk-driven bootstrapping approach to extracting translation patterns [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

English to Estonian

 (2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

(2010) Maxim Khalilov, Lauma Pretkalniņa, Natalja Kuvaldina, & Veronika Pereseina: SMT of Latvian, Lithuanian and Estonian languages: a comparative study. Human Language Technologies – the Baltic Perpective,  International Conference, Riga, Latvia, October 8, 2010; 8pp. [PDF, 145KB]

English to Farsi see English to Persian

English to Filipino [see also English to Tagalog]

(2015) Jasmine Ang, Marc Randell Chan, John Paolo Genato, Joyce Uy, & Joel Ilao: Development of a Filipino-to-English bidirectional statistical machine translation system that dynamically updates via user feedback. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.225-231. [PDF, 2.3MB]

English to Finnish

(2013) Maarit Koponen: This translation is not too bad: an analysis of post-editor choices in a machine-translation post-editing task. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 1-9. [PDF, 148KB]

(2012) Jyrki Niemi & Krister Lindén: Representing the translation relation in a bilingual wordnet.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2439-2446. [PDF, 365KB]

(2011) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Methods for smoothing the optimizer instability in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.32-39. [PDF, 150KB]

(2011) Ann Clifton & Anoop Sarkar: Combining morpheme-based machine translation with post-processing morpheme prediction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.32-42. [PDF, 287KB]

(2011) Jaako Väyrynen, Timo Honkela, & Marcus Dobrinkat: Context in machine translation: challenge and data set. META-NET Workshop: Context in MT, June 14, 2011; 10 slides [PDF of PPT, 146KB]

 (2010) Mikko Kurimo, William Byrne, John Dines, Philip N.Garner, Matthew Gibson, Yong Guan, Teemu Hirsimäki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu, & Junichi Yamagishi: Personalising speech-to-speech translation in the EMIME project. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.48-53. [PDF, 679KB]

(2010) Minh-Thang Luong & Min-Yen Kan: Enhancing morphological alignment for translating highly inflected languages. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.743-751. [PDF, 610KB]

(2010) Minh-Thang Luong, Preslav Nakov, & Min-Yen Kan: A hybrid morpheme-word representation for machine translation of morphologically rich languages.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.148-157. [PDF, 192KB]

English to French

(2015) Mercedes Garcia Martínez, Loïc Barrault, Anthony Rousseau, Paul Deléglise, & Yannick Estève: The LIUM ASR and SLT systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 50-54. [PDF, 1.8MB]

(2015) Krzysztof Wolk & Krzysztof Marasek: PJAIT systems for the IWSLT 2015 evaluation campaign enhanced by comparable corpora. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.101-104. [PDF, 2.9MB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, & Daniel Zeman: Machine translation of medical texts in the Khresmoi project. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.221-228. [PDF, 513KB]

(2014) François Lanctôt: When to choose SMT: typology of documents. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.38-49. [PDF, 532KB]

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Christian Hardmeier: Discourse in statistical machine translation [abstract]. PhD thesis, Upsala University, 2014; 2pp. [PDF, 15KB]

(2014) Teresa Herrmann, Mohammed Mediani, Eunach Cho, Thanh-Le Ha, Jan Niehues, Isabel Slawik, Yuqi Zhang, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.130-135. [PDF, 201KB]

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Liangyou Li, Andy Way, & Qun Liu: A discriminative framework of integrating translation memory features into SMT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 249-260. [PDF, 638KNB]

(2014) Yi Lu, Longyue Wang, Derek F.Wong, Lidia S.Chao, Yiming Wang, & Francisco Oliveira: Domain adaptation for medical text translation using web resources. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.233-238. [PDF, 485KB]

(2014) Tsuyoshi Okita, Ali Hosseinzadeh Vahid, Andy Way, & Qun Liu: The DCU terminology translation system for the medical query subtask at WMT14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.239-245. [PDF, 352KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Nicolas Pécheux, Li Gong, Quoc Khanh Do, Benjamin Marie, Yulia Ivanishcheva, Alexandre Allauzen, Thomas Lavergne, Jan Niehues, Aurélien Max, & François Yvon: LIMSI @ WMT’14 medical translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.246-253. [PDF, 316KB]

(2014) Lise Rebout & Philippe Langlais: An iterative approach for mining parallel sentences in a comparable corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.648-655. [PDF, 201KB]

(2014) Raphael Rubino, Antonio Toral, Victor M.Sánchez-Cartagena, Jorge Ferrández-Tordera, Sergio Ortiz-Rojas, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, & Andy Way: Abu-MaTran at WMT 2014 translation task: two step data selection and RBMT-style synthetic rules. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.171-177. [PDF, 313KB]

(2014) Nasredine Semmar, Othman Zennaki, & Meriama Laib: Using cross-language information retrieval and statistical language modelling in example-based machine translation. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.36-44. [PDF, 274KB]

(2014) Michel Simard: Clean data for training statistical MT: the case of MT contamination. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.69-82. [PDF, 533KB]

(2014) Zdeňka Urešová, Ondřej Dušek, Jan Hajič, & Pavel Pecina: Multilingual test sets for machine translation of search queries for cross-lingual information retrieval in the medical domain. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3244-3247. [PDF, 116KB]

(2014) Longyue Wang, Yi Lu, Derek F.Wong, Lidia Chao, Yiming Wang, & Francisco Oliveira: Combining domain adaptation approaches for medical text translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.254-259. [PDF, 348KB]

 (2014) Marion Weller, Alexander Fraser, & Ulrich Heid: Combining bilingual terminology mining and morphological modeling for domain adaptation in SMT. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.11-18. [PDF, 387KB]

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2013) Anabela Barreiro, Johanna Monti, Brigitte Orliac, & Fernando Batista: When multiwords go bad in machine translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.26-33. [PDF, 219KB]

(2013) Nicola Bertoldi, M.Amin Farajian, Prashant Mathur, Nicholas Ruiz, & Marcello Federico: FBK’s machine translation systems for the IWSLT 2013 evaluation campaign. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 134KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Alexandra Birch, Nadir Durrani, & Philipp Koehn: English SLT and MT system description for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 9pp. [PDF, 225KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Context vector disambiguation for bilingual lexicon extraction from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.759-764. [PDF, 199KB]

(2013) Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, & Marcello Federico: Report on the 10th IWSLT evaluation campaign. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 18pp. [PDF, 270KB]

(2013) Eunah Cho, Thanh-Le Ha, Mohammed Mediani, Jan Niehues, Teresa Herrmann, Isabel Slawik & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2013. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.104-108. [PDF, 107KB]

(2013) Kevin Duh, Graham Neubig, Katsuhito Sudoh, & Hajime Tsukada: Adapation data selection using neural language models: experiments in machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.678-683. [PDF, 215KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Mark Fishel: Ranking translations using error analysis and quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.405-407. [PDF, 98KB]

(2013) Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Nadir Durrani, Matthias Huck, Philipp Koehn, Thanh-Le Ha, Jan Niehues, Mohammed Mediani, Teresa Herrmann, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 128KB]

(2013) Spence Green, Daniel Cer, Kevin Reschke, Rob Voigt, John Bauer, Sida Wang, Natalia Silveira, Julia Neidert & Christopher D. Manning: Feature-rich phrase-based translation: Stanford University’s submission to the WMT 2013 translation task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.148-153. [PDF, 272KB]

(2013) Than-Le Ha, Teresa Herrmann, Jan Niehues, Mohammed Mediani, Eunah Cho, Yuqi Zhang, Isabel Slawik, & Alex Waibel: The KIT translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 147KB]

(2013) Najeh Hajlaoui: Are ACT’s scores increasing with better translation quality? WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.408-413. [PDF, 255KB]

(2013) Rima Harastani, Béatrice Daille & Emmanuel Morin: Ranking translation candidates acquired from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.401-409. [PDF, 276KB]

(2013) Christian Hardmeier, Jörg Tiedemann & Joakim Nivre: Latent anaphora resolution for cross-lingual pronoun prediction. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.380-391. [PDF, 274KB]

(2013) Amir Hazem & Emmanuel Morin: A comparison of smoothing techniques for bilingual lexicon extraction from comparable corpora.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.24-33. [PDF, 546KB]

(2013) Nal Kalchbrenner & Phil Blunsom: Recurrent continuous translation models. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1700-1709. [PDF, 274KB]

(2013) Michaeel Kazi, Michael Coury, Elizabeth Salesky, Jessica Ray, Wade Shen, Terry Gleason, Tim Anderson, Grant Erdmann, Lane Schwartz, Brian Ore, Raymond Slyh, Jeremy Gwinnup, Katherine Young, & Michael Hutt: The MIT-LL/AFRL IWSLT-2013 MT system. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 195KB]

(2013) Georgios Kontonatsios, Ioannis Korkontzelos, Jun’ichi Tsujii, & Sophia Ananiadou: Using a random forest classifier to recognise translations of biomedical terms across languages.  Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.95-104. [PDF, 656KB]

(2013) Evgeny Matusov & Gregor Leusch: OmnifluentTM English-to-French and Russian-to-English systems for the 2013 Workshop on Statistical Ma­chine Translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.158-163. [PDF, 136KB]

(2013) Thomas Meyer, Cristina Grisot, & Andrei Popescu-Belis: Detecting narrativity to improve English to French translation of simple past verbs. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.33-42. [PDF, 307KB]

(2013) Thomas Meyer & Bonnie Webber: Implicitation of discourse connectives in (machine) translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.19-26. [PDF, 396KB]

(2013) Shachar Mirkin, Sriram Venkatapathy, Marc Dymetman, & Ioan Calapodescu: SORT: an interactive source-rewriting tool for improved translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.85-90. [PDF, 943KB]

(2013) Linda Mitchell, Johann Roturier, & Sharon O’Brien: Community-based post-editing of machine-translated content: monolingual vs. bilingual. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 35-43. [PDF, 216KB]

(2013) Sudip Kumar Naskar, Antonio Toral, Federico Gaspari, & Declan Groves: Meta-evaluation of a diagnostic quality metric for machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.135-142. [PDF, 460KB]

(2013) Ngoc-Quan Pham, Hai-Son Le, Tat-Thang Vu, & Chi-Mai Luong: The speech recognition and machine translation system of IOIT for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 240KB]

(2013) Carlos Ramish, Laurent Besacier, & Alexander Kobzar: How hard is it to automatically translate phrasal verbs from English to French?  [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.53-61. [PDF, 303KB]

(2013) Raphael Rubino, Antonio Toral, Santiago Cortés Vaíllo, Jun Xie, Xiaofeng Wu, Stephen Doherty, & Qun Liu: The CNGL-DCU-Prompsit translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.213-218. [PDF, 152KB]

(2013) Raphael Rubino, Jennifer Foster, Rasoul Samad Zadeh Kaljahi, Johann Roturier & Fred Hollowood: Estimating the quality of translated user-generated content. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1167-1173. [PDF, 164KB]

(2013) Rasoul Samad Zadeh Kaljahi, Jennifer Foster, Raphael Rubino, Johann Roturier, & Fred Hollowood: Parser accuracy in quality estimation of machine translation: a tree kernel approach. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1092-1096. [PDF, 114KB]

(2013) Eric Wehrli & Luka Nerima: Anaphora resolution, collocations and translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.12-17. [PDF, 202KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2013) Joern Wuebker, Stephan Peitz, Tamer Alkhouli, Jan-Thorsten Peter, Minwei Feng, Markus Freitag & Hermann Ney: The RWTH Aachen machine translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 128KB]

(2012) A.Ryan Aminzadeh, Jennifer Drexler, Timothy Anderson, & Wade Shen: Improved phrase translation modeling using MAP adaptation. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #496, 1p. [HTML]

(2012) Marianna Apidianaki, Guillaume Wisniewski, Artem Sokolov, Aurélien Max, & François Yvon: WSD for n-best reranking and local language modeling in SMT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.1-9. [PDF, 108KB]

(2012) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in SMT of user-generated forum content guided by OOV word reduction: normalization and/or supplementary data? EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.169-176. [PDF, 160KB]

(2012) Laurent Besacier, Benjamin Lecouteux, Marwen Azouzi, & Luong Ngoc Quang: The LIG English to French machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp. 102-108. [PDF,625KB]

(2012) Alexandra Balahur & Marco Turchi: Multilingual sentiment analysis using machine translation. [ACL 2012] Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, Jeju, Republic of Korea, 12 July 2012; pp.52-60. [PDF, 133KB]

(2012) Arianna Bisazza & Marcello Federico: Cutting the long tail: hybrid language models for translation style adaptation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 439-448. [PDF, 266KB]

(2012) Frédéric Blain, Holger Schwenk, & Jean Senellart: Incremental adaptation using translation information and post-editing analysis. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.229-236. [PDF, 873KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

 (2012) Houda Bouamor, Aurélien Max, & Anne Vilnat: Validation of sub-sentential paraphrases acquired from parallel monolingual corpora. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 716-725. [PDF, 173KB]

 (2012) Sandipan Dandapat, Sara Morrissey, Abdy Way, & Joseph van Genabith: Combining EBMT, SMT, TM and IR technologies for quality and scale. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.48-58. [PDF, 248KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Stephen Doherty: Investigating the effects of controlled language on the reading and comprehension of machine translated texts: a mixed-methods approach. Thesis sumitted for the degree of Doctor of Philosophy, … Dublin City University, January 2012; xiii, 275pp. and appendices. [PDF,

(2012) Stephen Doherty & Sharon O’Brien: A user-based usability assessment of raw machine translated technical instructions. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 143KB]

(2012) Jennifer Drexler, Wade Shen, Tim Anderson, Raymond Slyh, Brian Ore, Eric Hansen & Terry Gleason: The MIT-LL/AFRL IWSLT 2012 MT system. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.109-116. [PDF, 646KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Ramona Enache, Cristina España-Bonet, Aarne Ranta, & Lluís Màrquez: A hybrid system for patent translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.269-276. [PDF, 189KB]

(2012) Atefeh Farzindar & Wael Khreich: Evaluation of domain adaptation techniques for TRANSLI in a real-world environment. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 136KB]

(2012) M.Federico, M.Cettolo, L.Bentivogli, M.Paul, & S.Stüker: Overview of the IWSLT 2012 evaluation campaign. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.12-33. [PDF, 772KB]; presentation: 44 slides [PDF of PPT, 3701KB]

(2012) Douwe Gelling & Trevor Cohn: Using senses in HMM word alignment. NAACL-HLT Workshop on the Induction of Linguistic Structure, Montréal, Canada, June 3-8, 2012; pp.39-46. [PDF, 156KB]

(2012) Jorge González: A finite-state approach to phrase-based statistical machine translation. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.90-98. [PDF, 253KB]

(2012) Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, & Pawel Swietojanski: The UEDIN systems for the IWSLT 2012 evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.46-53. [PDF, 651KB]; presentation: 103 slides [PDF of PPT, 384KB]

(2012) Eva Hasler, Barry Haddow, & Philipp Koehn: Sparse lexicalised features and topic adaptation for SMT. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.268-275. [PDF, 630KB]

(2012) Matthias Huck, Stephan Peitz, Markus Freitag, Malte Nuhn, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.304-311. [PDF, 127KB]

(2012) Rasoul Samad Zadeh Kaljahi, Raphael Rubino, Johann Roturier, & Jennifer Foster: A detailed analysis of phrase-based and syntax-based machine translation: the search for systematic differences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 140KB]

(2012) Philipp Koehn & Barry Haddow: Toward effective use of training data in statistical machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.317-321. [PDF, 89KB]

(2012) Patrik Lambert, Holger Schwenk, & Frédéric Blain: Automatic translation of scientific documents in the HAL archive.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3933-3936. [PDF, 329KB]

(2012) Patrick Lambert, Jean Senellart, Laurent Romary, Holger Schwenk, Florian Zipser, Patrice Lopez, & Frédéric Blain: Collaborative machine translation service for scientific texts. [EACL 2012] Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 11-15. [PDF, 523KB]

(2012) Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurélien Max, Artem Sokolov, Guillaume Wisniewski, & François Yvon: LIMSI @ WMT’12.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.330-337. [PDF, 189KB]

(2012) Gennadi Lembersky, Noam Ordan, & Shuly Wintner: Adapting translation models to translationese improves SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.255-265. [PDF, 173KB]

(2012) Michel Lopez: Managing change when implementing MT systems. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 4pp. [PDF, 41KB]

(2012) Evgeny Matusov: Incremental re-training of a hybrid English-French MT system with customer translation memory data. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 8pp. [PDF, 116KB]

(2012) Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel & Alexander Waibel: The KIT translation systems for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.38-45. [PDF, 640KB]; presentation, 23 slides [PDF of PPT, 469KB]

(2012) Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui, & Andrea Gesmundo: Machine translation of labeled discourse connectives. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 196KB]

(2012) Thomas Meyer & Andrei Popescu-Belis: Using sense-labeled discourse connectives for statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.129-138. [PDF, 275KB]

(2012) Alexander Molchanov: PROMT DeepHybrid system for WMT12 shared translation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.345-348. [PDF, 88KB]

(2012) Jan Niehues & Alex Waibel: Continuous space language models using restricted Boltzmann machines. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.164-170. [PDF, 675KB]

(2012) Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.349-355. [PDF, 149KB]

(2012) Pavel Pecina, Antonio  Toral, Vassilis Papavassiliou, Prokopis Prokopidis, & Josef van Genabith: Domain adaptation of statistical machine translation using web-crawled resources: a case study. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.145-152. [PDF, 201KB]

(2012) Stephan Peitz, Saab Mansour, Markus Freitag, Minwei Feng, Matthias Huck, Joern Wuebker, Malte Nuhn, Markus Nußbaum-Thom, & Hermann Ney: The RWTH Aachen speech recognition and machine translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.69-76. [PDF, 623KB]; presentation, 19 slides [PDF of PPT, 165KB]

(2012) Stephan Peitz, Simon Wiesler, Markus Nußbaum-Thom, & Hermann Ney: Spoken language translation using automatically transcribed text in training. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.276-283. [PDF, 623KB]

(2012) Daniele Pighin, Lluís Màrquez, & Jonathan May: An analysis (and an annotated corpus) of user responses to machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1131-1135. [PDF, 355KB]

(2012) Alexandros Poulis & David Kolovratnik: To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 390KB]

(2012) Johann Roturier, Linda Mitchell, Robert Grabowski & Melanie Siegel: Using automatic machine translation metrics to analyze the impact of source reformulations. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 509KB]

(2012) N.Ruiz, A.Bisazza, R.Cattoni, & M.Federico: FBK’s machine translation systems for IWSLT 2012’s TED lectures. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.61-68. [PDF, 623KB]; presentation, 27 slides [PDF of PPT, 378KB]

 (2012) Holger Schwenk: Continuous space translation models for phrase-based statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1071-1080. [PDF, 124KB]

(2012) Christophe Servan, Patrik Lambert, Anthony Rousseau, Holger Schwenk, & Loïc Barrault: LIUM’s SMT machine translation systems for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.369-373. [PDF, 106KB]

(2012) Christophe  Servan & Simon Petitrenaud: Calculation of phrase probabilities for statistical machine translation by using belief functions. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1101-1110. [PDF, 120KB]

 (2012) Kashif Shah, Loïc Barrault, & Holger Schwenk: A general framework to weight heterogenous parallel data for model adaptation in statistical machine translation.  AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 194KB]

(2012) Radu Soricut & Sushant Narsale: Combining quality prediction and system selection for improved automatic translation output. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.163-170. [PDF, 135KB]

 (2012) Manuela Yapomo, Gloria Corpas, & Ruslan Mitkov: CLIR- and ontology-based approach for bilingual extraction of comparable documents.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.121-125. [PDF, 380KB]

(2012) Qian Yu, Aurélien Max, & François Yvon: Aligning bilingual literary works: a pilot study. NAACL-HLT Workshop on Computational Linguistics for Literature, Montréal, Canada, June 8, 2012; pp.36-44. [PDF, 232KB]

(2011) Alexandre Allauzen, Hélène Bonneau-Maynard, Hai-Son Le, Aurélien Max, Guillaume Wisniewski, François Yvon, Gilles Adda, Josep M.Crego, Adrien Lardilleux, Thomas Lavergne, & Artem Sokolov: LIMSI @ WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.309-315. [PDF, 107KB]

 (2011) A.Ryan Aminzadeh, Tim Anderson, Ray Slyh, Brian Ore, Eric Hansen, Wade Shen, Jennifer Drexler, & Terry Gleason: The MIT-LL/AFRL IWSLT-2011 MT system. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.34-40. [PDF, 254KB]

(2011) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in statistical machine translation of user-forum data using component-level mixture modelling. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.285-292. [PDF, 137KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Arianna Bisazza, Nick Ruiz, & Marcello Federico: Fill-up versus interpolation methods for phrase-based SMT adaptation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.136-143. [PDF, 323KB]

(2011) Frédéric Blain, Jean Senellart, Holger Schwenk, Mirko Plitt, & Johann Roturier: Qualitative analysis of post-editing for high quality machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.164-171. [PDF, 218KB]

(2011) Alexandru Ceauşu, John Tinsley, Jian Zhang, & Andy Way: Experiments on domain adaptation for patent machine translation in the PLuTO project. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.21-28. [PDF, 832KB]; presentation, 16 slides [PDF, 824KB]

(2011) Christian Hardmeier, Jörg Tiedemann, Markus Saers, Marcello Federico, & Mathur Prashant: The Uppsala-FBK systems at WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.372-378. [PDF, 115KB]

(2011) Amir Hazem, Emmanuel Morin & Sebastian Peña Saldarriaga: Bilingual lexicon extraction from comparable corpora as metasearch. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.35-43. [PDF, 159KB]

 (2011) Teresa Herrmann, Mohammed Mediani, Jan Niehues, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.379-385. [PDF, 106KB]

(2011) Matthias Huck, Saab Mansour, Simon Wiesler, & Hermann Ney: Lexicon models for hierarchical phrase-based machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.191-198. [PDF, 256KB]

(2011) Matthias Huck, Joern Wuebker, Christoph Schmidt, Markus Freitag, Stephan Peitz, Daniel Stein, Arnaud Dagnelies, Saab Mansour, Gregor Leusch, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.405-412. [PDF, 112KB]

 (2011) Stéphane Huet & Philippe Langlais: Identifying the translations of idiomatic expressions using TransSearch. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.45-56. [PDF, 1123KB]

(2011) Patrik Lambert, Holger Schwenk, Christophe Servan, & Sadaf Abdul-Rauf: Investigations on translation model adaptation using monolingual data. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.284-293. [PDF, 124KB]

(2011) Thomas Lavergne, Alexandre Allauzen, Hai-Son Le, & François Yvon: LIMSI’s experiments in domain adaptation for IWSLT11. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.62-67. [PDF, 358KB]

 (2011) Benjamin Lecouteux, Laurent Besacier, & Hervé Blanchon: LIG English-French spoken language translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.68-72 [PDF, 98KB]

 (2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

(2011) Hui Lin & Jeff Bilmes: Word alignment via submodular maximization over matroids. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.170-175. [PDF, 159KB]

(2011) Saab Mansour, Joern Wuebker, & Hermann Ney: Combining translation and language model scoring for domain-specific data filtering. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.222-229. [PDF, 257KB]

(2011) Mohammed Mediani, Eunach Cho, Jan Niehues,Teresa Herrmann & Alex Waibel: The KIT English-French translation systems for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.73-78. [PDF, 179KB]

(2011) Emmanuel Morin & Emmanuel Prochasson: Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.27-34. [PDF, 125KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

 (2011) Alexandre Patry & Philippe Langlais: Going beyond word cooccurrences in global lexicon selection for statistical machine translation using a multilayer perceptron. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.658-666. [PDF, 428KB]

(2011) Pavel Pecina, Antonio Toral, Andy Way, Vassilis Papavassiliou, Prokopis Prokopidis, & Maria Giagkou: Towards using web-crawled data for domain adaptation in statistical machine translation. [EAMT 2011]:  proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.297-304. [PDF, 363KB]; presentation, 25 slides [PDF]

 (2011) Stephan Peitz, Markus Freitag, Arne Mauser, & Hermann Ney: Modeling punctuation prediction as machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.238-245. [PDF, 214KB]

 (2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS projectTralogy, Paris, 3-4 March 2011;  17pp. [PDF, 169KB]

(2011) Bruno Pouliquen & Christophe Mazenc: COPPA, CLIR and TAPTA: three tools to assist in overcoming the patent barrier at WIPO. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.24-30. [PDF, 896KB]

(2011) Bruno Pouliquen, Christophe Mazenc & Aldo Iorio: Tapta: a user-driven translation system for patent documents based on domain-aware statistical machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.5-12. [PDF, 342KB]; presentation, 15 slides [PDF, 1642KB]

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

(2011) Johann Roturier & Anthony Bensadoun: Evaluation of MT systems to translate user generated content. MT  Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.244-251. [PDF, 284KB]

 (2011) Anthony Rousseau, Fethi Bougares, Paul Deléglise, Holger Schwenk, & Yannick Estève: LIUM’s systems for the IWSLT 2011 speech translation tasks. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.79-85. [PDF, 250 KB]

 (2011) N.Ruiz, A.Bisazza, F.Brugnara, D.Falavigna, D.Giuliani, S.Jaber, R.Gretter, & M.Federico: FBK@IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.86-93. [PDF, 203KB]

 (2011) Nick Ruiz & Marcello Federico: Topic adaptation for lecture translation through bilingual latent semantic models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.294-302. [PDF, 159KB]

(2011) Holger Schwenk, Patrik Lambert, Loïc Barrault, Christophe Servan, Haithem Afli, Sadaf Abdul-Rauf, & Kashif Shah: LIUM’s SMT machine translation systems for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.464-469. [PDF, 119KB]

(2011) Johan Segura & Violaine Prince: Using alignment to detect multiword expressions in bilingual corpora. .  Tralogy, Paris, 3-4 March 2011;  17pp. [PDF, 430KB]

(2011) Nasredine Semmar & Dhouha Bouamor: A new hybrid machine translation approach using cross-language information retrieval and only target text corpora. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.75-79. [PDF, 898KB]

 (2011) Kashif Shah, Loïc Barrault, & Holger Schwenk: Parametric weighting of parallel data for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1323-1331. [PDF, 754KB]

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Joern Wuebker, Matthias Huck, Saab Mansour, Markus Freitag, Minwei Feng, Stephan Peitz, Christoph Schmidt, & Hermann Ney: The RWTH Aachen machine translation system for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.106-113. [PDF, 193KB]

(2010) Alexandre Allauzen, Josep M.Crego, Ílknur Durgar El-Kahlout, & François Yvon: LIMSI’s statistical translation systems for WMT’10.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 54-59. [PDF, 129KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Laurent Besacier, Haitem Afli, Do Thi Ngoc Diep, Hervé Blanchon, & Marion Potet: LIG statistical machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.99-104. [PDF, 417KB]

 (2010) Ergun Biçici & S.Serdar Kozat: Adaptive model weighting and transductive regression for predicting best system combinations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 276-281. [PDF, 181KB]

 (2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010) Arianna Bisazza, Ioannis Klasinas, Mauro Cettolo, & Marcello Federico: FBK @ IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.53-58. [PDF, 346KB]

(2010) Christian Federmann, Andreas Eisele, Hans Uszkoreit, Yu Chen, Sabine Hunsicker, & Jia Xu: Further experiments with shallow hybrid MT systems.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 77-81. [PDF, 118KB]

(2010) George Foster, Cyril Goutte, & Roland Kuhn: Discriminative instance weighting for domain adaptation in statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.451-459. [PDF, 256KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Automatic determination of number of clusters for creating templates in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 582KB]; presentation: 37 slides [PDF, 496KB]

(2010) Guillem Gascó, Vicent Alabau, Jesús Andrés-Ferrer, Jesús González-Rubio, Martha-Alicia Rocha, Germán Sanchis-Trilles, Francisco Casacuberta, Jorge González, & Joan-Andreu Sánchez: ITI-UPV system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.85-92. [PDF, 414KB]; presentation by Germán Sanchis-Trilles: 16 slides [PDF of PPT, 481KB]

(2010) Olivier Hamon: Is my judge a good one? LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1735-1740. [PDF, 281KB]

(2010) Kenneth Heafield & Alon Lavie: CMU multi-engine machine translation for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 301-306. [PDF, 112KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Stéphane Huet, Julien Bourdaillet, Alexandre Patry, & Philippe Langlais: The RALI machine translation system for WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 103-109. [PDF, 115KB]

(2010) Michael Jellinghaus, Alexandros Poulis, & David Kolovratnik: Exodus – exploring SMT for EU institutions.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 110-114. [PDF, 109KB]

(2010) Patrik Lambert, Sadaf Abdul-Rauf, & Holger Schwenk: LIUM SMT machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 121-126. [PDF, 119KB]

(2010) Samuel Larkin, Boxing Chen, George Foster, Ulrich Germann, Eric Joanis, Howard Johnson, & Roland Kuhn: Lessons from NRC’s Portage system at WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 127-132. [PDF, 124KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Bo Li & Eric Gaussier: Improving corpus comparability for bilingual lexicon extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.644-652. [PDF, 276KB]

(2010) Aurélien Max: Example-based paraphrasing for improved phrase-based statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.656-666. [PDF, 234KB]

(2010) Jan Niehues, Mohammed Mediani, Teresa Herrmann, Michael Heck, Christian Herff, & Alex Waibel: The KIT translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.93-98. [PDF, 329KB]

(2010) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Online learning for interactive statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.546-554. [PDF, 284KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

(2010) Manny Rayner, Paula Estrella, & Pierrette Bouillon: A bootstrapped interlingua-based SMT architecture. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 30 slides [PDF, 737KB]

(2010) Anthony Rousseau, Loïc Barrault, Paul Deléglise, & Yannick Estève: LIUM’s statistical machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.113-117. [PDF, 246KB]

 (2010) Graham Russell: Moz: translation of structured terminology-rich text. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 157KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Word alignment with stochastic bracketing linear inversion transduction grammar.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.341-344. [PDF, 97KB]

(2010) Baskaran Sankaran, Ajeet Grewal, & Anoop Sarkar: Incremental decodings for phrase-based statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 216-223. [PDF, 137KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

(2010) Wade Shen, Tim Anderson, Ray Slyh, & A.Ryan Aminzadeh: The MIT-LL/AFRL IWSLT-2010 MT system. Proceedings of the 7th International Workshop on Spoken L anguage Translation, 2-3 December 2010, Paris, France; pp.127-134. [PDF, 341KB]; presentation: 11 slides [PDF of PPT, 422KB]

(2010) Lei Shi, Rada Mihalcea, & Mingjun Tian: Cross language text classification by model translation and semi-supervised learning. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1057-1067. [PDF, 525KB]

(2010) Hiroyuki Shindo, Akinori Fujino, & Masaaki Nagata: Word alignment with synonym regularization. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.137-141. [PDF, 492KB]

(2010) Katsuhito Sudoh, Kevin Duh, & Hajime Tsukada: NTT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.147-152. [PDF, 361KB]

(2010) Lin Sun, Anna Korhonen, Thierry Poibeau, & Cédric Messiant: Investigating the cross-linguistic potential of VerbNet-style classification. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp. 1056-1064. [PDF, 227KB]

(2010) Lonneke van der Plas, Tanja Samardžić, & Paola Merlo: Cross-lingual validity of PropBank in the manual annotation of French. [ACL-2010] Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden, 15-16 July 2010; pp.113-117. [PDF, 102KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Ventsislav Zhechev & Josef van Genabith: Seeding statistical machine translation with translation memory output through tree-based structural alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.43-51. [PDF, 510KB]

English to German

(2015) Thanh-Le Ha, Jan Niehues, Eunah Cho, Mohammed Mediani & Alex Waibel: The KIT translation systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.62-69. [PDF, 3.0MB]

(2015) Teresa Herrmann, Jan Niehues, &Alex Waibel: Source discriminative word lexicon for translation disambiguation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.135-142. [PDF, 3.1MB]

(2015) Matthias Huck & Alexandra Birch: The Edinburgh machine translation systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 31-38. [PDF, 3.1MB]

(2015) Laura Jehl, Patrick Simianer, Julian Hirschler, & Stefan Riezler: The Heidelberg University English-German translation system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp. 45-49. [PDF, 1.7MB]

(2015) Minh-Thang Luong & Christopher Manning: Stanford neural machine translation systems for spoken language domains. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.76-79. [PDF, 1.2MB]

(2015) Markus Müller, Thai-Son Nguyen, Matthias Sperber, Kevin Kilgour, Sebastian Stüker, & Alex Waibel: The 2015 KIT IWSLT speech-to-text systems for English and German. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.70-75. [PDF, 2.1MB]

(2015) Nina Seemann, Fabienne Braune, & Andreas Maletti: A systematic evaluation of MBOT in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.200-214. [PDF, 585KB]

(2015) Krzysztof Wolk & Krzysztof Marasek: PJAIT systems for the IWSLT 2015 evaluation campaign enhanced by comparable corpora. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.101-104. [PDF, 2.9MB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Daniel Beck, Kashif Shah, & Lucia Specia: SHEF-Lite 2.0: sparse multi-task Gaussian processes for translation quality estimation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.307-312. [PDF, 306KB]

(2014) Fabienne Cap, Marion Weller, Anita Ramm, & Alexander Fraser: CimS – the CIS and IMS joint submission to WMT2014 translating from English into German. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.71-78. [PDF, 197KB

(2014) Ondřej Dušek, Jan Hajič, Jaroslava Hlaváčová, Michal Novák, Pavel Pecina, Rudolf Rosa, Aleš Tamchyna, Zdeňka Urešová, & Daniel Zeman: Machine translation of medical texts in the Khresmoi project. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.221-228. [PDF, 513KB]

(2014) Kevin Flanagan; Bilingual phrase-to-phrase alignment for arbitrarily-small datasets. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 83-95. [PDF, 1203KB]

(2014) Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Matthias Huck, Rico Sennrich, Nadir Durrani, Maria Nadejde, Philip Williams, Philipp Koehn, Teresa Herrmann, Eunach Cho, & Alex Waibel; EU-BRIDGE MT: combined machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.105-113. [PDF, 217KB]

(2014) Federico Gaspari, Antonio Toral, Sudip Kumar Naskar, Declan Groves, & Andy Way: Perception vs. reality: measuring machine translation post-editing productivity. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.60-72. [PDF, 160KB]

(2014) Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann, & Bonnie Webber: ParCor 1.0: a parallel pronoun-coreference corpus to support statistical MT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3191-3198. [PDF, 126KB]

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Teresa Herrmann, Mohammed Mediani, Eunach Cho, Thanh-Le Ha, Jan Niehues, Isabel Slawik, Yuqi Zhang, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.130-135. [PDF, 201KB]

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Jianri Li, Se-Jong Kim, Hwidong Na, & Jong-Hyeok Lee: Postech’s system description for medical text translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.229-232. [PDF, 262KB]

 (2014) Arle Lommel, Aljoscha Burchardt, Maja Popović, Kim Harris, Eleftherios Avramidis, & Hans Uszkoreit: Using a new analytic measure for the annotation and analysis of MT errors on real data. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.165-172. [PDF, 1781KB]

(2014) Yi Lu, Longyue Wang, Derek F.Wong, Lidia S.Chao, Yiming Wang, & Francisco Oliveira: Domain adaptation for medical text translation using web resources. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.233-238. [PDF, 485KB]

(2014) Julia Neidert, Sebastian Schuster, Spence Green, Kenneth Heafield, & Christopher D.Manning: Stanford University’s submissions to the WMT 2014 translation task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.150-156. [PDF, 333KB]

(2014) Nicolas Pécheux, AlexandreAllauzen, & Francçois Yvon: Rule-based reordering space in statistical machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1800-1806. [PDF, 607]

(2014) Lucia Specia, & Kashif Shah: Predicting human translation quality. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.288-300. [PDF, 717KB]

(2014) Dan Tufiş, Radu Ion, Ştefan Dumitrescu, & Dan Ştefănescu: Large SMT data-sets extracted from Wikipedia. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.656-663. [PDF, 232KB]

(2014) Zdeňka Urešová, Ondřej Dušek, Jan Hajič, & Pavel Pecina: Multilingual test sets for machine translation of search queries for cross-lingual information retrieval in the medical domain. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3244-3247. [PDF, 116KB]

(2014) Pavel Vondřička: Aligning parallel texts with InterText. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1875-1879. [PDF, 476KB]

(2014) Longyue Wang, Yi Lu, Derek F.Wong, Lidia Chao, Yiming Wang, & Francisco Oliveira: Combining domain adaptation approaches for medical text translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.254-259. [PDF, 348KB]

(2014) Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, & Philipp Koehn: Edinburgh’s syntax-based systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.207-214. [PDF, 222KB]

(2013) Ahmet Aker, Monica Paramita, & Robert Gaizauskas: Extracting bilingual terminologies from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.402-411. [PDF, 204KB]

(2013) Mihael Arcan & Paul Buitelaar: Ontology label translation. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Student Research Workshop, 13 June 2013, Atlanta, Georgia; pp.40-46. [PDF, 127KB]

(2013) Mihael Arcan, Susan Marie Thomas, Derek de Brandt, & Paul Buitelaar: Translating the FINREP taxonomy using a domain-specific corpus. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.199-206. [PDF, 473KB]

(2013) Michael Auli, Michel Galley, Chris Quirk, & Geoffrey Zweig: Joint language and translation modeling with recurrent neural networks. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 308KB]

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Fabienne Braune, Nina Seemann, Daniel Quernheim, & Andreas Maletti: Shallow local multi bottom-up tree transducers in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.811-821. [PDF, 276KB]

(2013) Lars Bungum, Björn Gambäck, André Lynum, & Erwin Marsi: Improving word translation disambiguation by capturing multiword expressions with dictionaries. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, June 13-14, 2013; pp.21-30. [PDF, 155KB]

(2013) Mauro Cettolo, Christophe Servan, Nicola Bertoldi, Marcello Federico, Loïc Barrault, & Holger Schwenk: Issues in incremental adaptation of statistical MT from human post-edits. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 111-118. [PDF, 186KB]

(2013) Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, & Marcello Federico: Report on the 10th IWSLT evaluation campaign. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 18pp. [PDF, 270KB]

(2013) Eunah Cho, Thanh-Le Ha, Mohammed Mediani, Jan Niehues, Teresa Herrmann, Isabel Slawik & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2013. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.104-108. [PDF, 107KB]

(2013) Kevin Duh, Graham Neubig, Katsuhito Sudoh, & Hajime Tsukada: Adapation data selection using neural language models: experiments in machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.678-683. [PDF, 215KB]

(2013) Melania Duma, Cristina Vertan, & Wolfgang Menzel: A new syntatctic metric for evaluation of machine translation.  ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.130-135. [PDF, 346KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) İlknur Durgar El-Kahlout & Coşkun Mermer: TÜBİTAK-BİLGEM German-English machine translation systems for W13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.109-113. [PDF, 100KB]

(2013) Mark Fishel: Ranking translations using error analysis and quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.405-407. [PDF, 98KB]

(2013) Spence Green, Daniel Cer, Kevin Reschke, Rob Voigt, John Bauer, Sida Wang, Natalia Silveira, Julia Neidert & Christopher D. Manning: Feature-rich phrase-based translation: Stanford University’s submission to the WMT 2013 translation task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.148-153. [PDF, 272KB]

(2013) Than-Le Ha, Teresa Herrmann, Jan Niehues, Mohammed Mediani, Eunah Cho, Yuqi Zhang, Isabel Slawik, & Alex Waibel: The KIT translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 147KB]

(2013) Najeh Hajlaoui: Are ACT’s scores increasing with better translation quality? WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.408-413. [PDF, 255KB]

(2013) Silvana Hartmann & Iryna Gurevych: FrameNet on the way to Babel: creating a bilingual FrameNet using Wiktionary as interlingual connection.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1363-1373. [PDF, 271KB]

(2013) Teresa Herrmann, Jochen Weiner, Jan Niehues, & Alex Waibel: Analyzing the potential of source sentence reordering in statistical machine translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 166KB]

(2013) Thomas Meyer & Bonnie Webber: Implicitation of discourse connectives in (machine) translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.19-26. [PDF, 396KB]

(2013) Linda Mitchell, Johann Roturier, & Sharon O’Brien: Community-based post-editing of machine-translated content: monolingual vs. bilingual. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 35-43. [PDF, 216KB]

(2013) Hwidong Na & Jong-Hyeok Lee: A discriminative reordering parser for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 105KB]

(2013) Maria Nadejde, Philip Williams, & Philipp Koehn: Edinburgh’s syntax-based machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.170-176. [PDF, 161KB]

(2013) Maja Popović, Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Sven Schmeier, Cindy Tscherwinka, David Vilar, & Hans Uszkoreit: Learning from human judgments of machine translation output. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.231-238. [PDF, 328KB]

(2013) Rico Sennrich, Holger Schwenk & Walid Aransa: A multi-domain translation model framework for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.832-840. [PDF, 1387KB]

(2013) M.Ali Basha Shaik, Zoltan Tüske, Simon Wiesler, Markus Nußbaum-Thom, Stephan Peitz, Ralf Schlüter & Hermann Ney: The RWTH Aachen German and English LVCSR systems for IWSLT-2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 230KB]

(2013) Sara Stymne, Christian Hardmeier, Jörg Tiedemann & Joakim Nivre: Tunable distortion limits and corpus cleaning for SMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.225-231. [PDF, 146KB]

(2013) Katsuhito Sudoh, Graham Neubig, Kevin Duh, & Hajime Tsukada: NTT-NAIST SMT systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 91KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2013) Marion Weller, Alexander Fraser, & Sabine Schulte im Walde: Using subcategorization knowledge to improve case prediction for translation into German.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.593-603. [PDF, 252KB]

(2013) Joern Wuebker, Stephan Peitz, Tamer Alkhouli, Jan-Thorsten Peter, Minwei Feng, Markus Freitag & Hermann Ney: The RWTH Aachen machine translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 128KB]

(2013) Wenduan Xu, Yue Zhang, Philip Williams, & Philipp Koehn: Learning to prune: context-sensitive pruning for syntactic MT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.352-357. [PDF, 334KB]

(2012) Mihael Arcan, Christian Federmann, & Paul Buitelaar: Experiments with term translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.67-82. [PDF, 178KB]

(2012) Mihael Arcan, Paul Buitelaar, & Christian Federmann: Using domain-specific and collaborative resources for term translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.86-94. [PDF, 122KB]

(2012) Eleftherios Avramidis, Aljoscha Burchardt, Christian Federmann, Maja Popović, Cindy Tscherwinka, & David  Vilar: Involving language professionals in the evaluation of machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1127-1130. [PDF, 320KB]]

(2012) Eleftherios Avramidis, Marta R.Costa-jussà, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

(2012) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in SMT of user-generated forum content guided by OOV word reduction: normalization and/or supplementary data? EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.169-176. [PDF, 160KB]

(2012) Alexandra Balahur & Marco Turchi: Multilingual sentiment analysis using machine translation. [ACL 2012] Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, Jeju, Republic of Korea, 12 July 2012; pp.52-60. [PDF, 133KB]

(2012) Phil Blunsom: Synchronous grammars and translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 45 slides [PDF of PPT, 456KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Stephen Doherty & Sharon O’Brien: A user-based usability assessment of raw machine translated technical instructions. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 143KB]

(2012) Greg Durrett, Adam Pauls, & Dan Klein: Syntactic transfer using a bilingual lexicon. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1-11. [PDF, 286KB]

(2012) Marcello Federico, Alessandro Cattelan, & Marco Trombetti: Measuring user productivity in machine translation enhanced computer assisted translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 261KB]

(2012) Christian Federmann: Can machine learning algorithms improve phrase selection in hybrid machine translation?  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.113-118. [PDF, 200KB]

(2012) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Cap: Modeling inflection and word formation in SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.664-674. [PDF, 146KB]

(2012) Anita Gojun & Alexander Fraser: Determining the placement of German verbs in English-to-German SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 726-735. [PDF, 138KB]

(2012) Bettina Harriehausen-Mühlbauer & Timm Heuss: Semantic web based machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.1-9. [PDF, 196KB]

(2012) Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, & Pawel Swietojanski: The UEDIN systems for the IWSLT 2012 evaluation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.46-53. [PDF, 651KB]; presentation: 103 slides [PDF of PPT, 384KB]

(2012) Maria Holmqvist, Sara Stymne, Lars Ahrenberg, & Magnus Merkel: Alignment-based reordering for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3436-3440. [PDF, 300KB]

(2012) Matthias Huck, Stephan Peitz, Markus Freitag, Malte Nuhn, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.304-311. [PDF, 127KB]

(2012) Sabine Hunsicker, Chen Yu, & Christian Federmann: Machine learning for hybrid machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.312-316. [PDF, 131KB]

(2012) Rasoul Samad Zadeh Kaljahi, Raphael Rubino, Johann Roturier, & Jennifer Foster: A detailed analysis of phrase-based and syntax-based machine translation: the search for systematic differences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 140KB]

(2012) Gerhard Kremer, Matthias Hartung, Sebastian Padó & Stefan Riezler: Statistical machine translation support improves human adjective translation. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.103-126. [PDF, 266KB]

(2012) Hai-Son Le, Thomas Lavergne, Alexandre Allauzen, Marianna Apidianaki, Li Gong, Aurélien Max, Artem Sokolov, Guillaume Wisniewski, & François Yvon: LIMSI @ WMT’12.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.330-337. [PDF, 189KB]

(2012) Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel & Alexander Waibel: The KIT translation systems for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.38-45. [PDF, 640KB]; presentation, 23 slides [PDF of PPT, 469KB]

(2012) Linda Mitchell & Johann Roturier: Evaluation of machine-translated user generated content: a pilot study based on user ratings. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.61-64. [PDF, 161KB]

(2012) Jiří Navrátil, Karthik Visweswariah, & Ananthakrishnan Ramanathan: A comparison of syntactic reordering methods for English-German machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2043-2058. [PDF, 213KB]

(2012) Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.349-355. [PDF, 149KB]

(2012) Alexandros Poulis & David Kolovratnik: To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 9pp. [PDF, 390KB]

 (2012) Johann Roturier, Linda Mitchell, Robert Grabowski & Melanie Siegel: Using automatic machine translation metrics to analyze the impact of source reformulations. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 509KB]

(2012) Erich Steiner: Methodological cross-fertilization: empirical methodologies in (computational) linguistics and translation studies. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.3-21. [PDF, 1090KB]

(2012) Sara Stymne: Clustered word classes for preordering in statistical machine translation. EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP,  23-27 April 2012, Avignon, France; pp. 28-34. [PDF, 108KB]

(2012) David Vilar: DFKI’s SMT system for WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.382-387. [PDF, 90KB]

(2012) Philip Williams & Philipp Koehn: GHKM rule extraction and scope-3 parsing in Moses. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.388-394. [PDF, 122KB]

(2012) Mei Yang & Katrin Kirchhoff: Unsupervised  translation disambiguation for cross-domain statistical machine translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 424KB]

(2011) Alexandre Allauzen, Hélène Bonneau-Maynard, Hai-Son Le, Aurélien Max, Guillaume Wisniewski, François Yvon, Gilles Adda, Josep M.Crego, Adrien Lardilleux, Thomas Lavergne, & Artem Sokolov: LIMSI @ WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.309-315. [PDF, 107KB]

(2011) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in statistical machine translation of user-forum data using component-level mixture modelling. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.285-292. [PDF, 137KB]

 (2011) Ergun Biçici & Deniz Yuret: Instance selection for machine translation using feature decay algorithms. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.272-283. [PDF, 244KB]

 (2011) Oliver Čulo, Silvia Hansen-Schirra, Karin Maksymski, & Stella Neumann: Empty links and crossing lines: querying multi-layer annotation and alignment in parallel corpora. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.75-104. [PDF, 923KB]

(2011) Sarah Ebling, Andy Way, Martin Volk, & Sundip Kumar Naskar: Combining semantic and syntactic generalization in example-based machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.209-216. [PDF, 354KB]; presentation, 37 slides [PDF]

(2011) Jakob Elming & Martin Haulrich: Reordering by parsing. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.35-42. [PDF, 361KB]

 (2011) Christian Federmann & Sabine Hunsicker: Stochastic parse tree selection for an existing RBMT system. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.351-357. [PDF, 112KB]

(2011) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Fritzinger: Morphological generation of German for SMT. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 21KB];  presentation: 44 slides [PDF of PPT, 307KB]

 (2011) Teresa Herrmann, Mohammed Mediani, Jan Niehues, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.379-385. [PDF, 106KB]

 (2011) Maria Holmqvist, Sara Stymne, & Lars Ahrenberg: Experiments with word alignment, normalization and clause reordering for SMT between English and German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.393-398. [PDF, 102KB]

(2011) Matthias Huck, Joern Wuebker, Christoph Schmidt, Markus Freitag, Stephan Peitz, Daniel Stein, Arnaud Dagnelies, Saab Mansour, Gregor Leusch, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.405-412. [PDF, 112KB]

 (2011) Maxim Khalilov & Khalil Sima’an: ILLC-UvA translation system for EMNLP-WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.413-419. [PDF, 591KB]

 (2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

 (2011) Erwin Marsi, André Lynum, Lars Bungum, & Björn Gambäck: Word translation disambiguation without parallel texts. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.66-74. [PDF, 898KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

(2011) Johann Roturier & Anthony Bensadoun: Evaluation of MT systems to translate user generated content. MT  Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.244-251. [PDF, 284KB]

(2011) Anne H.Schneider & Saturnino Luz: Speaker alignment in synthesised, machine translation communication. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.254-260. [PDF, 231KB]

 (2011) Rico Sennrich: The UZH system combination system for WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.166-170.  [PDF, 104KB]

 (2011) Philip Williams & Philipp Koehn: Agreement constraints for statistical machine translation into German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.217-226. [PDF, 146KB]

 (2011) Petra Wolf, Ulrike Bernardi, Christian Federmann, & Sabine Hunsicker: From statistical term extraction to hybrid machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.225-232. [PDF, 329KB]; presentation, 24 slides [PDF]

(2010) Ahmet Aker & Robert Gaizauskas: Model summaries for location-related images. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3119-3124. [PDF, 429KB]

(2010) Alexandre Allauzen, Josep M.Crego, İlknur Durgar El-Kahlout, Le Hai-Son, Guillaume Wisniewski, & François Yvon: LIMSI @ IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.105-112. [PDF, 386KB]

 (2010) Ergun Biçici & S.Serdar Kozat: Adaptive model weighting and transductive regression for predicting best system combinations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 276-281. [PDF, 181KB]

 (2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010) Yu Chen & Andreas Eisele: Hierarchical hybrid translation between English and German . EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 570KB]; presentation: 18 slides [PDF, 1652KB]

(2010) Christian Federmann, Andreas Eisele, Hans Uszkoreit, Yu Chen, Sabine Hunsicker, & Jia Xu: Further experiments with shallow hybrid MT systems.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 77-81. [PDF, 118KB]

 (2010) Christian Hardmeier, Arianna Bisazza, & Marcello Federico: FBK at WMT 2010: word lattices for morphological reduction and chunk-based reordering.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 88-92. [PDF, 132KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Kenneth Heafield & Alon Lavie: CMU multi-engine machine translation for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 301-306. [PDF, 112KB]

(2010) Carmen Heger, Joern Wuebker, Matthias Huck, Gregor Leusch, Saab Mansour, Daniel Stein, & Hermann Ney: The RWTH Aachen machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 93-97. [PDF, 98KB]

(2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

 (2010) Laura Elisabeth Jehl: Machine translation for Twitter. Master of Science, University of Edinburgh, 2010. iv, 53pp. [PDF, 398KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Jan Niehues, Teresa Herrmann, Mohammed Mediani, & Alex Waibel: The Karlsruhe Institute for Technology translation system for the ACL-WMT 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 138-142. [PDF, 87KB]

(2010) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Online learning for interactive statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.546-554. [PDF, 284KB]

(2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

(2010) Anne H.Schneider, Ielka van der Sluis, & Saturnino Luz: Comparing intrinsic and extrinsic evaluation of MT output in a dialogue system. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.329-336. [PDF, 448KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

(2010) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Vs and OOVs: two problems for translation between German and English. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 183-188. [PDF, 105KB]

(2010) Jörg Tiedemann: To cache or not to cache? Experiments with adaptive models in statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 189-194. [PDF, 127KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

English to Greek

 (2012) Konstantinos Chatzitheodorou & Alexandros Poulis: Experiments on domain-specific statistical machine translation at the European Parliament. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 6pp. [PDF, 171KB]; presentation by Alexandros Poulis: 16 slides [PDF, 134KB]

(2012) Pavel Pecina, Antonio  Toral, Vassilis Papavassiliou, Prokopis Prokopidis, & Josef van Genabith: Domain adaptation of statistical machine translation using web-crawled resources: a case study. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.145-152. [PDF, 201KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Pavel Pecina, Antonio Toral, Andy Way, Vassilis Papavassiliou, Prokopis Prokopidis, & Maria Giagkou: Towards using web-crawled data for domain adaptation in statistical machine translation. [EAMT 2011]:  proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.297-304. [PDF, 363KB]; presentation, 25 slides [PDF]

 (2011) Mara Tsoumari & Georgios Petasis: A new annotation tool for aligned bilingual corpora. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.43-52. [PDF, 184KB]

English to Gujarati

(2010) Niraj Aswani & Robert Gaizauskas: English-Hindi transliteration using similarity metrics. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1786-1793. [PDF, 530KB]

English to Haitian creole

(2010) Ralf D.Brown: Taming structured perceptrons on wild feature vectors. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 384-391. [PDF, 132KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Automatic determination of number of clusters for creating templates in example-based machine translation. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 582KB]; presentation: 37 slides [PDF, 496KB]

 (2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Monolingual distributional profiles for word substitution in machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.320-328. [PDF, 605KB]

(2010) William D.Lewis: Haitian Creole: how to build and ship an MT engine from scratch in 4 days, 17 hours, & 30 minutes. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 518KB]; presentation: 24 slides [PDF, 630KB]

English to Hebrew

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

 (2012) Amittai Axelrod, QingJun Li, & William D.Lewis: Applications of data selection via cross-entropy difference for real-world statistical machine translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.201-208. [PDF, 610KB]

(2011) Raphael Cohen, Yoav Goldberg, & Michael Elhadad: Transliterated pairs acquisition in medical Hebrew. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 36KB];  presentation: 37 slides [PDF of PPT, 315KB]

(2011) Yoav Goldberg: Agreement matters: challenges of translating into a morphologically rich language, and the advantages of a syntax-based system. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 41KB];  presentation: 108 slides [PDF of PPT, 342KB]

 (2011) Canasai Kruengkrai, Thatsanee Charoenporn, & Virach Sornlertlamvanich: Simple discriminative training for machine transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.28-31. [PDF, 514KB]

(2011) Jason Naradowsky & Kristina Toutanova: Unsupervised bilingual morpheme segmentation and alignment with context-rich hidden semi-Markov models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.895-904. [PDF, 285KB]

(2010) Yulia Tsvetkov & Shuly Wintner: Automatic acquisition of parallel corpora from websites with dynamic content.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3389-3392. [PDF, 447KB]

English to Hindi

(2014) Ondřej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Straňák, Vit Suchomel, Aleš Tamchyna, & Daniel Zeman: HindEnCorp – Hindi-English and Hindi-only corpus for machine translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3550-3555. [PDF, 107KB]

(2014) Marta R.Costa-jussà, Parth Gupta, Rafael E.Banchs & Paolo Rosso: English-to-Hindi system descrption for WMT 2014: deep source-context features for Moses. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.79-83. [PDF, 296KB]

(2014) Piyush Dungarwal, Rajen Chatterjee, Abhijit Mishra, Anoop Kunchukuttan, Ritesh Shah, & Pushpak Bhattacharyya: The IIT Bombay HindiÛEnglish translation system at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.90-96. [PDF, 222KB]

(2014) Nadir Durrani & Philipp Koehn: Improving machine translation via triangulation and transliteration. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.71-78. [PDF, 452KB]

(2014) Jayendra Rakesh Yeka, Prasanth Kolachina, & Dipti Misra Sharma: Benchmarking of English-Hindi parallel corpora. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1812-1818. [PDF, 627KB]

(2014) Aleš Tamchyna, Martin Popel, Rudolf Rosa, & Ondřej Bojar: CUNI in WMT14: Chimera still awaits Bellerophon. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.195-200. [PDF, 178KB]

(2014) Liling Tan & Santanu Pal: Manawi: using multi-word expressions and named entities to improve machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.201-206. [PDF, 191KB]

(2013) Ankur Gandhe & Rashmi Gangadharaiah: Hypothesis refinement using agreement constraints in machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.429-437. [PDF, 170KB]

(2013) Aditya Joshi, Kashyap Popat, Shubham Gautam, & Pushpak Bhattacharyya: Making headlines in Hindi: automatic Englsih to Hindi news headline translation. International Joint Conference on Natural Language Processing, System Demonstrations, Nagoya, Japan, 14-18 October 2013; pp.21-24. [PDF, 303KB]

(2013) Raj Nath Patel, Rohit Gupta, Prakash B.Pimpale & Sasikumar M: Reordering rules for English-Hindi MT. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.34-41. [PDF, 321KB]

(2012) Karunesh Kumar Arora & R.Mahesh K.Sinha: Improving statistical machine translation through co-joining parts of verbal constructs in English-Hindi translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.95-101. [PDF, 164KB]

(2012) Renu Balyan, Sudip Kumar Naskar, Antonio Toral, & Niladri Chatterjee: A diagnostic evaluation approach targeting MT systems for Indian languages. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 61-71. [PDF, 359KB]

(2012) Ram Prakash H: Quillpad multilingual predictive transliteration system. COLING 2012: Proceedings of the Second Workshop on Advances in Text Input Methods (WTIM 2), Mumbai, December 2012; pp.107-113. [PDF, 310KB]

(2012) Amit Sangodkar & Om P.Damani: Re-ordering source sentences for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2164-2171. [PDF, 649KB]

(2011) Aditya Bhargava, Bradley Hauer, & Grzegorz Kondrak: Leveraging transliterations from multiple languages. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.36-40. [PDF, 111KB]

(2011) Abhijit Bhole, Goutham Tholpadi, & Raghavendra Udupa: Mining multi-word named entity equivalents from comparable corpora. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.65-72. [PDF, 820KB]

(2011) Aswarth Dara, Prashanth Mannem, Hemanth Sagar Bayyarapu & Avinesh PVS: Transferring syntactic relations from English to Hindi using alignments on local word groups. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.447-455. [PDF, 188KB]

(2011) Pradeep Dasigi & Mona Diab: Named entity transliteration generation leveraging statistical machine translation technology. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.106-111. [PDF, 116KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Ankur Gandhe, Rashmi Gangadharaiah, Karthik Visweswariah, & Ananthakrishnan Ramanathan: Handling verb phrase morphology in highly inflected Indian languages for machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.111-119. [PDF, 256KB]

(2011) Rashmi Gangadharaiah: Reducing asymmetry between language-pairs to improve alignment and translation quality. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1346-1350. [PDF, 239KB]

(2011) Peter Nabende: Mining transliterations from Wikipedia using dynamic Bayesian networks. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.385-391. [PDF, 580KB]

(2011) Ananthakrishnan Ramanathan, Pushpak Bhattacharyya, Karthik Visweswariah, Kushal Ladha, & Ankur Gandhe: Clause-based reordering constraints to improve statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1351-1355. [PDF, 132KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

(2011) Karthik Visweswariah, Rajakrishnan Rajkumar, Ankur Gandhe, Ananthakrishnan Ramanathan, & Jiri Navratil: A word reordering model for improved machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.486-496. [PDF, 252KB]

(2010) Arafat Ahsan, Prasanth Kolachina, Sudheer Kolachina, Dipti Misra Sharma, & Rajeev Sangal: Coupling statistical machine translation with rule-based transfer and generation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 162KB]

(2010) Niraj Aswani & Robert Gaizauskas: English-Hindi transliteration using similarity metrics. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1786-1793. [PDF, 530KB]

(2010) Avinesh PVS & Ankur Parikh: Phrase-based transliteration system with simple heuristics. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.81-84. [PDF, 67KB]

(2010) Ondřej Bojar, Pavel Straňák, & Daniel Zeman: Data issues in English-to-Hindi machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1771-1777. [PDF, 557KB]

(2010) Diptesh Chatterjee, Sudeshna Sarkar, & Arpit Mishra: Co-occurrence graph based iterative bilingual lexicon extraction from comparable corpora. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.35-42. [PDF, 618KB]

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

(2010) Sriram Venkatapathy, Rajeev Sangal, Aravind Joshi, & Karthik Gali: A discriminative approach to dependency based statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.66-74. [PDF, 187KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

English to Hungarian

(2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

 (2013) László J.Laki, Attila Novák, & Borbála Siklósi: An English-to-Hungarian morpheme-based statistical machine translation system with reordering rules. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.42-50. [PDF, 246KB]

(2012) Veronika Vincze: Light verb constructions in the SzegedParallelFX English-Hungarian parallel corpus. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2381-2388. [PDF, 349KB]

(2010) Minh-Thang Luong & Min-Yen Kan: Enhancing morphological alignment for translating highly inflected languages. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.743-751. [PDF, 610KB]

(2010) Gábor Recski, András Rung, Attila Zséder, & András Kornai: NP alignment in bilingual corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3379-3382. [PDF, 305KB]

English to Indonesian

(2012) Septina Dian Larasati: IDENTIC corpus: morphologically enriched Indonesian-English parallel corpus.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.902-906. [PDF, 719KB]

English to Iraqi Arabic

(2015) Rohit Kumar, Sanjika Hewavitharana, Nina Zinovieva, Matthew E.Roy, & Edward Pattison-Gordon: Error-tolerant speech-to-speech translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.229-239. [PDF, 1.2MB]

(2013) Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Rohit Kumar, Enoch Kan, Rohit Prasad, & Prem Natarjan: Semi-supervised word sense disambiguation for mixed-initiative conversational spoken language translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 43-50. [PDF, 536KB]

(2013) Sankaranarayanan Ananthakrishnan, Wei Chen, Rohit Kumar, & Dennis Mehay: Source aware phrase-based decoding for robust conversational spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 222KB]

(2013) Sanjika Hewavitharana, Dennis N.Mehay, Sankaranarayanan Ananthakrishnan, & Prem Natarajan: Incremental topic-based translation model adaptation for conversational spoken language translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.697-701. [PDF, 279KB]

(2012) Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Wei Chen, Sanjika Hewavitharana, Matthew Roy, Frederick Choi, Aaron Challenner, Enoch Kan, Arvid Neelakantan, & Prem Natarajan: Active error detection and resolution for speech-to-speech translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.150-157. [PDF, 1156KB]

(2011) Avneesh Saluja, Ian Lane, & Ying Zhang: Context-aware language modeling for conversational speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.97-104. [PDF, 177KB]

(2010) Sherri Condon, Dan Parvaz, John Aberdeen, Christy Doran, Andrew Freeman, & Marwan Awad: Evaluation of machine translation errors in English and Iraqi Arabic. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.729-735. [PDF, 644KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

English to Italian

(2015) Ke Tran, Arianna Bisazza, & Christof Monz: A distributed inflection model for translating into morphologically rich languages. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.145-159. [PDF, 720KB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Marco Turchi & Bruno Kessler: Automatic annotation of machine translation datasets with binary quality  judgements.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1788-1792. [PDF, 99KB]

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2013) Anabela Barreiro, Johanna Monti, Brigitte Orliac, & Fernando Batista: When multiwords go bad in machine translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.26-33. [PDF, 219KB]

(2013) Mauro Cettolo, Christophe Servan, Nicola Bertoldi, Marcello Federico, Loïc Barrault, & Holger Schwenk: Issues in incremental adaptation of statistical MT from human post-edits. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 111-118. [PDF, 186KB]

(2013) Mauro Cettolo, Nicola Bertoldi, & Marcello Federico: Project adaptation for MT-enhanced computer assisted translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.27-34. [PDF, 399KB]

(2013) Vivi Nastase & Carlo Strapparava: Bridging languages through etymology: the case of cross language text categorization.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.651-659. [PDF, 229KB]

(2012) Jacob Dlougach & Irina Galinskaya: Building a reordering system using tree-to-string hierachical model. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.27-36. [PDF, 1115KB]

(2012) Marcello Federico, Alessandro Cattelan, & Marco Trombetti: Measuring user productivity in machine translation enhanced computer assisted translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 261KB]

(2012) Rohit Gupta, Raj Nath Patel, & Ritesh Shah: Learning improved reordering models for Urdu, Farsi and Italian using SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.37-45. [PDF, 520KB]

(2011) Giuseppe Attardi, Atanas Chanev, & Antonio Valerio Miceli Barone: A dependency based statistical translation model. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.79-87. [PDF, 174KB]

(2010) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Statistical machine translation of texts with misspelled words. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.412-419. [PDF, 160KB]

(2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

English to Japanese

(2015) Masaru Fuji, Atsushi Fujita, Masao Utiyama, Eiichiro Sumita, & Yuji Matsumoto: Patent claim translation based on sublanguage-specific sentence structure. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.1-16. [PDF, 700KB]

(2014) Hideki Isozaki, Natsume Kouchi, & Tsutomu Hirao: Dependency-based automatic enumeration of semantically equivalent word orders for evaluating Japanese translations. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.287-292. [PDF, 264KB]

(2014) Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, & Satoshi Nakamura: Collection of a simultaneous translation corpus for comparative analysis.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.670-673. [PDF, 70KB]

(2013) Daniel Andrade, Masaaki Tsuchida, Takashi Onishi, & Kai Ishikawa: Translation acquisition using synonym sets. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.655-660. [PDF, 500KB]

(2013) Yuki Arase & Ming Zhou: Machine translation detection from monolingual web-text.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1597-1607. [PDF, 591KB]

(2013) Fei Huang & Cezar Pendus: Generalized reordering rules for improved SMT.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.387-392. [PDF, 113KB]

(2013) Shuhei Kondo, Kevin Duh & Yuji Matsumoto: Hidden Markov tree model for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.503-511. [PDF, 531KB]

(2013) Graham Neubig: Travatar: a forest-to-string machine translation engine based on tree transducers. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.91-96. [PDF, 344KB]

(2013) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Robust transliteration mining from comparable corpora with bilingual topic models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.261-269. [PDF, 460KB]

(2013) Akiko Sakamoto, Kazuhiko Abe, Kazuo Sumita & Satoshi Kamatani: Evaluation of a simultaneous interpretation system and analysis of speech log for user experience assessment. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 691KB]

(2013) Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, & Satoshi Nakamura: Constructing a speech translation system using simultaneous interpretation data. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 1041KB]

(2012) Stephen Doherty & Sharon O’Brien: A user-based usability assessment of raw machine translated technical instructions. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 143KB]

(2012) Nan Duan, Mu Li, & Ming Zhou: Forced derivation tree based model training to statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.445-454. [PDF, 761KB]

 (2012) Hideki Isozaki: Head finalization: translation from SVO to SOV [abstract]. Keynote at: IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; 1p. [PDF, 526KB]

 (2012) Toshiaki Nakazawa & Sadao Kurohashi: Alignment by bilingual generation and monolingual derivation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1963-1978. [PDF, 1172KB]

(2012) Graham Neubig, Taro Watanabe, & Shinsuke Mori: Inducing a discriminative parser to optimize machine translation reordering. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.843-853. [PDF, 644KB]

(2012) Tsuyoshi Okita: Annotated corpora for word alignment between Japanese and English and its evaluation with MAP-based word aligner.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3241-3248. [PDF, 522KB]

(2012) Yoh Okuno: Applying mpaligner to machine transliteration with Japanese-specific heuristics. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.61-65. [PDF, 348KB]

(2012) Akihiro Tamura, Taro Watanabe, & Eiichiro Sumita: Bilingual lexicon extraction from comparable corpora using label propagation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.24-36. [PDF, 347KB]

(2012) Dominikus Wetzel & Francis Bond: Enriching parallel corpora for statistical machine translation with semantic negation rephrasing. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.20-29. [PDF, 163KB]

(2012) Nan Yang, Mu Li, Dongdong Zhang, & Nenghai Yu: A ranking-based approach to word reordering for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.912-920. [PDF,

(2011) Takako Aikawa & Spencer Rarrick: Are numbers good enough for you? A linguistically meaningful MT evaluation method. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.332-337. [PDF, 145KB]

(2011) Chooi-Ling Goh, Takashi Onishi, & Eiichiro Sumita: Rule-based reordering constraints for phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.113-120. [PDF, 446KB]

(2011) Isao Goto, Masao Utiyama, Takashi Onishi, & Eiichiro Sumita: A comparison study of parsers for patent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.448-455. [PDF, 156KB]

(2011) Bin Lu, Ka Po Chow, & Benjamin K.Tsou: The cultivation of a Chinese-English-Japanese trilingual parallel corpus from comparable patents. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.472-479. [PDF, 207KB]

 (2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Tsuyoshi Okita & Andy Way: MWE-sensitive word alignment in factored translation model. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 42KB]; presentation: 51 slides [PDF of PPT, 650KB]

(2011) Takashi Onishi, Masao Utiyama, & Eiichiro Sumita: Reordering constraint based on document-level context. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.434-438. [PDF, 128KB]

(2011) Manny Rayner, Paula Estrella, & Pierrette Bouillon: Bootstrapping a statistical speech translator from a rule-based one. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.21-28. [PDF, 199KB]

 (2011) David Talbot, Hideto Kazawa, Hiroshi Ichikawa, Jason Katz-Brown, Masakazu Seno, & Franz J.Och: A lightweight evaluation framework for machine translation reordering. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.12-21. [PDF, 371KB]

(2011) Masao Utiyama, Graham Neubig, Takashi Onishi, & Eiichiro Sumita: Searching translation memories for paraphrases.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.325-331. [PDF, 86KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2011) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Extracting pre-ordering rules from predicate-argument structures. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.29-37. [PDF, 1079KB]

(2010) Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita, & Kyo Kageura: Community-based construction of draft and final translation corpus through a translation hosting site Minna no Hon’yaku (MNH).  LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3662-3669. [PDF, 1695KB]

(2010) Andrew Finch & Eiichiro Sumita: A Bayesian model of bilingual segmentation for transliteration. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.259-266. [PDF, 511KB]

(2010) Hideki Isozaki, Katsuhito Sudoh, Hajime Tsukada, & Kevin Duh: Head finalization: a simple reordering rule for SOV languages.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 133-137. [PDF, 321KB]

(2010) Young-Suk Lee, Bing Zhao, & Xiaoqiang Luo: Constituent reordering and syntax models for English-to-Japanese statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.626-634. [PDF, 214KB]

(2010) Takashi Onishi, Masao Utiyama & Eiichiro Sumita: Paraphrase lattice for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.1-5. [PDF, 419KB]

(2010) Manny Rayner, Paula Estrella, & Pierrette Bouillon: A bootstrapped interlingua-based SMT architecture. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 611KB]; presentation: 30 slides [PDF, 737KB]

(2010) Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, Tsutomu Hirao, & Masaaki Nagata: Divide and translate: improving long distance reordering in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 418-427. [PDF, 569KB]

(2010) Kirti Vashee & Rustin Gibbs: Scenarios for customizing an SMT engine based on availability of data. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 153KB]

English to Korean

(2013) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Robust transliteration mining from comparable corpora with bilingual topic models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.261-269. [PDF, 460KB]

(2012) Chun-Kai Wu, Yu-Chun Wang, & Richard Tzong-Han Tsai: English-Korean named entity transliteration using substring alignment and re-ranking rules. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.57-60. [PDF, 103KB]

(2011) Hyoung-Gyu Lee, Joo-Young Lee, Min-Jeong Kim, Hae-Chang Rim, Joong-Hwi Shin, & Young-Sook Hwang: Phrase segmentation model using collocation and translational entropy.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.198-205. [PDF, 203KB]

(2011) Yu-Chun Wang & Richard Tzong-Han Tsai: English-Korean named entity transliteration using statistical substring-based and rule-based approaches. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.32-35. [PDF, 237KB]

(2010) Minwoo Jeong, Kristina Toutanova, Hisami Suzuki, & Chris Quirk: A discriminative lexicon model for complex morphology. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 374KB]

(2010) Jae-Hee Lee, Seung-Wook Lee, Gumwon Hong, Young-Sook Hwang, Sang-Bum Kim, & Hae-Chang Rim: A post-processing approach to statistical word alignment reflecting alignment tendency between part-of-speeches. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.623-629. [PDF, 438KB]

English to Latvian

(2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

(2011) Raivis Skadiņš, Kārlis Goba, & Valters Šics: Improving SMT with morphology knowledge for Baltic languages. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 3pp. [PDF, 171KB]; presentation: 24 slides [PDF of PPT, 342KB]

(2011) Raivis Skadiņš, Maris Puriņš, Inguna Skadiņa, & Andrejs Vasiļjevs: Evaluation of SMT in localization to under-resourced inflected language. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.35-40. [PDF, 287KB]; presentation, 17 slides [PDF, 796KB]

 (2011) Andrejs Vasiļjevs, Raivis Skadiņš, & Inguna Skadiņa: Towards application of user-tailored machine translation in localization.  Proceedings of the Third Joint EM+/CNGL Workshop “Bringing MT to the User: Research Meets Translators” (JEC ’11), Luxembourg, 14 October 2011; pp.23-31. [PDF, 1896KB]

(2010) Maxim Khalilov, Lauma Pretkalniņa, Natalja Kuvaldina, & Veronika Pereseina: SMT of Latvian, Lithuanian and Estonian languages: a comparative study. Human Language Technologies – the Baltic Perpective,  International Conference, Riga, Latvia, October 8, 2010; 8pp. [PDF, 145KB]

(2010) Maxim Khalilov, José A.R.Fonollosa, Inguna Skadiņa, Edgar Brālītis, & Lauma Pretkalniņa: Towards improving English-Latvian translation: a system comparison and a new rescoring feature. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1719-1725. [PDF, 398KB]

 (2010) Raivis Skadiņš, Kārlis Goba, & Valters Šics: Improving SMT for Baltic languages with factored models. Human Language Technologies – the Baltic Perpective, 4th International Conference, Riga, Latvia, October 8, 2010; 17 slides [PDF of PPT, 742KB]

English to Lithuanian

(2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

(2010) Maxim Khalilov, Lauma Pretkalniņa, Natalja Kuvaldina, & Veronika Pereseina: SMT of Latvian, Lithuanian and Estonian languages: a comparative study. Human Language Technologies – the Baltic Perpective,  International Conference, Riga, Latvia, October 8, 2010; 8pp. [PDF, 145KB]

English to Luo

(2010) Guy De Pauw, Naomi Maajabu, & Peter Waiganjo Wagacha: A knowledge-light approach to Luo machine translation and part-of-speech tagging. [LREC 2010]: Proceedings of the Second Workshop on African Language Technology, AFLAT 2010, Valetta, Malta; pp.15-20. [PDF, 141KB]

English to Malay

(2014) Mozhgan Ghassemiazghandi & Tengku Sepora Tengku Mahadi: Losses and gains in computer-assisted translation: some remarks on online translation of English to Malay. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.194-201. [PDF, 160KB]

(2013) Lian Tze Lim, Lay-Ki Soon, Tek Yong Lim, Enya Kong Tang, & Bali Ranaivo-Malançon: Context-dependent multilingual lexical lookup for under-resourced languages.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.294-299. [PDF, 373KB]

(2010) Lianhau Lee, Aiti Aw, Min Zhang, & Haizhou Li: EM-based hybrid model for bilingual terminology extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.639-646. [PDF, 114KB]

English to Malayalam

(2012) Jayan V, Sunil R, & Bhadran V K: Disambiguation of pre/post positions in English – Malayalam text translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 93-101. [PDF, 249KB]

English to Manipuri

(2013) Thoudam Doren Singh: Taste of two different flavors: which Manipuri script works better for English-Manipuri language pair SMT stsrem? [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 11-18. [PDF, 162KB]

(2012) Thoudam Doren Singh: Addressing some issues of data sparsity towards improving English-Manipuri SMT using morphological information. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.46-54. [PDF, 454KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.83-91. [PDF, 510KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Statistical machine translation of English-Manipuri using morpho-syntactic and semantic information. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 680KB]

English to Maori

(2011) Te Taka Keegan & Hori Manuirirangi: Minority languages & translation technologies case study: te reo Māori & Google Translator Toolkit. Translating and the Computer 33, 17-18 November 2011, London; 6pp. [PDF, 339KB]

English to Norwegian

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

English to Oromo

(2010) Sisay Adugna & Andreas Eisele: English-Oromo machine translation: an experiment using a statistical approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2196-2199. [PDF, 368KB]

English to Pashto

(2011) Sankaranarayanan Ananthakrishnan, Rohit Prasad, & Prem Natarajan: On-line language model biasing for statistical machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.445-449. [PDF, 95KB]

 (2011) Sankaranarayanan Ananthakrishnan, Shiv Vitaladevuni, Rohit Prasad, & Prem Natarajan: Source error-projection for sample selection in phrase-based SMT for resource-poor languages. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.819-827. [PDF, 425KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: Discriminative sample selection for statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.626-635. [PDF, 329KB]

(2010) Sankaranarayanan Ananthakrishnan, Rohit Prasad, David Stallard, & Prem Natarajan: A semi-supervised batch-mode active learning strategy for improved statistical machine translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.126-134. [PDF, 256KB]

(2010) Bing Xiang, Yonggang Deng, & Bowen Zhou: Diversify and combine: improving word alignment for machine translation on low-resource languages. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.22-26. [PDF, 86KB]

English to Persian

(2013) Nicola Bertoldi, M.Amin Farajian, Prashant Mathur, Nicholas Ruiz, & Marcello Federico: FBK’s machine translation systems for the IWSLT 2013 evaluation campaign. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 134KB]

(2013) Sajad Ebrahimi, Kourosh Meshgi, Shahram Khadivi, & Mohammad Ebrahim Shiri Ahmad Abady: Meta-level statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1151-1157. [PDF, 264KB]

(2012) Jacob Dlougach & Irina Galinskaya: Building a reordering system using tree-to-string hierachical model. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.27-36. [PDF, 1115KB]

(2012) Rohit Gupta, Raj Nath Patel, & Ritesh Shah: Learning improved reordering models for Urdu, Farsi and Italian using SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.37-45. [PDF, 520KB]

(2012) Fattaneh Jabbari, Somayeh Bakhshaei, Seyed Mohammad Mohammadzadeh Ziabary, & Shahram Khadivi: Developing an open-domain English-Farsi translation system using AFEC, Amirkabir bilingual Farsi-English corpus. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.17-23. [PDF, 683KB]

(2012) Mahsa Mohaghegh, Abdolhossein Sarrafzadeh, & Mehdi Mohammadi: GRAFIX: automated rule-based post editing system to improve English-Persian SMT output. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.873-882. [PDF, 405KB]

(2011) Pradeep Dasigi & Mona Diab: Named entity transliteration generation leveraging statistical machine translation technology. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.106-111. [PDF, 116KB]

(2011) Najmeh Mousavi Nejad, Shahram Khadivi, & Kaveh Taghipour: The Amirkabir machine transliteration system for NEWS 2011: Farsi-English task. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.91-95. [PDF, 116KB]

(2011) Najmeh Mousavi Nejad & Shahram Khadivi: An unsupervised alignment model for sequence labeling: application to name transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.73-81. [PDF, 155KB]

(2010) Jalal Maleki: Syllable based transcription of English words into Perso-Arabic writing system. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 62-65. [PDF, 341KB]

(2010) Mahsa Mohaghegh, Abdolhossein Sarrafzadeh, & Tom Moir: Improved language modeling for English-Persian statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.75-82. [PDF, 167KB]

(2010) Mahsa Mohaghegh & Abdolhossein Sarrafzadeh: Performance evaluation of various training data in English-Persian statistical machine translation. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1092-1100. [PDF, 2233KB]

(2010) Mohammad Taher Pilevar & Heshaam Faili: PersianSMT: a first attempt to English-Persian statistical machine translation. JADT 2010: 10th International Conference on Statistical Analysis of Textual Data, 9-11 juin 2010, Rome, Italie; pp.1101-1111. [PDF, 945KB]

English to Polish

 (2014) Raivis Skadiņš, Mārcis Pinnis, Andrejs Vasiļjevs, Inguna Skadiņa, & Tomas Hudik: Application of machine translation in localization into low-resourced languages.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.209-216. [PDF, 871KB]

(2010) Kirti Vashee & Rustin Gibbs: Scenarios for customizing an SMT engine based on availability of data. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 153KB]

English to Portuguese (see also English to Brazilian Portuguese)

 (2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Angela Costa, Tiago Luís, & Luísa Coheur: Translation errors from English to Portuguese: an annotated corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1231-1234. [PDF, 407KB]

 (2013) Anabela Barreiro, Johanna Monti, Brigitte Orliac, & Fernando Batista: When multiwords go bad in machine translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.26-33. [PDF, 219KB]

(2012) Wilker Aziz, Sheila C. M. de Sousa, & Lucia Specia: Cross-lingual sentence compression for subtitles. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.103-110. [PDF, 233KB]

(2012) Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes, & Luísa Coheur: An English-Portuguese parallel corpus of questions: translation guidelines and application in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2172-2176. [PDF, 388KB]

(2011) Maria José Machado & Hilário Leal Fontes: Machine translation: case study – English into Portuguese – evaluation of Moses in DGT Portuguese Language Department using Moses for Mere Mortals. A Folha: Boletim da Língua Portuguesa nas Instituições Europeias, no.37, autumn 2011; English version, 11pp. [PDF, 967KB]

 (2011) Sheila C.M.de Sousa, Wilker Aziz & Lucia Specia: Assessing the post-editinng effort for automatic and semi-automatic translations of DVD subtitles. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.97-103. [PDF, 109KB]

(2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

English to Romanian

(2014) Dan Tufiş, Radu Ion, Ştefan Dumitrescu, & Dan Ştefănescu: Large SMT data-sets extracted from Wikipedia. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.656-663. [PDF, 232KB]

(2012) Elena Irimia: Experimenting with extracting lexical dictionaries from comparable corpora for English-Romanian language pair.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.49-55. [PDF, 738KB]

(2012) Dan Ştefănescu: Mining for term translations in comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.98-103. [PDF, 616KB]

(2012) Dan Tufiş & Ştefan Daniel Dumitrescu: Cascaded phrase-based statistical machine translation systems. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.129-136. [PDF, 252KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

 (2011) Monica Gavrila & Natalia Elita: Comparing corpus-based MT approaches using restricted resources. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.43-49. [PDF, 304KB]

 (2011) Monica Gavrila & Natalia Elita: Experiments with small-sized corpora in CBMT. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 2011, Hissar, Bulgaria, 13 September 2011; pp.67-72. [PDF, 113KB]

 (2011) Monica Gavrila & Cristina Vertan: Training data in statistical machine translation – the more, the better?  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.551-556. [PDF, 282KB]

(2011) Monica Gavrila: Constrained recombination in an example-based machine translation system. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. M ikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.193-200. [PDF, 417KB]; presentation, 36 slides [PDF]

English to Russian

(2015) Ke Tran, Arianna Bisazza, & Christof Monz: A distributed inflection model for translating into morphologically rich languages. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.145-159. [PDF, 720KB]

(2014) Irina Burukina: Translating implicit elements in RBMT. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.182-193. [PDF, 209KB]

(2014) Irina Galinskaya, Valentin Gusev, Elena Meshcheryakova, & Mariya Shmatova: Measuring the impact of spelling errors on the quality of machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2683-2689. [PDF, 448KB]

(2013) Waleed Ammar, Victor Chahuneau, Michael Denkowski, Greg Hanneman, Wang Ling, Austin Matthews, Kenton Murray, Nicola Segall, Alon Lavie & Chris Dyer: The CMU machine translation systems at WMT 2013: syntax, synthetic translation options, and pseudo-references.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.70-77. [PDF, 201KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Alexey Borisov,Jacob Dlougach & Irina Galinskaya: Yandex School of Data Analysis machine translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.99-103. [PDF, 125KB]

(2013) Pavel Braslavski, Alexander Beloborodov, Maxim Khalilov, & Serge Sharoff: English→Russian MT evaluation campaign. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.262-267. [PDF, 162KB]

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

(2013) Kevin Duh, Graham Neubig, Katsuhito Sudoh, & Hajime Tsukada: Adapation data selection using neural language models: experiments in machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.678-683. [PDF, 215KB]

(2013) Nadir Durrani, Alexander Fraser, Helmut Schmid, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions of OSM systems at WMT13.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.122-127. [PDF, 166KB]

(2013) Stéphane Huet, Elena Manishina & Fabrice Lefèvre: Factored Machine Translation Systems for Russian-English. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.154-157. [PDF, 98KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Patrick Simianer, Laura Jehl, & Stefan Riezler: The Heidelberg University machine translation systems for IWSLT2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 224KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Angelina Ivanova: Evaluation of a bilingual dictionary extracted from Wikipedia.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.62-66. [PDF, 337KB]

(2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2011) Igor Boguslavsky: Processing of Russian by the ETAP-3 linguistic processor. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 110KB]; presentation: 48 slides [PDF of PPT, 227KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2010) Peter Nabende: Mining transliterations from Wikipedia using pair HMMs. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.76-80. [PDF, 145KB]

English to Slovenian

(2013) Joern Wuebker, Stephan Peitz, Tamer Alkhouli, Jan-Thorsten Peter, Minwei Feng, Markus Freitag & Hermann Ney: The RWTH Aachen machine translation systems for IWSLT 2013. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 128KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

(2012) Špela Vintar, Darja Fišer, & Aljoša Vrščaj: Were the clocks striking or surprising? Using WSD to improve MT performance. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.87-92. [PDF, 283KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Darja Fišer, Nikola Ljubešić, Špela Vintar, & Senja Pollak: Building and using comparable corpora for domain-specific bilingual lexicon extraction. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.19-26. [PDF, 287KB]

(2011) Špela Vintar & Darja Fišer: Enriching Slovene WordNet with domain-specific terms. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.29-44. [PDF, 631KB]

English to Spanish

(2015) Miguel Á.Candel-Mora: Evaluation of English to Spanish MT output of Tourism 2.0 consumer-generated reviews with post-editing purposes. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.37-47. [PDF, 284KB]

(2015) Christophe Servan, Ngoc Tien Le, Ngoc Quang Luong, Benjamin Lecouteux, & Laurent Besacier: An open-source toolkit for word-level confidence estimation in machine translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.196-203. [PDF, 3.2MB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Daniel Beck, Kashif Shah, & Lucia Specia: SHEF-Lite 2.0: sparse multi-task Gaussian processes for translation quality estimation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.307-312. [PDF, 306KB]

(2014) Grégoire Détrez, Víctor M.Sánchez-Cartagena, & Aarne Ranta: Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4394-4400. [PDF, 146KB]

(2014) Jennifer Drexler, Pushpendre Rastogi, Jacqueline Aguilar, Benjamin Van Durme, & Matt Post: A Wikipedia-based corpus for contextualized machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3593-3596. [PDF, 72KB]

(2014) Kevin Flanagan; Bilingual phrase-to-phrase alignment for arbitrarily-small datasets. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 83-95. [PDF, 1203KB]

(2014) Scott Gaskill: Reducing time and tedium with translation technology: the six-pound challenge. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.25-30. [PDF, 1034KB]

(2014) Yvette Graham, Nikita Mathur, & Timothy Baldwin: Randomized significance tests in machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.266-274. [PDF, 358KB]

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Arle Lommel, Aljoscha Burchardt, Maja Popović, Kim Harris, Eleftherios Avramidis, & Hans Uszkoreit: Using a new analytic measure for the annotation and analysis of MT errors on real data. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.165-172. [PDF, 1781KB]

(2014) Sharon O’Brien, Joss Moorkens, Joris Vreeke: Kanjingo: a mobile app for post-editing. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Workshop on Post-editing Technology and Practice (WPTP-3); pp.125-127. [PDF, 231KB]

(2014) John E.Ortega, Felipe Sánchez-Martinez, & Mikel L.Forcada: Using any machine translation source for fuzzy-match repair in a computer-aided translation setting. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.42-53. [PDF, 557KB]

(2014) Maja Popović, Arle Lommel, Aljoscha Burchardt, Eleftherios Avramidis, & Hans Uszkoreit: Relations between different types of post-editing operations, cognitive effort and temporal effort. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.191-198. [PDF, 402KB]

(2014) Lucia Specia, & Kashif Shah: Predicting human translation quality. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.288-300. [PDF, 717KB]

(2014) Dan Tufiş, Radu Ion, Ştefan Dumitrescu, & Dan Ştefănescu: Large SMT data-sets extracted from Wikipedia. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.656-663. [PDF, 232KB]

(2014) Marco Turchi & Bruno Kessler: Automatic annotation of machine translation datasets with binary quality  judgements.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1788-1792. [PDF, 99KB]

(2013) V.Alabau, J.González-Rubio, L.A.Leiva, D.Ortiz-Martínez, G.Sanchis-Trilles, F.Casacuberta, B.Meso-Lao, R.Bonk. M.Carl, & M.García-Martínez: User evaluation of advanced interaction features for a computer-assisted translation workbench. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.361-368. [PDF, 1166KB]

(2013) Eleftherios Avramidis & Maja Popović: Selecting feature sets for comparative and time-oriented quality estimation of machine translation output. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.329-336. [PDF, 196KB]

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2013) Karel Bílek & Daniel Zeman: CUni multilingual matrix in the WMT 2013 shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.85-91. [PDF, 246KB]

(2013) Marine Carpuat: NRC: a machine translation approach to cross-lingual word sense disambiguation (SemEval-2013 Task 10). [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.188-192. [PDF, 100KB]

(2013) Kevin Duh, Graham Neubig, Katsuhito Sudoh, & Hajime Tsukada: Adapation data selection using neural language models: experiments in machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.678-683. [PDF, 215KB]

(2013) Mark Fishel: Ranking translations using error analysis and quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.405-407. [PDF, 98KB]

(2013) Lluís Formiga, Marta R. Costa-jussà, José B. Mariño, José A. R. Fonollosa, Alberto Barrón-Cedeño & Lluis Marquez: The TALP-UPC phrase-based translation systems for WMT13: system combination with morphology generation, domain adaptation and corpus filtering. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.134-140. [PDF, 159KB]

(2013) Lluís Formiga, Meritxell Gonzàlez, Alberto Barrón-Cedeño, José A. R. Fonollosa & Lluís Màr­quez: The TALP-UPC approach to system selection: Asiya features and pairwise classification using ran­dom forests. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.359-364. [PDF, 181KB]

(2013) Jesús González-Rubio, J.Ramón Navarro-Cerdán & Francisco Casacuberta: Emprical study of a two-step approach to estimate translation quality. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 384KB]

(2013) David Langlois & Kamel Smaili: LORIA system for the WMT13 quality estimation shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.380-385. [PDF, 177KB]

(2013) Adrian Laurenzi, Megumu Brownstein, Anne M.Turner, & Katrin Kirchhoff: Integrated post-editing and translation management for lay user communities.  Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 27-34. [PDF, 357KB]

(2013) Oscar Mendoza Rivera, Ruslan Mitkov, & Gloria Corpas Pastor: A flexible framework for collocation retrieval and translation from parallel and comparable corpora. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.18-25. [PDF, 356KB]

(2013) Tsuyoshi Okita, Qun Liu, & Josef van Genabith: Shallow semantically-informed PBSMT and HPBSMT. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.177-184. [PDF, 281KB]

(2013) Raphael Rubino, Antonio Toral, Santiago Cortés Vaíllo, Jun Xie, Xiaofeng Wu, Stephen Doherty, & Qun Liu: The CNGL-DCU-Prompsit translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.213-218. [PDF, 152KB]

(2013) Raphael Rubino, Joachim Wagner, Jennifer Foster, Johann Roturier, Rasoul Samad Zadeh Kaljahi & Fred Hollowood: DCU-Symantec at the WMT 2013 quality estimation shared task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.392-397. [PDF, 162KB]

(2013) Alex Rudnick & Michael Gasser: Lexical selection for hybrid MT with sequence labeling. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.102-108. [PDF, 251KB]

(2012) Vicent Alabau, Luis A. Leiva, Daniel OrtizMartínez, & Francisco Casacuberta: User evaluation of interactive machine translation systems. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.20-23. [PDF, 411KB]

(2012) Eleftherios Avramidis, Marta R.Costa-jussà, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

(2012) Amittai Axelrod, QingJun Li, & William D.Lewis: Applications of data selection via cross-entropy difference for real-world statistical machine translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.201-208. [PDF, 610KB]

(2012) Alexandra Balahur & Marco Turchi: Multilingual sentiment analysis using machine translation. [ACL 2012] Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, Jeju, Republic of Korea, 12 July 2012; pp.52-60. [PDF, 133KB]

(2012) Michael Denkowski & Alon Lavie: Challenges in predicting machine translation utility for human post-editors. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 166KB]

(2012) Stephen Doherty & Sharon O’Brien: A user-based usability assessment of raw machine translated technical instructions. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 143KB]

(2012) Cecilia Elizalde, Bruno Pouliquen, Christophe Mazenc & José García-Verdugo: TAPTA4UN: collaboration on machine translation between the World Intellectual Property Organization and the United Nations. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 3pp. [PDF, 168KB]

(2012) Mariano Felice & Lucia Specia: Linguistic features for quality estimation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.96-103. [PDF, 257KB]

(2012) Lluís Formiga & José A.R.Fonollosa: Dealing with input noise in statistical machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.319-328. [PDF, 129KB]

(2012) Lluís Formiga, Adolfo Hernández, José B.Mariño, & Enric Monte: Improving English to Spanish out-of-domain translations by morphology generalization and generation. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.6-16. [PDF, 274KB]

(2012) Lluís Formiga, Carlos A.Henríquez Q., Adolfo Hernández, José B.Mariño, Enric Monte, & José A.R.Fonollosa: The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.275-282. [PDF, 500KB]

(2012) Jesús González Rubio, Alberto Sanchís, & Francisco Casacuberta: PRHLT submission to the WMT12 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.104-108. [PDF, 111KB]

(2012) Katrin Kirchhoff, Daniel Capurro, & Anne Turner: Evaluating user preferences in machine translation using conjoint analysis. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.119-126. [PDF, 223KB]

(2012) Philipp Koehn & Barry Haddow: Toward effective use of training data in statistical machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.317-321. [PDF, 89KB]

(2012) Alexander Molchanov: PROMT DeepHybrid system for WMT12 shared translation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.345-348. [PDF, 88KB]

(2012) Daniele Pighin, Lluís Màrquez, & Jonathan May: An analysis (and an annotated corpus) of user responses to machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1131-1135. [PDF, 355KB]

(2012) Daniele Pighin, Lluís Màrquez, & Lluís Formiga: The FAUST corpus of adequacy assessments for real-world machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.29-35. [PDF, 389KB]

(2012) Daniele Pighin, Lluís Formiga, & Lluís Màrquez: A graph-based strategy to streamline translation quality assessments. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 243KB]

(2012) Daniele Pighin, Meritxell González, & Lluís Màrquez: The UPC submission to the WMT 2012 shared task on quality estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.127-132. [PDF, 123KB]

(2012) Bruno Pouliquen, Christophe Mazenc, Cecilia Elizalde, & Jose Garcia-Verdugo: Statistical machine translation prototype using UN parallel documents. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.12-19. [PDF, 251KB]

(2012) Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann Roturier, Rasul Samad Zadeh Kaljahi, & Fred Hollowood: DCU-Symantec submission for the WMT 2012 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.138-144. [PDF, 125KB]

(2012) Víctor M.Sánchez-Cartagena, Miquel Esplà-Gomis, & Juan Antonio Pérez-Ortiz: Source-language dictionaries help non-expert users to enlarge target-language dictionaries for machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3422-3429. [PDF, 416KB]

(2012) Antonio Toral, Marc Poch, Pavel Pecina, & Gregor Thurmair: Efficiency-based evaluation of aligners for industrial applications. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.57-60. [PDF, 210KB]

(2011) Wilker Aziz, Miguel Rios, & Lucia Specia: Shallow semantic trees for SMT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.316-322. [PDF, 106KB]

(2011) Luciano Barbosa, Srinivas Bangalore, & Vivek Kumar Sridhar Rangarajan: Crawling back and forth: using back and out links to locate bilingual sites. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.429-437. [PDF, 192KB]

(2011) Christian Federmann, Sabine Hunsicker, Yu Chen, & Rui Wang: DFKI system combination using syntactic information at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.104-109. [PDF, 294KB]

(2011) Maxim Khalilov & Khalil Sima’an: Context-sensitive syntactic source-reordering by statistical translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.38-46. [PDF, 327KB]

(2011) Véronique López-Ludeña & Rubén San-Segundo: UPM system for the translation task. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.420-425. [PDF, 160KB]

(2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

(2011) Yashar Mehdad, Matteo Negri, & Marcello Federico: Using bilingual parallel corpora for cross-lingual textual entailment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1336-1345. [PDF, 236KB]

 (2011)  Daniel Ortiz Martínez: Advances in fully-automatic and interactive phrase-based statistical machine translation. Thesis presented at Universidad Politécnica de Valencia, Departamento de Sistemas Informáticos y Computación, September 2011; 259pp. [PDF, 2121KB]

(2011) Víctor M.Sánchez-Cartagena, Felipe Sánchez-Martínez, & Juan Antonio Pérez-Ortíz: Integrating shallow-transfer rules into phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.562-569. [PDF, 106KB]

(2011) Felipe Sánchez-Martínez: Choosing the best machine translation system to translate a sentence by using only source-language information. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.97-104. [PDF, 338KB] ; presentation, 32 slides [PDF]

 (2011) Luciana M.Scocco: Machine translation aid to the test: a study on the benefits of post-editing for translation into the weaker language. Dissertation submitted as part requirement for Master of Arts Translation and Interpreting Studies, University of Western Sydney, 4th November 2011; 14pp. [PDF, 170KB]

(2011) Mark Seligman & Mike Dillinger: Real-time multi-media translation for healthcare: a usability study. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.595-602. [PDF, 574KB]

  (2011) Marianne Starlander & Paula Estrella: Looking for the best evaluation method for interlingua-based spoken language translation in the medical domain. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.81-92. [PDF, 905KB]

(2011) Arda Tezcan & Vincent Vandeghinste: SMT-CAT integration in a technical domain: handling XML markup using pre & post-processing methods. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.55-62. [PDF, 343KB]

(2011) Antonio Toral & Andy Way: Automatic acquisition of named entities for rule-based machine translation.  Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.37-43. [PDF, 151KB]

 (2011) Petra Wolf, Ulrike Bernardi, Christian Federmann, & Sabine Hunsicker: From statistical term extraction to hybrid machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.225-232. [PDF, 329KB]; presentation, 24 slides [PDF]

 (2011) Francisco Zamora-Martínez & M.J.Castro-Bleda: CEU-UPV English-Spanish system for WMT11. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.490-495. [PDF, 364KB]

(2010) Giselle de Almeida & Sharon O’Brien: Analysing post-editing performance: correlations with years of translation experience. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 511KB]

 (2010) Ergun Biçici & S.Serdar Kozat: Adaptive model weighting and transductive regression for predicting best system combinations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 276-281. [PDF, 181KB]

 (2010) Ergun Biçici & Deniz Yuret: L1 regularized regression for reranking and system combination in machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 282-289. [PDF, 197KB]

(2010)  Marta R.Costa-jussà, Vidas Daudaravicius, & Rafael E.Banchs: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 975KB]

(2010) Christian Federmann, Andreas Eisele, Hans Uszkoreit, Yu Chen, Sabine Hunsicker, & Jia Xu: Further experiments with shallow hybrid MT systems.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 77-81. [PDF, 118KB]

(2010) Gregor Leusch & Hermann Ney: The RWTH system combination system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 315-320. [PDF, 152KB]

(2010) Matteo Negri & Yashar Mehdad: Creating bi-lingual entailment corpus through translations with Mechanical Turk: $100 for a 10-day rush. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.212-216. [PDF, 101KB]

(2010) Daniel Ortiz-Martínez, Ismael García-Varea, & Francisco Casacuberta: Online learning for interactive statistical machine translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.546-554. [PDF, 284KB]

 (2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Mirko Plitt & François Masselot: A productivity test of statistical machine translation post-editing in a typical localization context. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.7-16. [PDF, 387KB]

(2010) Luciana Cecilia Ramos: Post-editing free machine translation: from a language vendor’s perspective. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 5pp. [PDF, 37KB]

(2010) Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez, & Francisco Casacuberta: UPV-PRHLT English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 172-176. [PDF, 90KB]

(2010 Lane Schwartz: Reproducible results in parsing-based machine translation: the JHU shared task submission.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 177-182. [PDF, 101KB]

(2010) Radu Soricut & Abdessamad Echihabi: TrustRank: inducing trust in automatic translations via ranking. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.612-621. [PDF, 222KB]

(2010) Lucia Specia, Nicola Cancedda, & Marc Dymetman: A dataset for assessing machine translation evaluation metrics. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3375-3378. [PDF, 362KB]

(2010) Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, & Nanda Kambhatla: Syntax based reordering with automatically derived rules for improved statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1119-1127. [PDF, 169KB]

(2010) Francisco Zamora-Martínez & Germán Sanchis-Trilles: UCH-UPV English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 207-211. [PDF, 451KB]

English to Swahili

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

(2011) Guy De Pauw, Peter Waiganjo Wagacha, & Gilles-Maurice de Schryver: Towards English-Swahili machine translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23-27 January, 2011; 2pp. [PDF, 77KB]

English to Swedish

(2014) Lars Borin, Jens Allwood, & Gerard de Melo: Bring vs.MTRoget: evaluating automatic thesaurus translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2115-2121. [PDF, 252KB]

(2013) Sara Stymne, Christian Hardmeier, Jörg Tiedemann, & Joakim Nivre: Feature weight optimization for discourse-level MT.  Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.60-69. [PDF, 363KB]

(2012) Maria Holmqvist, Sara Stymne, Lars Ahrenberg, & Magnus Merkel: Alignment-based reordering for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3436-3440. [PDF, 300KB]

(2012) Danielle L.Mowery, Sumithra Velupillai, & Wendy W.Chapman: Medical diagnosis lost in translation – analysis of uncertainty and negation expressions in English and Swedish clinical texts. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing (BioNLP 2012), Montréal, Canada, June 8, 2012; pp.56-64. [PDF, 288KB]

(2012) Lene Offersgaard & Dorte Haltrup Hansen: SMT systems for less-resourced languages based on domain-specific data.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.75-80. [PDF, 380KB]

(2012) Sara Stymne, Henrik Danielsson, Sofia Bremin, Hongzhan Hu, Johanna Karlsson, Anna Prytz Lillkull, & Martin Wester: Eye tracking as a tool for machine translation error analysis.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1121-1126. [PDF, 357KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

(2010) Lars Ahrenberg: Alignment-based profiling of Europarl data in an English-Swedish parallel corpus. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3398-3404. [PDF, 349KB]

(2010) Maria Holmqvist: Heuristic word alignment with parallel phrases. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.744-748. [PDF, 307KB]

(2010) Sara Stymne & Lars Ahrenberg: Using a grammar checker for evaluation and postprocessing of statistical machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2175-2181. [PDF,

(2010) Jörg Tiedemann: Lingua-Align: an experimental toolkit for automatic tree-to-tree alignment.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.736-743. [PDF, 296KB]

(2010) Kirti Vashee & Rustin Gibbs: Scenarios for customizing an SMT engine based on availability of data. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 4pp. [PDF, 153KB]

English to Tamil

(2012) Loganathan Ramasamy, Ondřej Bojar, & Zdeněk Žabokrtský: Morphological processing for English-Tamil statistical machine translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.113-122. [PDF, 542KB]

(2011) Abhijit Bhole, Goutham Tholpadi, & Raghavendra Udupa: Mining multi-word named entity equivalents from comparable corpora. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.65-72. [PDF, 820KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Peter Nabende: Mining transliterations from Wikipedia using dynamic Bayesian networks. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.385-391. [PDF, 580KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

English to Thai

(2011) Canasai Kruengkrai, Thatsanee Charoenporn, & Virach Sornlertlamvanich: Simple discriminative training for machine transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.28-31. [PDF, 514KB]

(2010) Sanooch Nathalang: Don’t use big words with me: an evaluation of English-Thai statistical-based machine translation [abstract]. UCCTS 2010: Using Corpora in Contrastive and Translation Studies, Edge Hill University, UK, 27-29 July 2010; p.11. [PDF, 90KB]

(2010) Chai Wutiwiwatchai & Ausdang Thangthai: Syllable-based Thai-English machine transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.66-70. [PDF, 189KB]

English to Turkish

(2014) Burak Aydın & Arzucan Özgür: Expanding machine translation training data with an out-of-domain corpus using language modeling based vocabulary saturation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.180-192. [PDF, 523KB]

(2012) M.Talha Çakmak, Süleyman Acar, & Gülşen Eryiğit: Word alignment for English-Turkish language pair.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2177-2180. [PDF, 577KB]

 (2012) Sandipan Dandapat, Sara Morrissey, Abdy Way, & Joseph van Genabith: Combining EBMT, SMT, TM and IR technologies for quality and scale. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.48-58. [PDF, 248KB]

 (2012) Jie Jiang, Andy Way, & Rejwanul Haque: Translating user-generated content in the social networking space. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 9pp. [PDF, 130KB]

(2011) Sandipan Dandapat, Sara Morrissey, Andy Way, & Mikel L.Forcada: Using example-based MT to support statistical MT when translating homogeneous data in a resource-poor setting. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.201-208. [PDF, 413KB]; presentation, 24 slides [PDF]

(2011) Kemal Oflazer: Syntax-to-morphology mapping in factored SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; presentation: 60 slides [PDF of PPT, 491KB]

(2010) Reyyan Yeniterzi & Kemal Oflazer: Syntax-to-morphology mapping in factored phrase-based statistical machine translation from English to Turkish. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.454-464. [PDF, 1381KB]

English to Urdu

(2014) Nadir Durrani & Philipp Koehn: Improving machine translation via triangulation and transliteration. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.71-78. [PDF, 452KB]

(2012) Jacob Dlougach & Irina Galinskaya: Building a reordering system using tree-to-string hierachical model. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.27-36. [PDF, 1115KB]

(2012) Rohit Gupta, Raj Nath Patel, & Ritesh Shah: Learning improved reordering models for Urdu, Farsi and Italian using SMT. COLING: Proceedings of the Workshop on Reordering for Statistical Machine Translation, 9 December 2012; pp.37-45. [PDF, 520KB]

(2011) Bushra Jawaid & Daniel Zeman: Word-order issues in English-to-Urdu statistical machine translation. Prague Bulletin of Mathematical Linguistics, no.95, April 2011; pp.87-106. [PDF, 1006KB]

(2011) V.Shukla & R.M.K.Sinha: Divergence patterns for Urdu to English and English to Urdu translation. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.21-28. [PDF, 773KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

English to Vietnamese

(2015) Thanh-Le Ha, Jan Niehues, Eunah Cho, Mohammed Mediani & Alex Waibel: The KIT translation systems for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.62-69. [PDF, 3.0MB]

(2015) Minh-Thang Luong & Christopher Manning: Stanford neural machine translation systems for spoken language domains. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.76-79. [PDF, 1.2MB]

(2015) Viet Tran Hong, Huyen Vu Thong, Trung Le Tien, Luan Ngia Pham, & Vinh Nuyen Van: The English-Vietnamese machine translation system for IWSLT 2015. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.80-83. [PDF, 1.1MB]

(2015) Krzysztof Wolk & Krzysztof Marasek: PJAIT systems for the IWSLT 2015 evaluation campaign enhanced by comparable corpora. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.101-104. [PDF, 2.9MB]

(2013) Hien Vu Huy, Phuong-Thai Nguyen, Tung-Lam Nguyen, & M.L.Nguyen: Bootstrapping phrase-based statistical machine translation via WSD integration. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1042-1046. [PDF, 231KB]

(2012) Thi Thanh Thao Phan, & Izabella Thomas: E-V MT of proper names: error analysis and some proposed solutions. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #530, 1p. [HTML]

(2012) Quoc Hung-Ngo & Werner Winiwarter: A visualizing annotation tool for semi-automatically building a bilingual corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.67-74. [PDF, 820KB]

Estonian

(2010) Einar Meister, Jaak Vilo & Neeme Kahusk: National programme for Estonian language technology: a pre-final summary. Human Language Technologies—The Baltic Perspective, 4th International Conference ,Riga, Latvia, October 7-8, 2010; 24 slides [PDF of PPT, 543KB]

Estonian to English

(2010) Mark Fishel & Harri Kirik: Linguistically motivated unsupervised segmentation for machine translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1741-1745. [PDF, 308KB]

(2010) Maxim Khalilov, Lauma Pretkalniņa, Natalja Kuvaldina, & Veronika Pereseina: SMT of Latvian, Lithuanian and Estonian languages: a comparative study. Human Language Technologies – the Baltic Perpective,  International Conference, Riga, Latvia, October 8, 2010; 8pp. [PDF, 145KB]