Machine Translation Archive

Index to linguistic categories, features, grammars and processes

Publications since 2010

For other periods go to: publications 2005-2009; publications 2000-2004; publications 1990-1999; publications 1970-1989; publications before 1970

To return to home page click here

Adjectives

(2012) Gerhard Kremer, Matthias Hartung, Sebastian Padó & Stefan Riezler: Statistical machine translation support improves human adjective translation. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.103-126. [PDF, 266KB]

(2011) Alex Rudnick: Towards cross-language word sense disambiguation for Quechua. [RANLP 2011] Proceedings of the Student Research Workshop associated with RANLP 201, Hissar, Bulgaria, 13 September 2011; pp.133-138. [PDF, 125KB]

Affixes

(2013) Zhiyang Wang, Yajuan Lü, Meng Sun, & Qun Liu: Stem translation with affix-based rule selection for agglutinative languages. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.364-369. [PDF, 667KB]

Agglutinative languages

(2013) Zhiyang Wang, Yajuan Lü, Meng Sun, & Qun Liu: Stem translation with affix-based rule selection for agglutinative languages. rdCL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.364-369. [PDF, 667KB]

(2012) M.Talha Çakmak, Süleyman Acar, & Gülşen Eryiğit: Word alignment for English-Turkish language pair.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2177-2180. [PDF, 577KB]

(2012) Thoudam Doren Singh: Addressing some issues of data sparsity towards improving English-Manipuri SMT using morphological information. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.46-54. [PDF, 454KB]

(2012) Gábor Prószéky: Language technology methods inspired by an agglutinative, free-phrase-order language [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.26-27. [PDF]

(2012) Loganathan Ramasamy, Zdeněk Žabokrtský, & Sowmya Vajjala: The study of effect of length in morphological segmentation of agglutinative languages. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.18-24. [PDF, 151KB]

(2012) Annette Rios & Anne Göhring: A tree is a Baum is an árbol is a sach’a: creating a trilingual treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1874-1879. [PDF, 360KB]

(2011) Zhiyang Wang, Yajuan Lü, & Qun Liu: Multi-granularity word alignment and decoding for agglutinative language translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.360-367. [PDF, 169KB]

Analysis see Parsing; Semantic analysis; Syntactic analyis

Anaphora resolution

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Christian Hardmeier: Discourse in statistical machine translation [abstract]. PhD thesis, Uppsala University, 2014; 2pp. [PDF, 15KB]

(2013) Christian Hardmeier, Jörg Tiedemann & Joakim Nivre: Latent anaphora resolution for cross-lingual pronoun prediction. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.380-391. [PDF, 274KB]

 (2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

 (2013) Eric Wehrli & Luka Nerima: Anaphora resolution, collocations and translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.12-17. [PDF, 202KB]

 (2012) Chen Chen & Vincent Ng: Combining the best of two worlds: a hybrid approach to multilingual coreference resolution. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.56-63. [PDF, 378KB]

(2012) Baoli Li: Learning to model multilingual unrestricted coreference in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.129-135. [PDF, 130KB]

(2012) Xinxin Li, Xuan Wang, & Wingwei Liao: Simple maximum entropy models for multilingual coreference resolution. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.83-87. [PDF, 205KB]

(2012) Hirotoshi Taira, Katsuhito Sudoh, & Masaaki Nagata: Zero pronoun resolution can improve the quality of J-E translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.111-118. [PDF, 249KB]

(2012) Olga Uryupina, Alessandro Moschitti, & Massimo Poesio: BART goes multilingual: the UniTN/Essex submission to the CoNLL-2012 shared task. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.122-128. [PDF, 122KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2012) Hao Xiong & Qun Liu: ICT: system description for CoNLL-2012. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.71-75. [PDF, 557KB]

(2012) Desislava Zhekova, Sandra Kübler, Joshua Bonner, Marwa Ragheb, & Yu-Yin Hsu: UBIU for multilingual coreference resolution in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.88-94. [PDF, 113KB]

(2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2011) Sergey Kulikov: What is web-based machine translation up to? Tralogy, Paris, 3-4 March 2011; 11pp. [PDF, 131KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Ronan Le Nagard & Philipp Koehn: Aiding pronouns translation with co-reference resolution. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 252-261. [PDF, 146KB]

Annotation and features [see also Linguistic features]

(2014) Vicent Alabau & Luis A.Leiva: Collaborative web UI localization, or how to build feature-rich multilingual datasets. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp..151-154. [PDF, 352KB]

(2014) Eleftherios Avramidis, Aljoscha Burchardt, Sabine Hunsicker, Maja Popović, Cindy Tscherwinka, David Vilar, & Hans Uszkoreit: The TaraXŰ corpus of human-annotated  machine translations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 2679-2682. [PDF, 98KB]

(2014) Ann Bies, Justin Mott, Seth Kulick, Jennifer Garland, & Colin Warner: Incorporating alternate translations into English Translation Treebank.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1863-1868. [PDF, 339KB]

(2014) Eunah Cho, Sarah Fünfer, Sebastian Stüker, & Alex Waibel: A corpus of spontaneous speech in lectures: the KIT lecture corpus for spoken language processing and translation.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1554-1559. [PDF, 146KB]

(2014) Mikel L.Forcada: On the annotation of TMX translation memories for advanced leveraging in computer-aided translation.LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4374-4378. [PDF, 123KB]

(2014) Miriam Kaeshammer & Anika Westburg: On complex word alignment configurations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1773-1780. [PDF, 171KB]

(2014) Chi-kiu Lo & Dekai Wu: On the reliability and inter-annotator agreement of human semantic MT evaluation via HMEANT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.602-607. [PDF, 279KB]

(2014) Arle Lommel, Aljoscha Burchardt, Maja Popović, Kim Harris, Eleftherios Avramidis, & Hans Uszkoreit: Using a new analytic measure for the annotation and analysis of MT errors on real data. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.165-172. [PDF, 1781KB]

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2014) Marco Turchi & Bruno Kessler: Automatic annotation of machine translation datasets with binary quality  judgements.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1788-1792. [PDF, 99KB]

(2013) Heike Adel, Ngoc Thang Vu, & Tanja Schultz: Combination of recurrent neural networks and factored language models for code-switching language modeling. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.206-211. [PDF, 210KB]

(2013) Colin Cherry: Improved reordering for phrase-based translation using sparse features. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.22-31. [PDF, 187KB]

(2013) Trevor Cohn & Lucia Specia: Modelling annotator bias with multi-task Gaussian processes: an application to machine translation quality estimation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp. 32-42. [PDF, 265KB]

(2013) Pedro Luís Díez Orzas: ITS 2.0 automated translation of natural language content: multilingual web linguistic technology. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 9pp. [PDF, 417KB]; presentation, 42 slides [PDF of PPT, 4089KB]

(2013) Vladimir Eidelman, Ke Wu, Ferhan Ture, Philip Resnik & Jimmy Lin: Towards efficient large-scale feature-rich statistical machine translation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.128-133. [PDF, 342KB]

(2013) Spence Green, Sida Wang, Daniel Cer, & Christopher D.Manning: Fast and adaptive online training of feature-rich translation models. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.311-321. [PDF, 363KB]

(2013) Spence Green, Daniel Cer, Kevin Reschke, Rob Voigt, John Bauer, Sida Wang, Natalia Silveira, Julia Neidert & Christopher D. Manning: Feature-rich phrase-based translation: Stanford University’s submission to the WMT 2013 translation task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.148-153. [PDF, 272KB]

(2013) Zhongqiang Huang, Jacob Devlin & Rabih Zbib: Factored soft source syntactic constraints for hierarchical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.556-566. [PDF, 350KB]

(2013) Stéphane Huet, Elena Manishina & Fabrice Lefèvre: Factored Machine Translation Systems for Russian-English. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.154-157. [PDF, 98KB]

(2013) Mikhail Kozhevnikov & Ivan Titov: Cross-lingual transfer of semantic role labeling models. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1190-1200. [PDF, 188KB]

(2013) Uri Lerner & Slav Petrov: Source-side classifier preordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.513-523. [PDF, 132KB]

(2013) Lemao Liu, Tiejun Zhao, Taro Watanabe, & Eiichiro Sumita: Tuning SMT with a large number of features via online feature grouping. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.279-285. [PDF, 153KB]; revised, 7pp. [PDF]

(2013) Thomas Meyer & Lucie Poláková: Machine translation with many manually labeled discourse connectives. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.43-50. [PDF, 307KB]

(2013) Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová, & Eva Hajičová: (Pre-)annotation of topic-focus articulation in Prague Czech-English dependency treebank. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.55-63. [PDF, 990KB]

(2013) Maria Nadejde, Philip Williams, & Philipp Koehn: Edinburgh’s syntax-based machine translation systems. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.170-176. [PDF, 161KB]

(2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

(2013) Kashif Shah, Trevor Cohn, & Lucia Specia: An investigation on the effectiveness of features for translation quality estimation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.167-174. [PDF, 523KB]

(2013) Sara Stymne, Christian Hardmeier, Jörg Tiedemann, & Joakim Nivre: Feature weight optimization for discourse-level MT.  Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.60-69. [PDF, 363KB]

(2013) Marco Turchi, Matteo Negri & Marcello Federico: Coping with the subjectivity of human judgements in MT quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.240-251. [PDF, 472KB]

(2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2013) Darnes Vilariño, David Pinto, Saúl León, Yuridiana Alemán, & Helena Gómez-Adorno: BUAP: N-gram based feature evaluation for the cross-lingual textual entailment task. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.124-127. [PDF, 83KB]

(2013) Jennifer Williams, Rafael Banchs, & Haizhou Li: Meaning unit segmentation in English and Chinese: a new approach to discourse phenomena.  Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.1-9. [PDF, 885KB]

(2013) Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, & Dianhai Yu: Cross-lingual projections between languages from differenr families. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.312-317. [PDF, 187KB]

(2012) Eleftherios Avramidis: Quality estimation for machine translation output using linguistic analysis and decoding features. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.84-90. [PDF, 111KB]

(2012) Eleftherios Avramidis, Marta R.Costa-jussà, Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: A richly annotated, multilingual parallel corpus for hybrid machine translation. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2189-2193. [PDF, 433KB]

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Ondřej Bojar, Bushra Jawaid, & Amir Kamran: Probes in a taxonomy of factored phrase-based models. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.253-260. [PDF, 134KB]

(2012) Cristina Bosco, Manuela Sanguinetti, & Leonardo Lesmo: The Parallel-TUT: a multilingual and multiformat treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3982-3987. [PDF, 460KB]

(2012) Christian Buck: Black box features for the WMT 2012 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.91-95. [PDF, 115KB]

(2012) Colin Cherry & Barry Haddow: Sparse features in Moses. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 10 slides [PDF of PPT, 55KB]

(2012) Jonathan H.Clark, Alon Lavie, & Chris Dyer: One system, many domains: open-domain statistical machine translation via feature augmentation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 396KB]

(2012) Greg Durrett, Adam Pauls, & Dan Klein: Syntactic transfer using a bilingual lexicon. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1-11. [PDF, 286KB]

(2012) Ondřej Dušek, Zdeněk Žabokrtský, Martin Popel, Martin Majliš, Michal Novák, & David Mareček: Formemes in English-Czech deep syntactic MT. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.267-274. [PDF, 316KB]

(2012) Vladimir Eidelman, Jordan Boyd-Graber, & Philip Resnick: Topic models for dynamic translation model adaptation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.115-119. [PDF, 140KB]

(2012) Christian Federmann: Hybrid machine translation using joint, binarised feature vectors. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 198KB]

(2012) Christian Federmann: System combination using joint, binarised feature vectors. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.77-83. [PDF, 116KB]

(2012) Christian Federmann, Eleftherios Avramidis, Marta R.Costa-jussà, Maite Melero, Pavel Pecina, & Josef van Genabith: The ML4HMT workshop on optimising the division of labour in hybrid machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3430-3435. [PDF, 631KB]

(2012) Christian Federmann, Maite Melero, Pavel Pecina, & Josef van Genabith: Towards optimal choice selection for improved hybrid machine translation. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.5-22. [PDF, 268KB]

(2012) Mariano Felice & Lucia Specia: Linguistic features for quality estimation.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.96-103. [PDF, 257KB]

(2012) Mark Fishel, Ondřej Bojar, & Maja Popović: Terra: a collection of translation error-annotated corpora.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.7-14. [PDF, 367KB]

(2012) Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, & Chris Callison-Burch: Joshua 4.0: packing, PRO, and paraphrases. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.283-291. [PDF, 195KB]

(2012) Muhammad Usman Ghani Khan, Rao Muhammad Adeel Nawab, & Yoshihiko Gotoh: Natural language descriptions of visual scenes: corpus generation and analysis. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 38-47. [PDF, 641KB]

(2012) Jesús González Rubio, Alberto Sanchís, & Francisco Casacuberta: PRHLT submission to the WMT12 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.104-108. [PDF, 111KB]

(2012) Liane Guillou: Improving pronoun translation for statistical machine translation. [EACL 2012] Proceedings of the EACL Student Research Workshop, Avignon, France, 26 April 2012; pp. 1-10. [PDF, 373KB]

(2012) Eva Hajičová & Petr Sgall: Formal models and practice of annotation [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.17-18. [PDF]

(2012) Eva Hasler, Barry Haddow, & Philipp Koehn: Sparse lexicalised features and topic adaptation for SMT. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.268-275. [PDF, 630KB]

(2012) Matthias Huck, Stephan Peitz, Markus Freitag, & Hermann Ney: Discriminative reordering extensions for hierarchical phrase-based machine translation. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.313-320. [PDF, 211KB]

(2012) Quoc Hung-Ngo & Werner Winiwarter: A visualizing annotation tool for semi-automatically building a bilingual corpus.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.67-74. [PDF, 820KB]

 (2012) Svetla Koeva: ATLAS – human language technologies integrated within a multilingual web content management system. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.72-76. [PDF, 331KB]

(2012) David Langlois, Sylvain Raybaud, & Kamel Smaïli: LORIA system for the WMT12 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.114-119. [PDF, 96KB]

(2012) Yashar Mehdad, Matteo Negri, & Marcello Federico: Match without a referee: evaluating MT adequacy without reference translations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.171-180. [PDF, 126KB]

(2012) Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui, & Andrea Gesmundo: Machine translation of labeled discourse connectives. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 196KB]

(2012) Dávid Márk Nemeskey & Eszter Simon: Automatically generated NE tagged corpora for Englsih and Hungarian. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.38-46. [PDF, 110KB]

(2012) Danielle L.Mowery, Sumithra Velupillai, & Wendy W.Chapman: Medical diagnosis lost in translation – analysis of uncertainty and negation expressions in English and Swedish clinical texts. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing (BioNLP 2012), Montréal, Canada, June 8, 2012; pp.56-64. [PDF, 288KB]

 (2012) Tsuyoshi Okita: Annotated corpora for word alignment between Japanese and English and its evaluation with MAP-based word aligner.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3241-3248. [PDF, 522KB]

(2012) Daniele Pighin, Lluís Formiga, & Lluís Màrquez: A graph-based strategy to streamline translation quality assessments. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 243KB]

(2012) Daniele Pighin, Lluís Màrquez, & Jonathan May: An analysis (and an annotated corpus) of user responses to machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1131-1135. [PDF, 355KB]

(2012) Daniele Pighin, Lluís Màrquez, & Lluís Formiga: The FAUST corpus of adequacy assessments for real-world machine translation output.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.29-35. [PDF, 389KB]

(2012) Matt Post & Juri Ganitkevich: Sparse features in Joshua. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 10 slides [PDF of PPT, 1002KB]

(2012) Daniel Quernheim & Kevin Knight: Towards probabilistic acceptors and transducers for feature structures. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.76-85. [PDF, 180KB]

(2012) Ananthakrishnan Ramanathan & Karthik Visweswariah: A study of word-classing for MT reordering.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3971-3976. [PDF, 409KB]

(2012) Annette Rios & Anne Göhring: A tree is a Baum is an árbol is a sach’a: creating a trilingual treebank.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1874-1879. [PDF, 360KB]

(2012) Raphael Rubino, Jennifer Foster, Joachim Wagner, Johann Roturier, Rasul Samad Zadeh Kaljahi, & Fred Hollowood: DCU-Symantec submission for the WMT 2012 quality estimation task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.138-144. [PDF, 125KB]

(2012) Avneesh Saluja, Ian Lane, & Ying Zhang: Machine translation with binary feedback: a large-margin approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 322KB]

(2012) Patrick Simianer, Stefan Riezler, & Chris Dyer: Joint feature selection in distributed stochastic learning for large-scale discriminative training in SMT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.11-21. [PDF, 281KB]

(2012) Radu Soricut, Nguyen Bach, & Ziyuan Wang: The SDL Language Weaver systems in the WMT12 quality estimation shared task. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.145-151. [PDF, 106KB]

(2012) Radu Soricut & Sushant Narsale: Combining quality prediction and system selection for improved automatic translation output. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.163-170. [PDF, 135KB]

(2012) Antonio Toral, Sudip Kumar Naskar, Federico Gaspari, & Declan Groves: DELiC4MT: a tool for diagnostic MT evaluation over user-defined linguistic phenomena. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.121-131. [PDF, 156KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2012) Rui Wang, Petya Osenova, & Kiril Simov: Linguistically-augmented Bulgarian-to-English statistical machine translation model. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.119-128. [PDF, 163KB]

(2012) Youzheng Wu, Xugang Lu, Hitoshi Yamamoto, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored language model based on recurrent neural network. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2835-2850. [PDF, 3816KB]

(2012) PANACEA (Platform for Automatic, Normalised Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies). [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.90. [PDF, 87KB]

(2012) TOSCA-MP: task-oriented search and content annotation for media production.  [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.208. [PDF, 78KB]

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

(2011) Vamshi Ambati, Sanjika Hewavitharana, Stephan Vogel, & Jaime Carbonell: Active learning with multiple annotations for comparable data classification task. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.69-77. [PDF, 201KB]

(2011) Marilisa Amoia, Kerstin Kunz, & Ekaterina Lapshinova-Koltunski: Discontinuous constituents: a problematic case for parallel corpora annotation and querying. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.2-10. [PDF, 184KB]

(2011) Eleftherios Avramidis: DFKI system combination with sentence ranking at ML4HMT-2011. [LIHMT] ML4HMT 2011: Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation, 19th November, Barcelona Media, Barcelona; pp.99-103. [PDF, 260KB]

(2011) Wilker Aziz, Miguel Rios, & Lucia Specia: Improving chunk-based semantic role labelling with lexical features.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.226-232. [PDF, 99KB]

(2011) Alexandru Ceausu: Rich morpho-syntactic descriptors for factored machine translation with highly inflected languages as target. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 53KB]; presentation: 32 slides [PDF of PPT, 349KB]

(2011) Boxing Chen, Roland Kuhn, George Foster, & Howard Johnson: Unpacking and transforming feature functions: new ways to smooth phrase tables.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.269-275. [PDF, 158KB]

(2011) Christian Chiarcos & Tomaž Erjavec: OWL/DL formalization of the MULTEXT-East morphosyntactic specifications. Proceedings of the Fifth LAW Workshop (LAW V), Portland, Oregon, USA, 23-24 June 2011; pp.11-20. [PDF, 118KB]

(2011) Tagyoung Chung, Licheng Fang, & Daniel Gildea: SCFG latent annotation for machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.151-158. [PDF, 122KB]

(2011) Oliver Čulo, Silvia Hansen-Schirra, Karin Maksymski, & Stella Neumann: Empty links and crossing lines: querying multi-layer annotation and alignment in parallel corpora. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.75-104. [PDF, 923KB]

(2011) Qin Gao & Stephan Vogel: Corpus expansion for statistical machine translation with semantic role label substitution rules. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.294-298. [PDF, 151KB]

(2011) Yifan He, Yanjun Ma, Andy Way, & Josef van Genabith: Rich linguistic features for translation memory-inspired consistent translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.456-463. [PDF, 159KB]

(2011) Philipp Koehn: Factored models for morphology [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; presentation: 49 slides [PDF of PPT, 425KB]

(2011) Chi-kiu Lo & Dekai Wu: A radically simple, effective annotation and alignment methodology for semantic frame based SMT and MT evaluation. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.58-65. [PDF, 898KB]

(2011) Erwin Marsi, André Lynum, Lars Bungum, & Björn Gambäck: Word translation disambiguation without parallel texts. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.66-74. [PDF, 898KB]

(2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

(2011) Kemal Oflazer: Syntax-to-morphology mapping in factored SMT [abstract]. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; presentation: 60 slides [PDF of PPT, 491KB]

(2011) Tsuyoshi Okita & Andy Way: MWE-sensitive word alignment in factored translation model. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 42KB]; presentation: 51 slides [PDF of PPT, 650KB]

(2011) Eva Safar: Machine translation with corpus linguistics and HPSG. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 12 slides [PDF of PPT, 288KB]

(2011) Mara Tsoumari & Georgios Petasis: A new annotation tool for aligned bilingual corpora. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.43-52. [PDF, 184KB]

(2011) Lonneke van der Plas, Paola Merlo, & James Henderson: Scaling up automatic cross-lingual semantic role annotation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.299-304. [PDF, 161KB]

 (2011) Viet Ha-Thuc & Nicola Cancedda: Confidence-weighted learning of factored discriminitave language models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.439-444. [PDF,

(2011) Keiji Yasuda, Hideo Okuma, Masao Utiyama, & Eiichiro Sumita: Annotating data selection for improving machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.269-274. [PDF, 352KB]

(2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) Vamshi Ambati, Stephen Vogel, & Jaime Carbonell: Active learning and crowd-sourcing for machine translation. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2169-2174. [PDF, 436KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, & Christine Piatko: A modality lexicon and its use in automatic tagging. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1402-1407. [PDF, 375KB]

(2010) Yassine Benajiba & Mona Diab: A web application for dialectal Arabic text annotation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 91-98. [PDF, 747KB]

(2010) Michael Bloodgood & Chris Callison-Burch: Bucking the trend: large-scale cost-focused active learning for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.854-864. [PDF, 489KB]

(2010) Simon Carter & Christof Monz: Discriminative syntactic reranking for statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 143KB]

(2010) Mona Diab, Nizar Habash, Owen Rambow, Mohamed Altantawy, & Yassine Benajiba: COLABA: Arabic dialect annotation and processing. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 66-74. [PDF, 747KB]

(2010) Kais Dukes & Nizar Habash: Morphological annotation of Quranic Arabic. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.2530-2536. [PDF, 995KB]

(2010) Achille Falaise, David Rouquet, Didier Schwab, Hervé Blanchon, & Christian Boitet: Ontology driven content extraction using interlingual annotation of texts in the OMNIA project. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.52-60. [PDF, 359KB]

(2010) Christian Federmann: Appraise: an open-source toolkit for manual phrase-based evaluation of translations. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1731-1734. [PDF, 368KB]

(2010) Yvette Graham & Josef van Genabith: Factor templates for factored machine translation models. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.275-282. [PDF, 366KB]

(2010) Spence Green & Christopher D.Manning: Better Arabic parsing: baselines, evaluations, and analysis. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.394-402. [PDF, 148KB]

(2010) Fei Huang & Bing Xiang: Feature-rich discriminative phrase rescoring for SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.492-500. [PDF, 260KB]

(2010) Mitesh M.Khapra, Saurabh Sohoney, Anup Kulkarni, & Pushpak Bhattacharyya: Value for money: balancing annotation effort, lexicon building and accuracy for multilingual WSD. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.555-563. [PDF, 237KB]

(2010) Philipp Koehn, Barry Haddow, Philip Williams, & Hieu Hoang: More linguistic annotation for statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 115-120. [PDF, 101KB]

(2010) Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M.Strassel, & Kazuaki Maeda: Enriching word alignment with linguistic tags. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2189-2195. [PDF, 509KB]

(2010) Chi-kiu Lo & Dekai Wu: Semantic vs. syntactic vs. n-gram structure for machine translation evaluation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.52-60. [PDF, 401KB]

(2010) Lieve Macken: An annotation scheme and gold standard for Dutch-English word alignment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3369-3374. [PDF, 379KB]

(2010) Simon Mille & Leo Wanner: Syntactic dependencies for multilingual and multilevel corpus annotation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1889-1896. [PDF, 395KB]

(2010) Smruthi Mukund, Debanjan Ghosh, & Rohini K.Srihari: Using cross-lingual projections to generate semantic role labeled corpus for Urdu – a resource poor language. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.797-805. [PDF, 300KB]

(2010) Mohamed Outahajala, Lahbib Zekouar, Paolo Rosso, & M.Antònia Martí: Tagging Amazigh with AnCoraPipe. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 52-56. [PDF, 326KB]

(2010) Sergio Penkale, Rejwanul Haque, Sandipan Dandapat, Pratyush Banerjee, Ankit K.Srivastava, Jinhua Du, Pavel Pecina, Sudip Kumar Naskar, Mikel L.Forcada, & Andy Way: MaTrEx: the DCU MT system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 143-148. [PDF, 535KB]

(2010) Jan Niehues & Alex Waibel: Domain adaptation in statistical machine translation using factored translation models. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 581KB]; presentation: 27 slides [PDF, 887KB]

(2010) Narges Sharif Razavian & Stephan Vogel: Fixed length word suffix for factored statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.147-150. [PDF, 268KB]

 (2010) Libin Shen, Bing Zhang, Spyros Matsoukas, Jinxi Xu, & Ralph Weischedel: Statistical machine translation with a factorized grammar.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.616-625. [PDF, 147KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Lonneke van der Plas, Tanja Samardžić, & Paola Merlo: Cross-lingual validity of PropBank in the manual annotation of French. [ACL-2010] Proceedings of the Fourth Linguistic Annotation Workshop, Uppsala, Sweden, 15-16 July 2010; pp.113-117. [PDF, 102KB]

(2010) Deyi Xiong, Min Zhang, Aiti Aw, & Haizhou Li: Linguistically annotated reordering: evaluation and analysis. Computational Linguistics 36 (3), pp. 535-568 [PDF, 745KB]

(2010) Omar F.Zaidan & Juri Ganitkevitch: An enriched MT grammar for under $100. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.93-98. [PDF, 186KB]

Antonyms

(2011) Yuval Marton, Ahmed El Kholy, & Nizar Habash: Filtering antonymous, trend-contrasting, and polarity-dissimilar distributional paraphrases for improving statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.237-249. [PDF, 177KB]

Articles

(2013) Yulia Tsvetkov, Chris Dyer, Lori Levin & Archna Bhatia: Generating English determiners in phrase-based translation with synthetic translation options.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.271-280. [PDF, 291KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

Aspect

(2011) Maria Stambolieva: Parallel corpora in aspectual studies of non-aspect languages. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.31-42. [PDF, 233KB]

Belief representation

(2012) Jungi Kim, Jinseok Nam, & Iryna Gurevych: Learning semantics with deep belief network for cross-language information retrieval. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.579-588. [PDF, 517KB]

(2012) Christophe  Servan & Simon Petitrenaud: Calculation of phrase probabilities for statistical machine translation by using belief functions. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1101-1110. [PDF, 120KB]

Bracketing transduction grammar see Transduction grammars

Capitalization see Written forms

Case grammar and case frames

(2014) Lars Hellan, Dorothee Beermann, Tore Bruland, Mary Esther Kropp Dakubu, Montserrat Marimon: MultiVal – towards a multilingual valence lexicon. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2478-2485. [PDF, 247KB]

(2013) Marion Weller, Alexander Fraser, & Sabine Schulte im Walde: Using subcategorization knowledge to improve case prediction for translation into German.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.593-603. [PDF, 252KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Alex Rudnick: A resource-light approach to learning verb valencies. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 54KB]; presentation: 31 slides [PDF of PPT, 697KB]

(2010) Maria Rosenberg: Lexical representation of agentive nominal compounds in French and Swedish [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

Categorial grammar

(2013) Hala Almaghout & Lucia Specia: A CCG-based quality estimation metric for statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.223-230. [PDF, 426KB]

(2012) Hala Almaghout, Jie Jiang, & Andy Way: Extending CCG-based syntactic constraints in hierarchical phrase-based SMT. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.193-200. [PDF, 228KB]

(2012) Dennis N.Mehay & Chris Brew: CCG syntactic reordering models for phrase-based machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.210-221. [PDF, 184KB]

(2012) Jonathan Weese, Chris Callison-Burch, & Adam Lopez: Using categorial grammar to label translation rules. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.222-231. [PDF, 165KB]

(2012) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: A comparative study of target dependency structures for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.100-104. [PDF, 750KB]

(2012) Yue Zhang, Graeme Blackwood, & Stephen Clark: Syntax-based word ordering incorporating a large-scale language model. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 736-746. [PDF, 155KB]

(2011) Hala Almaghout, Jie Jiang, & Andy Way: CCG contextual labels in hierarchical phrase-based SMT. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.281-288. [PDF, 317KB]; presentation, 25 slides [PDF, 964KB]

 (2011) Pratyush Banerjee, Hala Almaghout, Sudip Naskar, Johann Roturier, Jie Jiang, Andy Way, & Josef van Genabith: The DCU machine translation systems for IWSLT 2011. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.41-48. [PDF, 274KB]

 (2011) Jin-Woo Chung & Jong C.Park: Text parsing for sign language generation with combinatory categorial grammar. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 11pp. [PDF, 420KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: CCG augmented hierarchical phrase-based machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.211-218. [PDF, 653KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Richard Moot: Semi-automated extraction of a wide-coverage type-logical grammar for French. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 60KB]

(2010) Richard Moot: Wide-coverage French syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

Closely related languages

(2014) Yves Scherrer & Benoît Sagot: A language-independent and fully unsupervised approach t lexicon induction and part-of-speech tagging for closely related languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.502-508. [PDF, 184KB]

(2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2012) Vladislav Kuboň: Machine translation among related Slavic languages [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.13-14. [PDF]

(2012) Preslav Nakov & Jörg Tiedemann: Combining word-level and character-level models for machine translation between closely-related languages. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.301-305. [PDF, 122KB]

(2012) Preslav Nakov & Hwee Tou Ng: Improving statistical machine translation for a resource-poor language using related resource-rich languages. Journal of Artificial Intelligence Research 44 (2012); pp.179-222. [PDF, 421KB]

(2012) Hrvoje Peradin & Francis Tyers: A rule-based machine translation system from Serbo-Croatian to Macedonian. In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina España-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.55-63. [PDF, 431KB]

(2012) Daniel Zeman: Data issues of the multilingual translation matrix. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.395-400. [PDF, 225KB]

(2011) Darja Fišer & Nikola Ljubešić: Bilingual lexicon extraction from comparable corpora for closely related languages.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.125-131. [PDF, 95KB]

(2011) Vishal Goyal & Gurpreet Singh Lehal: Hindi to Punjabi machine translation system. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Portland, Oregon, June 19-24, 2011; pp.1-6. [PDF, 247KB]

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

(2011) Preslav Nakov: Reusing parallel corpora between related languages [abstract] AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; 1p. [PDF, 184KB]

(2011) Pim Otte & Francis M.Tyers: Rapid rule-based machine translation between Dutch and Afrikaans. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.153-160. [PDF, 317KB]

(2011) Tihomir Rangelov: Rule-based machine translation between Bulgarian and Macedonian. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.53-59. [PDF, 164KB]

(2010) Mikel L.Forcada, Boyan Ivanov Bonev, Sergio Ortiz Rojas, Juan Antonio Pérez Ortiz, Gema Ramírez Sánchez, Felipe Sánchez Martínez, Carme Armentano-Oller, Marco A.Montava, & Francis M.Tyers: Documentation of the open-source shallow-transfer machine translation platform Apertium; ed. Mireia Ginestí Rosell. Departament de Llenguatges i Sistemes Informàtics, Universitat d’Alacant, March 10, 2010; 214pp. [PDF, 700KB]

(2010) Petr Homola & Vladislav Kuboň: Exploiting charts in the MT between related languages [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

(2010) Jernej Vičič & Petr Homola: Speeding up the implementation process of a shallow transfer machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 624KB]

Code switching see Dialects and registers

Cognates

(2013) Lisa Beinborn, Torsten Zesch & Iryna Gurevych: Cognate production using character-based machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.883-891. [PDF, 228KB]

(2013) Nikola Ljubešić & Darja Fišer: Identifying false friends between closely related languages. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.69-77. [PDF, 342KB]

(2013) Yves Scherrer & Tomaž Erjavec: Modernizing historical Slovene words with character-based SMT. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.58-62. [PDF, 110KB]

(2012) Andrew Finch, Ohnmar Htun, & Eiichiro Sumita: The NICT translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.121-125. [PDF, 621KB]

(2012) João Silva, Luísa Coheur, Ângela Costa, & Isabel Trancoso: Dealing with unknown words in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3977-3981. [PDF, 406KB]

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2011) Mirabela Navlea & Amalia Todiraşcu: Cognate identification for a French-Romanian lexical alignment system: empirical study. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.145-152. [PDF, 271KB]

Coherence and Cohesion see Reference

Collocations see Multiword expressions

Combinatory categorial grammar  see Categorial grammar

Compositionality

(2013) Itsuki Toyota, Zi Long, Lijuan Dong, Takehito Utsuro, & Mikio Yamamoto: Compositional translation of technical terms by integrating patent families as a parallel corpus and a comparable corpus. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.16-23. [PDF, 2058KB]

(2013) Hui Zhang, Kristina Toutanova, Chris Quirk, & Jianfeng Gao: Beyond left-to-right: multiple decomposition structures for SMT. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.12-21. [PDF, 161KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Extraction of domain-specific bilingual lexicon from comparable corpora: compositional translation and ranking. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.745-761. [PDF, 319KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Identification of fertile translations in medical comparable corpora: a morpho-compositional approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 154KB]

Compound nouns see Nouns and noun phrases

Comprehension see Reading and comprehension

Concatenation grammar see Transduction grammars

Conjunctions and connectives

(2013) Najeh Hajlaoui: Are ACT’s scores increasing with better translation quality? WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.408-413. [PDF, 255KB]

(2013) Camillo Lugaresi & Barbara Di Eugenio: Translating Italian connectives into Italian sign language. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.270-280. [PDF, 192KB]

(2013) Thomas Meyer & Bonnie Webber: Implicitation of discourse connectives in (machine) translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.19-26. [PDF, 396KB]

(2013) Thomas Meyer & Lucie Poláková: Machine translation with many manually labeled discourse connectives. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.43-50. [PDF, 307KB]

(2012) Najeh Hajlaoui & Andrei Popescu-Belis: Translating English discourse connectives into Arabic: a corpus-based analysis and an evaluation metric. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.1-8. [PDF, 731KB]

(2012) Iørn Korzen & Morten Gylling: Text structure in a contrastive and translational perspective: on information density and clause linkage in Italian and Danish. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.23-46. [PDF, 601KB]

(2012) Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui, & Andrea Gesmundo: Machine translation of labeled discourse connectives. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 196KB]

(2012) Thomas Meyer & Andrei Popescu-Belis: Using sense-labeled discourse connectives for statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.129-138. [PDF, 275KB]

(2011) Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, & Andrei Popescu-Belis: How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.78-86. [PDF, 153KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

(2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS projectTralogy, Paris, 3-4 March 2011;  17pp. [PDF, 169KB]

Constraint synchronous grammar see Context-free grammar

Context-free grammars

(2013) Miriam Kaeshammer: Synchronous linear context-free rewriting systems for machine translation. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 68-77. [PDF, 213KB]

(2012) Victor Chahuneau, Noah A.Smith, & Chris Dyer: pycdec: a Python interface to cdec. Prague Bulletin of Mathematical Linguistics 98, October 2012; pp.51-61. [PDF, 280KB]

(2012) Yang Feng, Dongdong Zhang, Mu Li, Ming Zhou, & Qun Liu: Hierarchical chunk-to-string translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.950-958. [PDF, 152KB]

(2012) Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, & Chris Callison-Burch: Joshua 4.0: packing, PRO, and paraphrases. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.283-291. [PDF, 195KB]

(2012) Baskaran Sankaran, Majid Razmara, & Anoop Sarkar: Kriya – an end-to-end hierarchical phrase-based MT system. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.83-98. [PDF,

(2012) Patrick Simianer, Stefan Riezler, & Chris Dyer: Joint feature selection in distributed stochastic learning for large-scale discriminative training in SMT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.11-21. [PDF, 281KB]

(2012) Philip Williams & Philipp Koehn: GHKM rule extraction and scope-3 parsing in Moses. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.388-394. [PDF, 122KB]

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

(2011) Tagyoung Chung, Licheng Fang, & Daniel Gildea: SCFG latent annotation for machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.151-158. [PDF, 122KB]

(2011) Qin Gao & Stephan Vogel: Utilizing target-side semantic role labels to assist hierarchical phrase-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.107-115. [PDF, 352KB]

 (2011) Greg Hanneman & Alon Lavie: CMU syntax-based machine translation at WMT 2011. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.365-371. [PDF, 118KB]

 (2011) Greg Hanneman, Michelle Burroughs, & Alon Lavie: A general-purpose rule extractor for SCFG-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.135-144. [PDF, 267KB]

 (2011) Greg Hanneman & Alon Lavie: Automatic category label coarsening for syntax-based machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.98-106. [PDF, 264KB]

(2011) Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, & Michael Riley: Hierarchical phrase-based translation representations. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1373-1383. [PDF, 334KB]

(2011) Markos Mylonakis & Khalil Sima’an: Learning hierarchical translation structure with linguistic annotations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.642-652. [PDF, 165KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

 (2011) Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, & Adam Lopez: Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.478-484. [PDF, 164KB]

(2011) Fai Wong, Francisco Oliveira, Sam Chao, & Chi-Wai Tang: Handheld machine translation system based on constraint synchronous grammar. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.439-446. [PDF, 158KB]

 (2011) Andreas Zollmann & Stephan Vogel: A word-class approach to labeling PSCFG rules for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1-11. [PDF, 230KB]

(2010) Chris Dyer & Philip Resnik: Context-free reordering, finite-state translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.858-866. [PDF, 272KB]

(2010) Chris Dyer, Adam Lopez, Juri Ganitkevitch, Jonathan Weese, Ferhan Ture, Phil Blunsom, Hendra Setiawan, Vladimir Eidelman, & Philip Resnik: cdec: a decoder, alignment, and learning framework for finite-state and context-free translation models. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.7-12. [PDF, 171KB]

(2010) Marc Dymetman & Nicola Cancedda: Intersecting hierarchical and phrase-based models of translation: formal aspects and algorithms. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.1-9. [PDF, 354KB]

 (2010) Zhongqiang Huang, Martin Čmejrek & Bowen Zhou: Soft syntactic constraints for hierarchical phrase-based translation using latent syntactic distributions.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.138-147. [PDF, 506KB]

(2010) Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, & Jason Eisner: Unsupervised discriminative language model training for machine translation using simulated confusion sets. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.656-664. [PDF, 348KB]

(2010) Yang Liu & Qun Liu: Joint parsing and translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.707-715. [PDF, 211KB]

(2010) Adam Lopez: Context-free translation models. Presentation at Fifth Machine Translation Marathon, 13-18 September 2010, University of Le Mans; 160 slides. [PDF, 3817KB]

(2010) Markos Mylonakis & Khalil Sima’an: Learning probabilistic synchronous CFGs for phrase-based translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.117-125. [PDF, 151KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

(2010) Francisco Oliveira, Fai Wong, Iok-Sai Hong & Ming-Chui Dong: Parsing extended constraint synchronous grammar in Chinese-Portuguese machine translation. International Conference on Computational Processing of Portuguese Language (PROPOR 2010), Porto Alegre, Brazil, 2010; 6pp. [PDF, 91KB]

 (2010) Wong Fai & Chao Sam: PCT: Portuguese-Chinese machine translation systems [abstract].  Journal of Translation Studies 13 (1-2), Special issue: The teaching of computer-aided translation, ed. Chan Sin-wai; pp. 181-196.

(2010) Andreas Zollmann & Stephan Vogel: New parameterizations and features for PSCFG-based machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.110-117. [PDF, 181KB]

Contrastive linguistics

(2012) Erich Steiner: Methodological cross-fertilization: empirical methodologies in (computational) linguistics and translation studies. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.3-21. [PDF, 1090KB]

(2011) Oliver Čulo, Silvia Hansen-Schirra, Karin Maksymski, & Stella Neumann: Empty links and crossing lines: querying multi-layer annotation and alignment in parallel corpora. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.75-104. [PDF, 923KB]

Controlled languages

(2015) Rei Miyata, Anthony Hartley, Cécile Paris, Midori Tatsumi, & Kyo Kageura: Japanese controlled language rules to improve machine translatability of municipal documents. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.90-103. [PDF, 466KB]

(2014) Marion Wittkowsky: Integrating machine translation (MT) in the higher education of translators and technical writers. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.100-108. [PDF, 158KB]

(2013) Maarit Koponen: This translation is not too bad: an analysis of post-editor choices in a machine-translation post-editing task. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 1-9. [PDF, 148KB]

(2013) Aarne Ranta: MOLTO – Multilingual On Line Translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.429. [PDF,  182KB]

(2013) Melanie Siegel: Authoring support for controlled language and machine translation: a report from practice. Translation: Computation, Corpora,  Cognition 3 (1), June 2013; pp.49-60. [PDF, 411KB]

(2012) Florence Beaujard: Linguistics activities of Airbus design office. META-FORUM, Brussels, June 19-21, 2012; 7 slides. [PDF of PPT, 1355KB]

(2012) Stephen Doherty: Investigating the effects of controlled language on the reading and comprehension of machine translated texts: a mixed-methods approach. Thesis sumitted for the degree of Doctor of Philosophy, … Dublin City University, January 2012; xiii, 275pp. and appendices. [PDF, 3456KB]

(2012) Anthony Hartley, Midori Tatsumi, Hitoshi Isahara, Kyo Kageura, & Rei Miyata: Readability and translatability judgments for ‘Controlled Japanese’. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.237-244 [PDF, 465KB]

(2012) Sabine Lehmann, Siu Ke Pepe Lo, Ben Gottesmann, Melanie Siegel, Robert Grabowski, Frederik Fouvry, & Mayo Kudo: Applying CNL authoring support to improve machine translation of forum data. CNL-2012. 3rd Workshop on Controlled Natural Language, 29-31 August, Zurich; 31 slides. [PDF of PPT, 429KB]

(2012) Smruthi Mukund & Rohini K.Srihari: Analyzing Urdu social media for sentiments using transfer learning with controlled translations. Proceedings of the 2012 Workshop on Language in Social Media (LSM 2012), Montréal, Canada, June 7, 2012; pp.1-8. [PDF, 393KB]

(2012) Laura Ramírez Polo: Use and evaluation of controlled languages in industrial environments and feasibility study for the implementation of machine translation. PhD thesis, Universidad de Valencia, June 2012; 504pp. [PDF, 6696KB]

(2012) Johann Roturier, Linda Mitchell, Robert Grabowski & Melanie Siegel: Using automatic machine translation metrics to analyze the impact of source reformulations. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 509KB]

(2012) Midori Tatsumi, Anthony Hartley, Hitoshi Isahara, Kyo Kageura, Toshio Okamoto, & Katsumasa Shimizu: Building translation awareness in occasional authors:a user case from Japan. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.53-56. [PDF, 264KB]

 (2010) Aarne Ranta: Multilingual packages of controlled languages: an introduction to GF. CNL2010: 2nd Workshop on Controlled Natural Language, Marettimo Island, Sicily, September 13-15, 2010; 83 slides [PDF of PPT, 1140KB]

(2010) Aarne Ranta, Krasimir Angelov, & Thomas Hallgren: Tools for multilingual grammar-based translation on the web. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.66-71. [PDF, 313KB]

 (2010) Irina Temnikova: Cognitive evaluation approach for controlled language post-editing experiment. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3485-3490. [PDF, 631KB]

Conversation see Dialogue

Coordination

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

Decipherment

(2013) Taylor Berg-Kirkpatrick & Dan Klein: Decipherment with a million random restarts. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.874-878. [PDF, 115KB]

(2013) Bozhidar Bozhanov & Ivan Derzhanski: Rosetta Stone linguistic problems. [ACL 2013] Proceedings of the Fourth Workshop on Teaching Natural Language Processing, Sofia, Bulgaria, August 4-9 2013; pp.1-8. [PDF, 92KB]

(2013) Qing Dou & Kevin Knight: Dependency-based decipherment for resource-limited machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1668-1676. [PDF, 153KB]

(2013) Kevin Knight: Decipherment. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Tutorials, Sofia, Bulgaria, August 4-9 2013; pp.3-4. [PDF, 79KB]

(2013) Malte Nuhn, Julian Schamper & Hermann Ney: Beam search for solving substitution ciphers.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1568-1576. [PDF, 284KB]

(2013) Malte Nuhn & Hermann Ney: Decipherment complexity in 1:1 substitution ciphers.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.615-621. [PDF, 213KB]

(2013) Sujith Ravi: Scalable decipherment for machine translation via hash sampling.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.362-271. [PDF, 276KB]

(2012) Qing Dou & Kevin Knight: Large scale decipherment for out-of-domain machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.266-275. [PDF, 578KB]

(2011) Jacob Aron: Cracking the code of machine translation. New Scientist, vol.210, issue 2817, June 2011; 2pp.

(2011) Taylor Berg-Kirkpatrick & Dan Klein: Simple effective decipherment via combinatorial optimization. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.313-321. [PDF, 299KB]

(2011) Kevin Knight, Beáta Megyesi & Christiane Schaefer: The Copiale cipher. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.110-119. [PDF, 657KB]

(2011) John Markoff: Translation tool cracked a 1700s code. New York Times, November 13, 2011. [PDF, 65KB]

(2011) Sujith Ravi & Kevin Knight: Deciphering foreign language. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.12-21. [PDF, 225KB]

(2010) Eric Corlett & Gerald Penn: An exact A* method for deciphering letter-substitution ciphers. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1040-1047. [PDF, 132KB]

(2010) Benjamin Snyder, Regina Barzilay, & Kevin Knight: A statistical model for lost language decipherment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1048-1057. [PDF, 764KB]

Definitions

(2013) Yulan Yan, Chikara Hashimoto, Kentaro Torisawa, Takao Kawai, Jun’ichi Kazama, & Stijn de Saeger: Minimally supervised method for multilingual paraphrase extraction from definition sentences on the web. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.63-73. [PDF, 381KB]

Dependency relations

(2014) Kurt Eberle: AutoLearn<Word>. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.145-154. [PDF, 446KB]

(2014) Weston Feely, Mehdi Manshadi, Robert Frederking, & Lori Levin: The CMU METAL Farsi NLP approach. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4052-4055. [PDF, 206KB]

(2014) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, Aaron Smith, & Joakim Nivre: Anaphora models and reordering for phrase-based SMT. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.122-129. [PDF, 184KB]

(2014) Hideki Isozaki, Natsume Kouchi, & Tsutomu Hirao: Dependency-based automatic enumeration of semantically equivalent word orders for evaluating Japanese translations. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.287-292. [PDF, 264KB]

(2014) Xiaofeng Wu, Hui Yu, & Qun Liu: RED: DCU-CASICT participation in WMT2014 metrics task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.420-425. [PDF, 301KB]

(2013) Fabienne Cap, Marion Weller, & Ulrich Heid: Using a rich feature set for the identification of German MWEs. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.34-42. [PDF, 308KB]

(2013) Qing Dou & Kevin Knight: Dependency-based decipherment for resource-limited machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1668-1676. [PDF, 153KB]

(2013) Melania Duma, Cristina Vertan, & Wolfgang Menzel: A new syntatctic metric for evaluation of machine translation.  ACL Student Research Workshop, Sofia, Bulgaria, August 4-9 2013; pp.130-135. [PDF, 346KB]

(2013) Yvette Graham: A dependency-constrained hierarchical model with Moses. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.464-471. [PDF, 341KB]

(2013) Nathan David Green & Zdeněk Žabokrtský: Improvements to syntax-based machine translation using ensemble dependency parsers. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 19-24. [PDF, 202KB]

(2013) Dan Han, Pascual Martínez-Gómez, Yusuke Miyao, Katsuhito Sudoh, & Masaaki Nagata: Using unlabeled dependency parsing for pre-ordering for Chinese-to-Japanese statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp. 25-33. [PDF, 334KB]

(2013) Shuhei Kondo, Kevin Duh & Yuji Matsumoto: Hidden Markov tree model for word alignment. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.503-511. [PDF, 531KB]

(2013) Junhui Li, Philip Resnik, & Hal Daumé III: Modeling syntactic and semantic structures in hierarchical phrase-based translation. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.540-549. [PDF, 224KB]

(2013) Yang Liu: A shift-reduce parsing algorithm for phrase-based string-to-dependency translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1-10. [PDF, 290KB]

(2013) Wei-Yun Ma & Kathleen McKeown: Using a supertagged dependency language model to select a good translation in system combination. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.433-438. [PDF, 200KB]

(2013) Fandong Meng, Jun Xie, Linfeng Song, Yajuan Lü, & Qun Liu: Translation with source constituency and dependency trees. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1066-1076. [PDF, 341KB]

(2013) Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová, & Eva Hajičová: (Pre-)annotation of topic-focus articulation in Prague Czech-English dependency treebank. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.55-63. [PDF, 990KB]

(2013) Antonio Valerio Miceli Barone, & Giuseppe Attardi: Pre-reordering for machine translation using transition-based walks on dependency parse trees. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.164-169. [PDF, 207KB]

(2013) Jan Niehues & Alex Waibel: An MT error-driven discriminative word lexicon using sentence structure features. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.512-520. [PDF, 189KB]

(2013) Oscar Täckström, Ryan McDonald, & Joakim Nivre: Target language adaptation of discriminative transfer parsers. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.1061-1071. [PDF, 258KB]

(2013) Akihiro Tamura, Taro Watanabe, Eiichiro Sumita, Hiroya Takamura, & Manabu Okumura: Part-of-speech induction in dependency trees for statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.841-851. [PDF, 769KB]

(2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2013) Marion Weller, Alexander Fraser, & Sabine Schulte im Walde: Using subcategorization knowledge to improve case prediction for translation into German.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.593-603. [PDF, 252KB]

(2013) Xiaofeng Wu, Hui Yu & Qun Liu: DCU participation in WMT2013 metrics task. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.435-439. [PDF, 175KB]

(2013) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Handling ambiguities of bilingual predicate-argument structures for statistical machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1127-1136. [PDF, 269KB]

(2012) Ondřej Bojar & Dekai Wu: Towards a predicate-argument evaluation for MT. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.30-38. [PDF, 126KB]

(2012) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: EBMT system of Kyoto University in OLYMPICS task at IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.96-101. [PDF, 771KB]

(2012) Greg Durrett, Adam Pauls, & Dan Klein: Syntactic transfer using a bilingual lexicon. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1-11. [PDF, 286KB]

(2012) Minwei Feng, Weiwei Sun, & Hermann Ney: Semantic cohesion model for phrase-based SMT. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.867-878. [PDF, 227KB]

(2012) Michael Gasser: Toward a rule-based system for English-Amharic translation. [SALTMIL 2012] Workshop on Language Technology for Normalisation of Less-Resourced Languages, 22 May 2012, Istanbul, Turkey; pp.41-46. [PDF, 319KB]

(2012) Ryan Georgi, Fei Xia, & William D.Lewis: Measuring the divergence of dependency structures cross-linguistically to improve syntactic projection algorithms.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.771-778. [PDF, 1013KB]

(2012) Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučiková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiři Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, & Zdeněk Žabokrtský: Announcing Prague Czech-English dependency treebank 2.0.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3153-3160. [PDF, 544KB]

(2012) Eva Hajičová & Petr Sgall: Formal models and practice of annotation [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.17-18. [PDF]

(2012) Christian Hardmeier, Joakim Nivre, & Jörg Tiedemann: Tree kernels for machine translation quality estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.109-113. [PDF, 423KB]

(2012) Andreas Kirkedal: Tree-based hybrid machine translation.  EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.77-86. [PDF, 260KB]

(2012) Hwidong Na & Jong-Hyeok Lee: Forest-to-string translation using binarized dependency forest for IWSLT 2012 OLYMPICS task. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.130-135. [PDF, 787KB]

(2012) Toshiaki Nakazawa & Sadao Kurohashi: Alignment by bilingual generation and monolingual derivation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1963-1978. [PDF, 1172KB]

(2012) Daniele Pighin, Meritxell González, & Lluís Màrquez: The UPC submission to the WMT 2012 shared task on quality estimation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.127-132. [PDF, 123KB]

(2012) Ariya Rastrow, Mark Dredze, & Sanjeev Khudanpur: Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.175-183. [PDF, 419KB]

(2012) Rudolf Rosa, David Mareček, & Ondřej Dušek: DEPFIX: a system for automatic correction of Czech MT outputs. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.362-368. [PDF, 114KB]

(2012) Rudolf Rosa, Ondřej Dušek, David Mareček, & Martin Popel: Using parallel features in parsing of machine-translated sentences for correction of grammatical errors. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.39-48. [PDF, 186KB]

(2012) Amit Sangodkar & Om P.Damani: Re-ordering source sentences for SMT.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2164-2171. [PDF, 649KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2012) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: A comparative study of target dependency structures for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.100-104. [PDF, 750KB]

(2012) Deyi Xiong, Min Zhang, & Haizhou Li: Modeling the translation of predicate-argument structure for SMT. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.902-911. [PDF, 172KB]

(2012) Feifei Zhai, Jiajun Zhang, Yu Zhou, & Chengqing Zong: Machine translation by modelling predicate-argument structure transformation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.3019-3036. [PDF, 1473KB]

(2012) Yue Zhang & Joakim Nivre: Analyzing the effect of global learning and beam-search on transition-based dependency parsing. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.1391-1400. [PDF, 282KB]

(2011) Daniel Andrade, Takuya Matsuzaki, & Jun’ichi Tsujii: Learning the optimal use of dependency-parsing information for finding translations with comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.10-18. [PDF, 651KB]

 (2011) Giuseppe Attardi, Atanas Chanev, & Antonio Valerio Miceli Barone: A dependency based statistical translation model. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.79-87. [PDF, 174KB]

(2011) Igor Boguslavsky: Processing of Russian by the ETAP-3 linguistic processor. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 110KB]; presentation: 48 slides [PDF of PPT, 227KB]

 (2011) Wenliang Chen, Jun’ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, & Haizhou Li: SMT helps bitext dependency parsing. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.73-83. [PDF, 583KB]

 (2011) Chenhui Chu, Toshiaki Nakazawa, & Sadao Kurohashi: Japanese-Chinese phrase alignment using common Chinese characters information. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.464-471. [PDF, 347KB]

(2011) Michael Gasser: Towards synchronous extensible dependency grammar. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.3-10. [PDF, 265KB]

 (2011) Kevin Gimpel & Noah A.Smith: Quasi-synchronous phrase dependency grammars for machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.474-485. [PDF, 360KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

 (2011) Wenjing Lang, Qiaoli Zhou, Guiping Zhang, & Dongfeng Cai: Multi-stage Chinese dependency parsing based on dependency direction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.64-71. [PDF, 128KB]

(2011) Ryan McDonald, Slav Petrov, & Keith Hall: Multi-source transfer of delexicalized dependency parsers. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.62-72. [PDF, 284KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

(2011) Hwidong Na & Jong-Hyeok Lee: Multi-word unit dependency forest-based translation rule extraction. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.41-51. [PDF, 466KB]

(2011) Toshiaki Nakazawa & Sadao Kurohashi: Bayesian subtree alignment model based on dependency trees. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.794-802. [PDF, 790KB]

(2011) Jan-Thorsten Peter, Matthias Huck, Hermann Ney, & Daniel Stein: Soft string-to-dependency hierarchical machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.246-253.[PDF, 249KB]

(2011) Martin Popel, David Mareček, Nathan Green, & Zdeněk Žabokrtský: Influence of parser choice on dependency-based MT. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.433-439. [PDF, 113KB]

(2011) Manuela Sanguinetti & Cristina Bosco: Building the multilingual TUT parallel treebank. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.19-28. [PDF, 316KB]

(2011) Anders Søgaard & Martin Haulrich: Sentence-level instance-weighting for graph-based and transition-based dependency parsing. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.43-47. [PDF, 86KB]

(2011) Anders Søgaard: Data point selection for cross-language adaptation of dependency parsers. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.682-686. [PDF, 124KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2011) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Extracting pre-ordering rules from chunk-based dependency trees for Japanese-to-English translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.300-307. [PDF, 859KB]

(2011) Jun Xie, Haitao Mi & Qun Liu: A novel dependency-to-string model for statistical machine translation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.216-226. [PDF, 992KB]

 (2010) Samuel Brody: Unsupervised dependency parsing via word alignment. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.1214-1222. [PDF, 344KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.83-91. [PDF, 510KB]

(2010) Thoudam Doren Singh & Sivaji Bandyopadhyay: Statistical machine translation of English-Manipuri using morpho-syntactic and semantic information. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 680KB]

(2010) Michael Gasser: A dependency grammar for Amharic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, 17 May 2010, Valletta, Malta; 9pp. [PDF, 179KB]

(2010) Yvette Graham: Sulis: an open source transfer decoder for deep syntactical statistical machine translation. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.17-26. [PDF, 705KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Yifan He, Jinhua Du, Andy Way, & Josef van Genabith: The DCU dependency-based metric in WMT-MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 349-353. [PDF, 444KB]

 (2010) Elena Irimia & Alexandru Ceausu: Dependency-based translation equivalents for factored machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 60KB]

(2010) Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee: Chinese syntactic reordering through contrastive analysis of predicate-predicate patterns in Chinese-to-Korean SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 817KB]

(2010) Yanjun Ma & Andy Way: HMM word-to-phrase alignment with dependency constraints. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.101-109. [PDF, 191KB]

(2010) David Mareček, Martin Popel, & Zdeněk Žabokrtský: Maximum entropy translation model in dependency-based MT framework. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 201-206. [PDF, 254KB]

(2010) Yuval Marton, Nizar Habash, & Owen Rambow: Improving Arabic dependency parsing with lexical and inflectional morphological features. Proceedings of the NAACL HLT 2010 First Workshop on Statistical Parsing of Morphologically-Rich Languages, pages 13–21, Los Angeles, CA, USA, June 2010; pp.13-21. [PDF, 187KB]

(2010) Haitao Mi & Qun Liu: Constituency to dependency translation with forests. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1433-1442. [PDF, 190KB]

(2010) Simon Mille & Leo Wanner: Syntactic dependencies for multilingual and multilevel corpus annotation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1889-1896. [PDF, 395KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

(2010) Libin Shen, Jinxi Xu, & Ralph Weischedel: String-to-dependency statistical machine translation. Computational Linguistics 36 (4), pp. 649-671 [PDF, 233KB]

(2010) Drahomira (Johanka) Spoustová & Miroslav Spousta: Dependency parsing as a sequence labeling task. Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.7-14. [PDF, 271KB]

(2010) Kathrin Spreyer: Notes on the evaluation of dependency parsers obtained through cross-lingual projection. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1176-1184. [PDF, 114KB]

(2010) Kathrin Spreyer, Lilja Øvrelid, & Jonas Kuhn: Training parsers on partial trees: a cross-language comparison. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1978-1985. [PDF, 285KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

(2010) Zhaopeng Tu, Yang Liu, Young-Sook Hwang, Qun Liu, & Shouxun Lin: Dependency forest for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1092-1100. [PDF, 215KB]

(2010) Sriram Venkatapathy, Rajeev Sangal, Aravind Joshi, & Karthik Gali: A discriminative approach to dependency based statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.66-74. [PDF, 187KB]

(2010) Zhiyang Wang, Yajuan Lü, Qun Liu, & Young-Sook Hwang: Better filtration and augmentation for hierarchical phrase-based translation rules. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.142-146. [PDF, 135KB]

Determiners  see Articles

Dialects and registers

(2015) Mihaela Vela & Ekaterina Lapshinova-Koltunski: Register-based machine translation evaluation with text classification techniques. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.215-228. [PDF, 506KB]

(2014) Anik Dey & Pascale Fung: A Hindi-English code-switching corpus. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2410-2413. [PDF, 427KB]

(2014) Saab Mansour, Yaser Al-Onaizan, Graeme Blackwood, & Christoph Tillmann: Automatic dialect classification for statistical machine translation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp. 342-355. [PDF, 588KB]

(2014) Fatiha Sadat & Atefeh Farzindar: The ASMAT project – Arabic social media analysis tools. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.130. [PDF, 296KB]

(2013) Heike Adel, Ngoc Thang Vu, & Tanja Schultz: Combination of recurrent neural networks and factored language models for code-switching language modeling. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.206-211. [PDF, 210KB]

(2013) Pierrette Bouillon, Johanna Gerlach, Ulrich Germann, Barry Haddow & Manny Rayner: Two approaches to correcting homophone confusion in a hybrid machine translation system. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.109-116. [PDF, 298KB]

(2013) Rahma Boujelbane, Mariem Ellouze Khemekhem, Siwar BenAyed, & Lamia Hadrich Belguith: Building bilingual lexicon to create dialect Tunisian corpora and adapt language model. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.88-93. [PDF, 454KB]

(2013) Heba Elfardy & Mona Diab: Sentence level dialect identification in Arabic. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.456-461. [PDF, 211KB]

(2013) Ahmed Hamdi, Rahma Boujelbane, Nizar Habash, & Alexis Nasr: The effects of factorizing root and pattern mapping in translating between Tunisian Arabic and standard Arabic. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.125-134. [PDF, 672KB]

(2013) Chi-kiu Lo & Dekai Wu: Can informal genres be better translated by tuning on automatic semantic metrics? Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.93-100. [PDF, 598KB]

(2013) Hassan Sajjad, Kareem Darwish, & Yonatan Belinkov: Translating dialectal Arabic to English. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.1-6. [PDF, 182KB]

(2013) Wael Salloum & Nizar Habash: Dialectal Arabic to English machine translation: pivoting through Modern Standard Arabic. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.348-358. [PDF, 158KB]

(2013) Elke Teich, Stefania Degaetano-Ortlieb, Hannah Kermes, & Ekaterina Lapshinova-Koltunski: Scientific registers and disciplinary diversification: a comparable corpus approach. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.59-68. [PDF, 388KB]

(2012) Chris Irwin Davis: Tajik-Farsi Persian transliteration using statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3988-3995. [PDF, 590KB]

(2012) Heba Elfardy & Mona Diab: AIDA: automatic identification and glossing of dialectal Arabic. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.83. [PDF, 148KB]

(2012) Nizar Habash, Ramy Eskander, & Abdelati Hawwari: A morphological analyzer for Egyptian Arabic. SIGMORPHON-2012. Proceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology, Montréal, Canada, June 7, 2012; pp.1-9. [PDF, 147KB]

(2012) Manny Rayner, Pierrette Bouillon, & Barry Haddow: Using source-language transformations to address register mismatches in SMT. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 76KB]

(2012) Wael Salloum & Nizar Habash: Elissa: a dialectal to standard Arabic machine translation system. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 385-392. [PDF, 181KB]

(2012) Amira Shoukry & Ahmed Rafea: Preprocessing Egyptian dialect tweets for sentiment mining. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.47-56. [PDF, 1193KB]

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

(2011) Michael Paul, Andrew Finch, Paul R.Dixon, & Eiichiro Sumita: Dialect translation: integrating Bayesian co-segmentation models with pivot-based SMT. [EM NLP 2011] DIALECTS2011: Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, Edinburgh, Scotland, UK, July 31, 2011; pp.1-9. [PDF, 210KB]

 (2011) Wael Salloum & Nizar Habash: Dialectal to standard Arabic paraphrasing to improve Arbaic-English statistical machine translation. [EMNLP 2011] DIALECTS2011: Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, Edinburgh, Scotland, UK, July 31, 2011; pp.10-21. [PDF, 186KB]

(2010) Yassine Benajiba & Mona Diab: A web application for dialectal Arabic text annotation. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 91-98. [PDF, 747KB]

(2010) Mona Diab, Nizar Habash, Owen Rambow, Mohamed Altantawy, & Yassine Benajiba: COLABA: Arabic dialect annotation and processing. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 66-74. [PDF, 747KB]

 (2010) Kathleen Egan: Cross lingual Arabic blog alerting (COLABA). AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Hassan Sawaf: Arabic dialect handling in hybrid machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 146KB]

Dialogue

(2015) William Lewis, Christian Federmann, & Ying Xin: Applying cross-entropy difference for selecting parallel training data from publicly available sources for conversational machine translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.126-134. [PDF, 3.5MB]

(2015) William Lewis: Skype Translator: breaking down language and hearing barriers. A behind the scenes look at near real-time speech translation. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp. 58-65. [PDF, 758KB]

(2015) Tanvi Surti: Designing user experience for machine translated conversations. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; pp.220-223. [PDF, 960KB]

(2013) Sankaranarayanan Ananthakrishnan, Sanjika Hewavitharana, Rohit Kumar, Enoch Kan, Rohit Prasad, & Prem Natarjan: Semi-supervised word sense disambiguation for mixed-initiative conversational spoken language translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp. 43-50. [PDF, 536KB]

(2013) Sankaranarayanan Ananthakrishnan, Wei Chen, Rohit Kumar, & Dennis Mehay: Source aware phrase-based decoding for robust conversational spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 222KB]

(2013) Seppo Enarvi & Mikko Kurimo: Studies on training text selection for conversational Finnish language modeling. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 210KB]

(2013) Sanjika Hewavitharana, Dennis N.Mehay, Sankaranarayanan Ananthakrishnan, & Prem Natarajan: Incremental topic-based translation model adaptation for conversational spoken language translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.697-701. [PDF, 279KB]

(2012) M.Federico, M.Cettolo, L.Bentivogli, M.Paul, & S.Stüker: Overview of the IWSLT 2012 evaluation campaign. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.12-33. [PDF, 772KB]; presentation: 44 slides [PDF of PPT, 3701KB]

(2012) Shixiang Lu, Wei Wei, Xiaoyin Fu, & Bo Xu: Translation model based cross-lingual language model adaptation: from word models to phrase models. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.512-522. [PDF, 196KB]

 (2011) Avneesh Saluja, Ian Lane, & Ying Zhang: Context-aware language modeling for conversational speech translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.97-104. [PDF, 177KB]

 (2011) Anne H.Schneider & Saturnino Luz: Speaker alignment in synthesised, machine translation communication. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.254-260. [PDF, 231KB]

(2010) Sherri Condon, Dan Parvaz, John Aberdeen, Christy Doran, Andrew Freeman, & Marwan Awad: Evaluation of machine translation errors in English and Iraqi Arabic. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.729-735. [PDF, 644KB]

(2010) Xiangyu Duan, Rafael E.Banchs, Jun Lang, Deyi Xiong, Aiti Aw, Min Zhang, & Haizhou Li: I2R’s machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.67-72. [PDF, 391KB]

(2010) Guillem Gascó, Vicent Alabau, Jesús Andrés-Ferrer, Jesús González-Rubio, Martha-Alicia Rocha, Germán Sanchis-Trilles, Francisco Casacuberta, Jorge González, & Joan-Andreu Sánchez: ITI-UPV system description for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.85-92. [PDF, 414KB]; presentation by Germán Sanchis-Trilles: 16 slides [PDF of PPT, 481KB]

(2010) Chooi-Ling Goh, Taro Watanabe, Michael Paul, Andrew Finch, & Eiichiro Sumita: The NICT translation  system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.139-146. [PDF, 379KB]

(2010) Chi-Ho Li, Nan Duan, Yinggong Zhao, Shujie Liu, Lei Cui, Mei-yuh Hwang, Amittai Axelrod, Jianfeng Gao, Yaodong Zhang, & Li Deng: The MSRA machine translation system for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.135-138. [PDF, 254KB]; presentation: 17 slides [PDF of PPT, 477KB]

(2010) Wang Ling, Tiago Luís, João Graça, Luísa Coheur & Isabel Trancoso: The INESC-ID machine translation system for the IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.81-84. [PDF, 227KB]

(2010) Michael Paul, Marcello Federico, & Sebastian Stüker: Overview of the IWSLT 2010 evaluation campaign. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.3-27. [PDF, 558KB]

(2010) Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, & Claudia Baur: A multilingual CALL game based on speech translation. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1531-1538. [PDF, 573KB]

(2010) Anne H.Schneider, Ielka van der Sluis, & Saturnino Luz: Comparing intrinsic and extrinsic evaluation of MT output in a dialogue system. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.329-336. [PDF, 448KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

(2010) Hao Xiong, Jun Xie, Hui Yu, Kai Liu, Wei Luo, Haitao Mi, Yang Liu, Yajuan Lü, & Qun Liu: The ICT statistical machine translation system for IWSLT 2010.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.73-79. [PDF, 510KB]

Discontinuous relations

(2015) Nina Seemann, Fabienne Braune, & Andreas Maletti: A systematic evaluation of MBOT in statistical machine translation. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.200-214. [PDF, 585KB]

(2013) Violeta Seretan: On translating syntactically-flexible expressions [abstract]. Invited talk at [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; p.11. [PDF, 284KB]

(2012) Karunesh Kumar Arora & R.Mahesh K.Sinha: Improving statistical machine translation through co-joining parts of verbal constructs in English-Hindi translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.95-101. [PDF, 164KB]

(2011) Marilisa Amoia, Kerstin Kunz, & Ekaterina Lapshinova-Koltunski: Discontinuous constituents: a problematic case for parallel corpora annotation and querying. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.2-10. [PDF, 184KB]

(2011) Mohit Bansal, Chris Quirk, & Robert C.Moore: Gappy phrasal alignment by agreement . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1308-1317. [PDF, 215KB]

(2011) Kevin Gimpel & Noah A.Smith: Generative models of monolingual and bilingual gappy patterns. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.512-522. [PDF, 379KB]

(2011) Wei Wei & Bo Xu: Effective use of discontinuous phrases for hierarchical phrase-based translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.397-404. [PDF, 148KB]

 (2010) Michel Galley & Christopher D.Manning: Accurate non-hierarchical phrase-based translation.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.966-974. [PDF, 433KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Improved discriminative ITG alignment using hierarchical phrase pairs and semi-supervised training. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.730-738. [PDF, 1050KB]

(2010) Hwidong Na, Jin-Ji Li, Yeha Lee, & Jong-Hyeok Lee: A synchronous context free grammar using depependency sequence for syntax-based statistical machine translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 184KB]

Discourse analysis and representation

(2014) Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann, & Bonnie Webber: ParCor 1.0: a parallel pronoun-coreference corpus to support statistical MT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3191-3198. [PDF, 126KB]

(2014) Christian Hardmeier: Discourse in statistical machine translation [abstract]. PhD thesis, Uppsala University, 2014; 2pp. [PDF, 15KB]

(2014) Shafiq Joty, Francisco Guzmán, Lluís Màrquez & Preslav Nakov: DiscoTK: using discourse structure for machine translation evaluation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.402-408. [PDF, 426KB]

(2014) Carolina Scarton & Lucia Specia: Document-level translation quality estimation: exploring discourse and pseudo-references. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.101-108. [PDF, 409KB]

(2013) Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013

(2013) Guisheng Ben, Deyi Xiong, Zhiyang Teng, Yajuan Lü, & Qun Liu: Bilingual lexical cohesion trigger model for document-level machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.382-386. [PDF, 155KB]

(2013) Christian Hardmeier, Sara Stymne, Jörg Tiedemann, & Joakim Nivre: Docent: a document-level decoder for phrase-based statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, System demonstrations, Sofia, Bulgaria, August 4-9 2013; pp.193-198. [PDF, 147KB]

(2013) Behzad Mirzababaei, Heshaam Faili, & Nava Ehsan: Discourse-aware statistical machine translation as a context-sensitive spell checker. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.475-482. [PDF, 1025KB]

(2013) Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová, & Eva Hajičová: (Pre-)annotation of topic-focus articulation in Prague Czech-English dependency treebank. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.55-63. [PDF, 990KB]

(2013) Erwan Moreau & Raphael Rubino: An approach using style classification features for quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.429-434. [PDF, 133KB]

(2013) Mei Tu, Yu Zhou, & Chengqing Zong: A novel translation framework based on rhetorical structure theory.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.370-374. [PDF, 610KB]

(2013) Bonnie Webber, Katja Markert, Andrei Popescu-Belis, Jörg Tiedemann: Introduction to Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.iii-vi. [PDF, 175KB]

(2013) Deyi Xiong, Yang Ding, Min Zhang, & Chew Lim Tan: Lexical chain based cohesion models for document-level statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1563-1573. [PDF, 254KB]

(2012) Najeh Hajlaoui & Andrei Popescu-Belis: Translating English discourse connectives into Arabic: a corpus-based analysis and an evaluation metric. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.1-8. [PDF, 731KB]

(2012) Christian Hardmeier: Discourse in statistical machine translation: a survey and a case study. Discours: Revue de Linguistique, Psycholinguistique et Informatique 11, December 2012; 29pp. [PDF, 432KB]

(2012) Christian Hardmeier, Joakim Nivre, & Jörg Tiedemann: Document-wide decoding for phrase-based statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1179-1190. [PDF, 289KB]

(2012) Iørn Korzen & Morten Gylling: Text structure in a contrastive and translational perspective: on information density and clause linkage in Italian and Danish. Translation: Computation, Corpora,  Cognition 2 (1), July 2012; pp.23-46. [PDF, 601KB]

(2012) Thomas Meyer, Andrei Popescu-Belis, Najeh Hajlaoui, & Andrea Gesmundo: Machine translation of labeled discourse connectives. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 196KB]

(2012) Thomas Meyer & Andrei Popescu-Belis: Using sense-labeled discourse connectives for statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.129-138. [PDF, 275KB]

(2012) Katharina Wäschle & Stefan Riezler: Structural and topical dimensions in multi-task patent translation.  [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 818-828. [PDF, 161KB]

(2012) Bonnie Webber: Discourse and SMT. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 20 slides [PDF of PPT,  459KB]

(2012) Billy T.M.Wong & Chunyu Kit: Extending machine translation evaluation metrics with lexical cohesion to document level. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1060-1068. [PDF, 125KB]

(2011) Bruno Cartoni, Sandrine Zufferey, Thomas Meyer, & Andrei Popescu-Belis: How comparable are parallel corpora? Measuring the distribution of general vocabulary and connectives. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.78-86. [PDF, 153KB]

(2011) Thomas Meyer: Disambiguating temporal-contrastive discourse connectives for machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.46-51. [PDF, 114KB]

(2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS projectTralogy, Paris, 3-4 March 2011;  17pp. [PDF, 169KB]

(2011) Mara Tsoumari & Georgios Petasis: A new annotation tool for aligned bilingual corpora. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.43-52. [PDF, 184KB]

(2010) George Foster, Pierre Isabelle, & Roland Kuhn: Translating structured documents. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 132KB]

(2010) Jesús Giménez, Lluís Màrquez, Elisabet Comelles, Irene Castellón, & Victoria Arranz: Document-level automatic MT evaluation based on discourse representations. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 333-338. [PDF, 112KB]

(2010) Yang Mei & Katrin Kirchhoff: Contextual modeling for meeting translation using unsupervised word sense disambiguation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1227-1235. [PDF, 461KB]

(2010) Richard Moot: Wide-coverage French syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

Emphasis and stress

(2015) Quoc Truong Do, Sakriani Sakti, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: Improving translation of emphasis with pause prediction in speech-to-speech translation systems. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.204-208. [PDF, 1.7MB]

Empty and null categories  see Pronouns

Entailment (see also Logical aspects)

(2014) Rohit Gupta, Hanna Bechara, & Constantin Orasan: Intelligent translation memory matching and retrieval metric exploiting linguistic technology. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.86-89. [PDF, 176KB]

(2013) Ido Dagan, Bernardo Magnini, Guenter Neumann, & Sebastian Pado: EXploring Customer Interactions through Textual EntailMENT. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.417. [PDF, 201KB]

(2013) Sergio Jimenez, Claudia Becerra, & Alexander Gelbukh: SOFTCARDINALITY: learning to identify directional cross-lingual entailment from cardinalities and SMT. [SEMEVAL-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation (SemEval 2013), Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 144KB]

(2013) José G.C.de Souza, Miquel Esplà-Gomis, Marco Turchi, & Matteo Negri: Exploiting qualitative information from automatic word alignment for cross-lingual NLP tasks.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.771-776. [PDF, 277KB]

(2013) Marco Turchi & Matteo Negri: ALTN: word alignment features for cross-lingual textual entailment. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.128-132. [PDF, 386KB]

(2013) Darnes Vilariño, David Pinto, Saúl León, Yuridiana Alemán, & Helena Gómez-Adorno: BUAP: N-gram based feature evaluation for the cross-lingual textual entailment task. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.124-127. [PDF, 83KB]

(2013) Jiang Zhao, Man Lan, & Zheng-Yu Niu: ECNUCS: recognizing cross-lingual textual entailment using multiple text similarity and text difference measures. [SEM-2013] Second Joint Conference on Lexical and Computational Semantics, Volume 2: Seventh International Workshop on Semantic Evaluation, Atlanta, Georgia, June 14-15, 2013; pp.118-123. [PDF, 100KB]

(2012) Julio Castillo & Paula Estrella: Semantic textual similarity for MT evaluation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.52-58. [PDF, 278KB]

(2012) Yashar Mehdad, Matteo Negri, & Marcello Federico: Match without a referee: evaluating MT adequacy without reference translations. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.171-180. [PDF, 126KB]

(2011) Ido Dagan & Shachar Mirkin: Textual entailment inference in machine translation [abstract]. Machine Translation and Morphologically-rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 39 slides [PDF of PPT, 663KB]

 (2011) Yashar Mehdad, Matteo Negri, & Marcello Federico: Using bilingual parallel corpora for cross-lingual textual entailment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1336-1345. [PDF, 236KB]

(2011) Matteo Negri, Luisa Bentivogli, Yashar Mehdad, Danilo Giampiccolo, & Alessandro Marchetti: Divide and conquer: crowdsourcing the creation of cross-lingual textual entailment corpora.  [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.670-679. [PDF, 493KB]

(2010) Yashar Mehdad, Matteo Negri, & Marcello Federico: Towards cross-lingual textual entailment.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.321-324. [PDF, 230KB]

(2010) Matteo Negri & Yashar Mehdad: Creating bi-lingual entailment corpus through translations with Mechanical Turk: $100 for a 10-day rush. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.212-216. [PDF, 101KB]

Etymology

(2013) Vivi Nastase & Carlo Strapparava: Bridging languages through etymology: the case of cross language text categorization.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.651-659. [PDF, 229KB]

Factoring see Annotation and features, Morphological analysis

False friends see Cognates

Features  see Annotations and features, Linguistic features

Function words (see also Conjunctions, Particles, Postpositions, Prepositions)

(2015) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Enhancing function word translation with syntax-based statistical post-editing. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: Sixth Workshop on Patent and Scientific Literature Translation (PSLT6); pp.44-51. [PDF, 542KB]

(2012) Toshiaki Nakazawa & Sadao Kurohashi: Alignment by bilingual generation and monolingual derivation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1963-1978. [PDF, 1172KB]

(2012) Chong Tze Yuang, Rafael E.Banchs, & Chng Eng Siong: An empirical evaluation of stop word removal in statistical machine translation. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.30-37. [PDF, 216KB]

(2011) Yusuke Abe, Takafumi Suzuki, Bing Liang, Takehito Utsuro, Mikio Yamamoto, Suguru Matsuyoshi, & Yasuhide Kawada: Example-based translation of Japanese functional expressions utilizing semantic equivalence classes. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.91-103. [PDF, 776KB]

(2011) Lei Cui, Dongdong Zhang, Mu Li, & Ming Zhou: Function word generation in statistical machine translation systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.139-146. [PDF, 184KB]

(2011) Chenchen Ding, Takashi Inui, & Mikio Yamamoto: Long-distance hierarchical structure transformation rules utilizing function words. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.159-166. [PDF, 215KB]

(2011) Kota Takeya & Yves Lepage: Marker-based chunking for analogy-based translation of chunks.  MT Summit  XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.338-345. [PDF, 171KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

Grammar formalisms see also: Case grammar, Categorial grammar, Context-free grammars, Definite Clause Grammar, Fulcrum approach,  Generalised Phrase Structure Grammar, Head-driven Phrase Struture Grammar,  Left Associative Grammar, Lexical Functional Grammar, Meaning-text model, Montague grammar, Principles and parameters, Role and reference grammar, Transduction grammars, Transformational-generative grammar, Tree Adjoining Grammar, Transduction grammar, Unification grammar

(2013) Matthias Büchse, Alexander Koller, & Heiko Vogler: Generic binarization for parsing and translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.145-154. [PDF, 304KB]

(2012) Sameh Alansary: A formalized reference grammar for UNL-based machine translation between English and Arabic. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.33-42. [PDF, 352KB]

(2012) John J.Camilleri & Krasimir Angelov: GF Eclipse plugin: an IDE for grammar development in GF. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.94. [PDF, 139KB]

(2010) Aarne Ranta, Krasimir Angelov, & Thomas Hallgren: Tools for multilingual grammar-based translation on the web. Proceedings of the ACL 2010 System Demonstrations, Uppsala, Sweden, 13 July 2010; pp.66-71. [PDF, 313KB]

(2010) Aarne Ranta: Multilingual packages of controlled languages: an introduction to GF. CNL2010: 2nd Workshop on Controlled Natural Language, Marettimo Island, Sicily, September 13-15, 2010; 83 slides [PDF of PPT, 1140KB]

Grammars and grammar acquisition

(2013) Greg Hanneman & Alon Lavie: Improving syntax-augmented machine translation by coarsening the label set. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.288-297. [PDF, 1205KB]

(2013) Gideon Maillette de Buy Wenniger & Khalil Sima’an: Hierarchical alignment decomposition labels for Hiero grammar rules. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 19-28. [PDF, 128KB]

(2013) Markus Saers & Dekai Wu: Unsupervised learning of bilingual categories in inversion transduction grammar induction. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 223KB]

(2013) Libin Shen & Bowen Zhou: What is hidden among translation rules. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.839-844. [PDF,118KB]

(2013) Xinyan Xiao & Deyi Xiong: Max-margin synchronous grammar induction for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.255-264. [PDF, 885KB]

(2012) Phil Blunsom: Synchronous grammars and translation. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 45 slides [PDF of PPT, 456KB]

(2012) Masood Ghayoomi: From grammar rule extraction to treebanking: a bootstrapping approach. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1912-1919. [PDF, 1203KB]

(2012) Bevan Jones, Jacob Andreas, Daniel Bauer, Karl Moritz Hermann, & Kevin Knight: Semantics-based machine translation with hyperedge replacement grammars. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1359-1376. [PDF, 169KB]

(2012) Markus Saers, Karteek Addanki, & Dekai Wu: From finite-state to inversion transductions: toward unsupervised bilingual grammar induction. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2325-2339. [PDF, 222KB]

(2012) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Compact rule extraction for hierarchical phrase-based translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 342KB]

(2012) Baskaran Sankaran, Majid Razmara, & Anoop Sarkar: Kriya – an end-to-end hierarchical phrase-based MT system. Prague Bulletin of Mathematical Linguistics 97, April 2012; pp.83-98. [PDF,

(2012) Philip Williams & Philipp Koehn: GHKM rule extraction and scope-3 parsing in Moses. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.388-394. [PDF, 122KB]

(2012) Xinyan Xiao, Deyi Xiong, Yang Liu, Qun Liu, & Shouxun Lin: Unsupervised discriminative induction of synchronous grammar for machine translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2883-2898. [PDF, 252KB]

(2012) Jingbo Zhu, Tong Xiao, & Chunliang Zhang: Learning better rule extraction with translation span alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.280-284. [PDF, 235KB]

(2011) Mark Hopkins, Greg Langmead & Tai Vo: Extraction programs: a unified approach to translation rule extraction. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.523-532. [PDF, 552KB]

(2011) Alexander Koller & Marco Kuhlmann: A generalized view on parsing and translation [abstract]. IWPT 2011: 12th International Confernce on Parsing Technologies, October 5-7, 2011, Dublin City University; pp.2-13. [PDF, 12KB]

(2011) Baskaran Sankaran, Gholamreza Haffari, & Anoop Sarkar: Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.533-541. [PDF, 148KB]

 (2011) Jonathan Weese, Juri Ganitkevitch, Chris Callison-Burch, Matt Post, & Adam Lopez: Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.478-484. [PDF, 164KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) John DeNero & Dan Klein: Discriminative modeling of extraction sets for machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1453-1463. [PDF, 616KB]

 (2010) Adrià de Gispert, Juan Pino, & William Byrne: Hierarchical phrase-based translation grammars extracted from alignment posterior probabilities.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.545-554. [PDF, 162KB]

(2010) Tomoharu Iwata, Daichi Mochihashi, & Hiroshi Sawada: Learning common grammar from multilingual corpus. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.184-188. [PDF, 966KB]

(2010) Richard Moot: Semi-automated extraction of a wide-coverage type-logical grammar for French. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 60KB]

(2010) Richard Moot: Wide-coverage French syntax and semantics using Grail. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 121KB]

(2010) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Fine-grained tree-to-string translation rule extraction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.325-334. [PDF, 766KB]

(2010) Tong Xiao, Jingbo Zhu, Hao Zhang, & Muhua Zhu: An empirical study of translation rule extraction with multiple parsers. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1345-1353. [PDF, 388KB]

(2010) Omar F.Zaidan & Juri Ganitkevitch: An enriched MT grammar for under $100. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.93-98. [PDF, 186KB]

(2010) Sina Zarrieß, Aoife Cahill, Jonas Kuhn, & Christian Rohrer: Cross-lingual induction of deep broad-coverage syntax: a case study on German participles. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1426-1434. [PDF, 106KB]

Head-driven Phrase Structure Grammar

(2012) Han Dan, Katsuhito Sudoh, Xianchao Wu, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: Head finalization reordering for Chinese-to-Japanese machine translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.57-66. [PDF, 257KB]

(2012) Junhui Li, Zhaopeng Tu, Guodong Zhou, & Joseph van Genabith: Head-driven hierarchical phrase-based translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.33-37. [PDF, 268KB]

(2012) Dominikus Wetzel & Francis Bond: Enriching parallel corpora for statistical machine translation with semantic negation rephrasing. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.20-29. [PDF, 163KB]

(2012) Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, & Masaaki Nagata: A comparative study of target dependency structures for statistical machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.100-104. [PDF, 750KB]

(2011) Eva Safar: Machine translation with corpus linguistics and HPSG. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 12 slides [PDF of PPT, 288KB]

(2010) Xianchao Wu, Takuya Matsuzaki, & Jun’ichi Tsujii: Fine-grained tree-to-string translation rule extraction. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.325-334. [PDF, 766KB]

Historical linguistics

(2013) Yves Scherrer & Tomaž Erjavec: Modernizing historical Slovene words with character-based SMT. Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 8-9 August 2013, Sofia, Bulgaria; pp.58-62. [PDF, 110KB]

(2012) Sanja Štajner & Ruslan Mitkov: Using comparable corpora to track diachronic and synchronic changes in lexical density and lexical richness.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.88-97. [PDF, 383KB]

Homonyms, homographs and homophones (see also Word sense disambiguation)

(2013) Pierrette Bouillon, Johanna Gerlach, Ulrich Germann, Barry Haddow & Manny Rayner: Two approaches to correcting homophone confusion in a hybrid machine translation system. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.109-116. [PDF, 298KB]

(2011) Manaal Faruqui, Prasenjit Majumder, & Sebastian Padó: Soundex-based translation correction in Urdu-English cross-language information retrieval. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.25-29. [PDF, 326KB]

Idioms

(2014) Wan Yu Ho, Christine Kng, Shan Wang & Francis Bond: Identifying idioms in Chinese translations. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.716-721. [PDF, 204KB]

(2011) Stéphane Huet & Philippe Langlais: Identifying the translations of idiomatic expressions using TransSearch. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.45-56. [PDF, 1123KB]

(2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

Implicit elements

(2015) Dong Zhan & Hiromi Nakaiwa: Automatic detection of antecedents of Japanese zero pronouns using a Japanese-English bilingual corpus. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.66-79. [PDF, 863KB]

Inflection

(2015) Ke Tran, Arianna Bisazza, & Christof Monz: A distributed inflection model for translating into morphologically rich languages. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.145-159. [PDF, 720KB]

(2013) Victor Chahuneau, Eva Schlinger, Noah A.Smith, & Chris Dyer: Translating into morphologically rich languages with synthetic phrases. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1044-1054. [PDF, 312KB]

(2012) Alexander Fraser, Marion Weller, Aoife Cahill, & Fabienne Cap: Modeling inflection and word formation in SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp.664-674. [PDF, 146KB]

Instructions

(2012) Stephen Doherty & Sharon O’Brien: A user-based usability assessment of raw machine translated technical instructions. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 6pp. [PDF, 143KB]

Interpretation

(2015) Bianca Prandi: Use of CAI tools in interpreters’ training: a pilot study. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.48-57. [PDF, 682KB]

(2013) Susanne Altenberg: The human interpreter in action – multilingualism at the European Parliament. [Keynote at IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.6, 2013; [abstract] 1p. [PDF, 126KB]

(2013) Akiko Sakamoto, Nayuko Watanabe, Satoshi Kamatani, & Kazuo Sumita: Development of a simultaneous interpretation system for face-to-face services and its evaluation in real situation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.85-92. [PDF, 1073KB]

(2013) Akiko Sakamoto, Kazuhiko Abe, Kazuo Sumita & Satoshi Kamatani: Evaluation of a simultaneous interpretation system and analysis of speech log for user experience assessment. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 691KB]

(2013) Hiroaki Shimizu, Graham Neubig, Sakriani Sakti, Tomoki Toda, & Satoshi Nakamura: Constructing a speech translation system using simultaneous interpretation data. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 1041KB]

(2013) Mahsa Yarmohammadi, Vivek K.Rangarajan Sridhar, Srinivas Bangalore, & Baskaran Sankaran: Incremental segmentation and decoding strategies for simultaneous translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1032-1036. [PDF, 192KB]

Interrogatives and questions

(2012) Ângela Costa, Tiago Luís, Joana Ribeiro, Ana Cristina Mendes, & Luísa Coheur: An English-Portuguese parallel corpus of questions: translation guidelines and application in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2172-2176. [PDF, 388KB]

Inversion transduction grammar see Transduction grammars

Language death and preservation

(2012) Steven Bird & David Chiang: Machine translation for language preservation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.125-134. [PDF, 924KB]

(2012) András Kornai: Language death in the digital age. META-FORUM, Brussels, June 19-21, 2012; 36 slides. [PDF of PPT, 475KB]

Language identification

(2014) Shyam S.Agrawal, Abhimane, Shweta Bansal, & Minakshi Mahakshi: Statistical analysis of multilingual text corpus and development of language models. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2436-2440. [PDF, 815KB]

(2012) Marco Lui & Timothy Baldwin: langid.py: an off-the-shelf language identification tool. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 10 July 2012, System Demonstrations; pp.25-30. [PDF, 91KB]

(2012) Gerald Lynch & Carl Vogel: Towards the automatic detection of the source language of a literary translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.775-784. [PDF, 158KB]

(2011) Luciano Barbosa, Srinivas Bangalore, & Vivek Kumar Sridhar Rangarajan: Crawling back and forth: using back and out links to locate bilingual sites. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.429-437. [PDF, 192KB]

(2010) Daniel Chung Yong Lim, Ian Lane & Alex Waibel: Real-time spoken language identification and recognition for speech-to-speech translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.307-312. [PDF, 477KB]

(2010) Fei Xia, Carrie Lewis, & William D.Lewis: The problems of language identification within hugely multilingual data sets.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2790-2797. [PDF, 284KB]

Language universals

 (2012) Tod Allman, Stephen Beale, & Richard Denton: Linguist’sAssistant: a multi-lingual natural language generator based on linguistic universals, typologies, and primitives. INLG 2012: Proceedings of the 7th International Natural Language Generation Conference, Utica, May 2012; pp.59-66. [PDF, 641KB]

(2011) Reut Tsarfaty: Morphology, syntax, and what’s in between: reifying cross-linguistic variation through cross-linguistic parsing. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 109KB]; presentation: 80 slides [PDF of PPT, 1321KB]

(2010) Tomoharu Iwata, Daichi Mochihashi, & Hiroshi Sawada: Learning common grammar from multilingual corpus. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.184-188. [PDF, 966KB]

Latent semantic analysis

(2013) Tsutomu Hirao, Tomoharu Iwata, & Masaaki Nagata: Latent semantic matching: application to cross-language text categorization without alignment information.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.212-216. [PDF, 846KB]

(2013) Ekaterina Stambolieva: Learning comparable corpora from latent semantic analysis simplified document space. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.129-137. [PDF, 531KB]

(2012) Masato Hagiwara & Satoshi Sekine: Latent semantic transliteration using Dirichlet mixture. [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.30-37. [PDF, 588KB]

(2011) Rafael E.Banchs & Haizhou Li: AM-FM: a semantic framework for translation quality assessment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.153-158. [PDF, 83KB]

(2011) Rafael E.Banchs & Marta R.Costa-jussà: A semantic feature for statistical machine translation. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.126-134. [PDF, 576KB]

(2011) Nick Ruiz & Marcello Federico: Topic adaptation for lecture translation through bilingual latent semantic models. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.294-302. [PDF, 159KB]

(2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

(2010) John C.Platt, Kristina Toutanova, & Wen-tau Yih: Translingual document representations from discriminative projections. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.251-261. [PDF, 319KB]

(2010) Billy Tak-Ming Wong: Semantic evaluation of machine translation.  LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2884-2888. [PDF, 574KB]

Lexical Functional Grammar

 (2012) Petr Homola: A machine translation toolchain for polysynthetic languages. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.65-68. [PDF, 211KB]

(2011) François Lareau, Mark Dras, Benjamin Börschinger, & Robert Dale: Collocations in multilingual natural language generation: lexical functions meet lexical functional grammar. [ALTA 2011] Proceedings of Australasian Language Technology Association Workshop, Canberra, Australia, 1-2 December 2011; pp.95-104. [PDF, 302KB]

(2010) Anton Bryl & Josef van Genabith: f-align: an open-source alignment tool for LFG f-structures. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 229KB]

(2010) Yvette Graham & Josef van Genabith: Deep syntax language models and statistical machine translation. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28  August 2010; pp.118-126. [PDF, 163KB]

(2010) Yifan He, Jinhua Du, Andy Way, & Josef van Genabith: The DCU dependency-based metric in WMT-MetricsMATR 2010. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 349-353. [PDF, 444KB]

(2010) Sina Zarrieß, Aoife Cahill, Jonas Kuhn, & Christian Rohrer: Cross-lingual induction of deep broad-coverage syntax: a case study on German participles. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1426-1434. [PDF, 106KB]

Linear transduction grammar see Transduction grammars

Linguistic features

(2014) Abdullah Alrajeh & Mahesan Niranjan: Bayesian reordering model with feature selection. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.477-485. [PDF, 356KB]

(2014) Marta R.Costa-jussà, Parth Gupta, Rafael E.Banchs & Paolo Rosso: English-to-Hindi system descrption for WMT 2014: deep source-context features for Moses. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.79-83. [PDF, 296KB]

(2014) Piyush Dungarwal, Rajen Chatterjee, Abhijit Mishra, Anoop Kunchukuttan, Ritesh Shah, & Pushpak Bhattacharyya: The IIT Bombay HindiÛEnglish translation system at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.90-96. [PDF, 222KB]

(2014) Spence Green, Daniel Cer, & Christopher D.Manning: An empirical comparison of features and tuning for phrase-based machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.466-476. [PDF, 507KB]

(2013) Eunah Cho, Than-Le Ha, & Alex Waibel: CRF-based disfluency detection using semantic features for German to English spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 473KB]

(2013) William D.Lewis & Chris Quirk: Controlled ascent: imbuing statistical MT with linguistic knowledge. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.51-66. [PDF, 502KB]

(2013) Antonio Toral: Hybrid selection of language model training data using linguistic information and perplexity. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.8-12. [PDF, 307KB]

(2013) Antonio Toral, Sudip Kumar Naskar, Joris Vreeke, Federico Gaspari, & Declan Groves: A web application for the diagnostic evaluation of machine translation over specific linguistic phenomena. [NAACL-HLT 2013] Proceedings of the NAACL HLT 2013 Demonstration Session, 10-12 June 2013, Atlanta, Georgia; pp.20-23. [PDF, 281KB]

(2013) Ines Turki Khemakhem, Salma Jamoussi, & Abdelmajid Ben Hamadou: Integrating morpho-syntactic features in English-Arabic statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.74-81. [PDF, 491KB]

(2012) Elisabet Comelles, Jordi Atserias, Victoria Arranz, & Irene Castellón: VERTa: linguistic features in MT evaluation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3944-3950. [PDF, 521KB]

(2011) Elisabet Comelles, Irene Castellón, Jordi Atserias, Victoria Arranz, & Olivier Hamon: VERTa: exploring a multidimensional linguistically-motivated metric. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.21-28. [PDF, 309KB]

(2011) Sudip Kumar Naskar, Antonio Toral, Federico Gaspari, & Andy Way: A framework for diagnostic evaluation of MT based on linguistic checkpoints. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.529-536. [PDF, 170KB]

(2010) Philipp Koehn, Barry Haddow, Philip Williams, & Hieu Hoang: More linguistic annotation for statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 115-120. [PDF, 101KB]

Linguistics and MT

(2014) E.Comelles, J.Atserias, V.Arranz, I.Castellón, & J.Sesé: VERTa: facing the multilingual experience of a linguistically-based MT evaluation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2701-2707. [PDF, 163KB]

(2014) Elisabet Comelles & Jordi Atserias: VERTa participation in the WMT14 metrics task. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.368-375. [PDF, 255KB]

(2013) William D.Lewis & Chris Quirk: Controlled ascent: imbuing statistical MT with linguistic knowledge. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.51-66. [PDF, 502KB]

(2012) Adam Przepiórkowski: Linguistics for language technologies, language technologies for linguistics. HLT Days 27-28 September 2012, Warsaw; 54 slides [PDF of PPT, 533KB]

(2012) Chris Wendt: Machine translation at Microsoft. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; presentation: 39 slides [PDF, 3543KB]

Loan words

(2012) James Breen, Timothy Baldwin, & Francis Bond: Segmentation and translation of Japanese multi-word loanwords. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; pp. 61-69. [PDF, 249KB]

Logical aspects (see also Entailment)

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Sylwia Ozdowska & Vincent Claveau: Inferring syntactic rules for word alignment through inductive logic programming. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.749-754. [PDF, 370KB]

Meaning-text model

(2011) François Lareau, Mark Dras, Benjamin Börschinger, & Robert Dale: Collocations in multilingual natural language generation: lexical functions meet lexical functional grammar. [ALTA 2011] Proceedings of Australasian Language Technology Association Workshop, Canberra, Australia, 1-2 December 2011; pp.95-104. [PDF, 302KB]

Measure words see Numbers and numerals

Minimum Recursion Semantics

(2012) Dominikus Wetzel & Francis Bond: Enriching parallel corpora for statistical machine translation with semantic negation rephrasing. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.20-29. [PDF, 163KB]

Minority languages

(2013) Igor Leturia, Kepa Sarasola, Xabier Arregi, Arantza Diaz de Ilarraza, Eva Navas, Iñaki Sainz, Arantza del Pozo, David Baranda, & Urtza Iturraspe: The BerbaTek project for Basque: promoting a less-resourced language via language technology for translation, content management and learning. Translation: Computation, Corpora,  Cognition 3 (1), June 2013; pp.119-135. [PDF, 785KB]

(2012) András Kornai: Language death in the digital age. META-FORUM, Brussels, June 19-21, 2012; 36 slides. [PDF of PPT, 475KB]

(2012) Jon Riding: Hunting the snark: the problem posed for MT by non-catenative morphologies. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 10pp. [PDF, 286KB], presentation: 26 slides [PDF, 1277KB]

(2011) Vamshi Ambati, Stephan Vogel, & Jaime Carbonell: Multi-strategy approaches to active learning for statistical machine translation.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.122-129. [PDF, 432KB]

(2011) Te Taka Keegan & Hori Manuirirangi: Minority languages & translation technologies case study: te reo Māori & Google Translator Toolkit. Translating and the Computer 33, 17-18 November 2011, London; 6pp. [PDF, 339KB]

(2010) proceedings of 7th SaLTMiL Workshop on Creation and use of basic lexical resources for less-resourced languages, LREC-2010: proceedings of the seventh international conference on Language Resources and Evaluation, Valletta, Malta, 23 May 2010. [PDF, 916KB]

Modality

 (2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2010) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Nathaniel W.Filardo, Lori Levin, & Christine Piatko: A modality lexicon and its use in automatic tagging. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1402-1407. [PDF, 375KB]

Multiple meaning see Disambiguation

Multiword expressions [see also Nouns and Noun phrases, Prepositional phrases, Verbs and verb phrases]

(2015) Martin Benjamin, Amar Mukunda & Jeff Allen: Kamusi pre-D-source-side disambiguation and a sense aligned multilingual lexicon. Proceedings of the 37th Conference Translating and the Computer, London, November 26-27, 2015; pp.27-32. [PDF, 188KB]

(2014) Sandra Antunes & Amália Mendes: An evaluation of the role of statistical measures and frequency for MWE identification. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4046-4051. [PDF, 138KB]

(2014) A.Barreiro, J.Monti, B.Orliac, S. Preuß, K.Arrieta, W.Ling, F.Batista & I.Trancoso: Linguistic evaluation of support verb constructions by OpenLogos and Google Translate. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.35-40. [PDF, 131KB]

(2014) Guillaume Jacquet, Maud Ehrmann, & Ralf Steinberger: Clustering of multi-word named entity variants: multilingual evaluation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 217KB]

(2014) Adam Kilgarriff, Miloš Jakubíček, Vojtěch Kovář, Pavel Rychlý, Vít Baisa, & Lucia Kocincová: Extrinsic corpus evaluation with a collocation dictionary task. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.545-552. [PDF, 143KB]

(2014) Valia Kordoni & Iliana Simova: Multiword expressions in machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1208-1211. [PDF, 110KB]

(2014) Lis Pereira, Elga Strafella, & Yuji Matsumoto: Collocation or free combination? Applying machine translation techniques to identify collocations in Japanese. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.736-739. [PDF, 325KB]

(2014) Liling Tan & Santanu Pal: Manawi: using multi-word expressions and named entities to improve machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.201-206. [PDF, 191KB]

(2013) Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technologies, eds. Johanna Monti, Ruslan Mitkov, Gloria Corpas Pastor, Violeta Seretan, MT Summit XIV workshop, Nice, France, 3 September, 2013.

(2013) Lars Bungum, Björn Gambäck, André Lynum, & Erwin Marsi: Improving word translation disambiguation by capturing multiword expressions with dictionaries. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, June 13-14, 2013; pp.21-30. [PDF, 155KB]

(2013) Mahmoud Ghoneim & Mona Diab: Multiword expressions in the context of statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1181-1187. [PDF, 120KB]

(2013) Adam Kilgarriff: Terminology finding, parallel corpora and bilingual word sketches in the Sketch Engine. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 6pp. [PDF, 1018KB]; presentation, 23 slides [PDF of PPT, 957KB]

(2013) Santanu Pal, Sudio Kumar Naskar, & Sivaji Bandyopadhyay: MWE alignment in phrase based statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.61-68. [PDF, 539KB]

(2013) Karl Pichotta & John DeNero: Identifying phrasal verbs using many bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.636-646. [PDF, 267KB]

(2013) Rico Sennrich: Promoting flexible translations in statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.207-214. [PDF, 599KB]

(2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbeaum: Automatic construction of a multi-word expressions bilingual lexicon: a statistical machine translation evaluation perspective. COLING 2012: Proceedings of the 3rd Workshop on Cognitive Aspects of the Lexicon (CogALex-III), Mumbai, December 2012; pp.95-107. [PDF, 210KB]

(2012) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Identifying bilingual multi-word expressions for statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.674-679. [PDF, 387KB]

(2012) James Breen, Timothy Baldwin, & Francis Bond: Segmentation and translation of Japanese multi-word loanwords. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; pp. 61-69. [PDF, 249KB]

(2012) Giuliano Lancioni & Marco Boella: Idiomatic MWEs and machine translation, a retrieval and representation model: the AraMWE project. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.9-16.[PDF, 786KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

(2012) Felipe Sánchez-Martínez, Rafael C.Carrasco, Miguel A.Martínez-Prieto, & Joaquín Adiego: Generalized bywords for bitext compression and translation spotting. Journal of Artificial Intelligence Research 43; pp.389-418. [PDF, 418KB]

(2011) Abhijit Bhole, Goutham Tholpadi, & Raghavendra Udupa: Mining multi-word named entity equivalents from comparable corpora. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.65-72. [PDF, 820KB]

(2011) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Improved statistical machine translation using multiword expressions. [LIHMT] International Workshop on Using Linguistic Information for Hybrid Machine Translation, 18th November 2011, Universitat Politècnica de Catalunya, Barcelona; pp.15-20. [PDF, 345KB]

(2011) Petter Haugereid & Francis Bond: Extracting transfer rules for multiword expressions from parallel corpora. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.92-100. [PDF, 228KB]

(2011) Stéphane Huet & Philippe Langlais: Identifying the translations of idiomatic expressions using TransSearch. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.45-56. [PDF, 1123KB]

(2011) Hyoung-Gyu Lee, Joo-Young Lee, Min-Jeong Kim, Hae-Chang Rim, Joong-Hwi Shin, & Young-Sook Hwang: Phrase segmentation model using collocation and translational entropy.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.198-205. [PDF, 203KB]

(2011) François Lareau, Mark Dras, Benjamin Börschinger, & Robert Dale: Collocations in multilingual natural language generation: lexical functions meet lexical functional grammar. [ALTA 2011] Proceedings of Australasian Language Technology Association Workshop, Canberra, Australia, 1-2 December 2011; pp.95-104. [PDF, 302KB]

(2011) Zhanyi Liu, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li: Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044. [PDF, 462KB]

(2011) Luís Marujo, Nuno Grazina, Tiago Luís, Wang Ling, Luísa Coheur, & Isabel Trancoso: BP2EP – adaptation of Brazilian Portuguese texts to European Portuguese. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.129-136. [PDF, 352KB]

(2011) Johanna Monti, Anabela Barreiro, Annibale Elia, Federica Marano, & Antonella Napoli: Taking on new challenges in multi-word unit processing for machine translation. Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.11-19. [PDF, 148KB]

(2011) Hwidong Na & Jong-Hyeok Lee: Multi-word unit dependency forest-based translation rule extraction. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.41-51. [PDF, 466KB]

(2011) Tsuyoshi Okita & Andy Way: MWE-sensitive word alignment in factored translation model. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 27 January, 2011; 2pp. [PDF, 42KB]; presentation: 51 slides [PDF of PPT, 650KB]

(2011) Santanu Pal, Tanmoy Chakraborty, & Sivaji Bandyopadhyay: Handling multiword expressions in phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.215-224. [PDF, 161KB]

(2011) Johan Segura & Violaine Prince: Using alignment to detect multiword expressions in bilingual corpora. .  Tralogy, Paris, 3-4 March 2011;  17pp. [PDF, 430KB]

(2011) Violeta Seretan & Eric Wehrli: FipsCoView: on-line visualisation of collocations extracted from multilingual parallel corpora. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.125-127. [PDF, 120KB]

(2011) Thoudam Doren Singh & Sivaji Bandyopadhyay: Integration of reduplicated multiword expressions and named entities in a phrase based statistical machine translation system. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1304-1312. [PDF, 658KB]

(2011) Yulia Tsvetkov & Shuly Wintner: Identification of multi-word expressions by combining multiple linguistic information sources. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; 2pp. [PDF, 59KB]; presentation: 56 slides [PDF of PPT , 2080KB]

(2010) Anabele Barreiro, Annibale Elia, Johanna Monti, & Mario Monteleone: Mixed up with machine translation: multi-word units disambiguation challenge.  Translating and the Computer 32, 18-19 November 2010, London; 21pp. [PDF, 138KB]

(2010) Marine Carpuat & Mona Diab: Task-based evaluation of multiword expressions: a pilot study in statistical machine translation. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.242-245. [PDF, 92KB]

(2010)  Marta R.Costa-jussà, Vidas Daudaravicius, & Rafael E.Banchs: Integration of statistical collocation segmentations in a phrase-based statistical machine translation system. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 975KB]

(2010) Xiangyu Duan, Min Zhang, & Haizhou Li: Pseudo-word for phrase-based machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.148-156. [PDF, 275KB]

(2010) María Fernández-Parra & Pius ten Hacken: Identifying fixed expressions: a comparison of SDL MultiTerm Extract and Déjà Vu’s lexicon. Translating and the Computer 32, 18-19 November 2010, London; 26pp. [PDF, 455KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussà, Vidas Daudaravicius, Rafael E.Banchs, & José B.Mariño: UPC-BMIC-VDU system description for the IWSLT 2010: testing several collocation segmentations in a phrase-based SMT system.  Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.189-195. [PDF, 439KB]

(2010) Carlos A.Henríquez Q., Marta R.Costa-jussà, Vidas Daudaravicius, Rafael E.Banchs, & José B.Mariño: Using collocation segmentation to augment the phrase table. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 98-102. [PDF, 1398KB]

(2010) Zhanyi Liu, Haifeng Wang, Hua Wu, & Sheng Li: Improving statistical machine translation with monolingual collocation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.825-833. [PDF, 405KB]

(2010) Tsuyoshi Okita, Alfredo Maldonado Guerra, Yvette Graham, & Andy Way: Multi-word expression-sensitive word alignment. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.26-34. [PDF, 239KB]

(2010) Xabier Saralegi & Maddalen Lopez de Lacalle: Dictionary and monlingual corpus-based query translation for Basque-English CLIR. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1353-1358. [PDF, 284KB]

(2010)  Nasredine Semmar, Christophe Servan, Gaël de Chalendar, Benoît Le Ny, & Jean-Jacques Bouzaglou: A hybrid word alignment approach to improve translation lexicons with compound words and idiomatic expressions. Translating and the Computer 32, 18-19 November 2010, London; 10pp. [PDF, 95KB]; presentation, 37 slides [PDF of PPT, 934KB]

(2010) Yulia Tsvetkov & Shuly Wintner: Extraction of multi-word expressions from small parallel corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1256-1264. [PDF, 257KB]

Names and named entities

(2014) Guillaume Jacquet, Maud Ehrmann, & Ralf Steinberger: Clustering of multi-word named entity variants: multilingual evaluation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 217KB]

(2014) Keith Miller, Linda Moreau, & Sherri Condon: Handling entities in MT/CAT/HLT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Tutorials, 88 slides

(2014) Liling Tan & Santanu Pal: Manawi: using multi-word expressions and named entities to improve machine translation. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.201-206. [PDF, 191KB]

(2013) Mahmoud Azab, Houda Bouamor, Behrang Mohit, & Kemal Oflazer: Dudley North visits North London: learning when to transliterate to Arabic. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.439-444. [PDF, 148KB]

(2013) Wanxiang Che, Mengqiu Wang, Christopher D.Manning, & Ting Liu: Named entity recognition with bilingual constraints. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.52-62. [PDF, 289KB]

(2013) Yufeng Chen, Chengqing Zong; & Keh-Yi Su: A joint model to identify and align bilingual named entities. Computational Linguistics 39 (2); pp.229-266. [PDF, 708KB]

(2013) Kareem Darwish: Named entitiy recognition using cross-lingual resources: Arabic as an example.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1558-1567. [PDF, 264KB]

(2013) Taesung Lee & Seung-won Hwang: Bootstrapping entity translation on weakly comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.631-640. [PDF, 624KB]

(2013) Haibo Li, Jing Zheng, Heng Ji, Qi Li, & Wen Wang: Name-aware machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.604-614. [PDF, 443KB]

(2013) Santanu Pal, Sudip Kumar Naskar, & Sivaji Bandyopadhyay: A hybrid word alignment model for phrase-based statistical machine translation. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.94-101. [PDF, 297KB]

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

(2013) Mengqiu Wang, Wanxiang Che, & Christopher D.Manning: Joint word alignment and bilingual named entity recognition using dual decomposition. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1073-1082. [PDF, 424KB]

(2013) Gae-won You, Young-rok Cha, Jinhan Kim, & Seung-won Hwang: Enriching entity translation discovery using selective temporality. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.201-205. [PDF, 651KB]

(2013) Chao Zhang, Shiqi Zhao, & Haifeng Wang: Bootstrapping large-scale named entities using URL-text hybrid patterns. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.293-301. [PDF, 249KB]

(2012) see Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea.

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Mahathi Bhagavatula, Santosh GSK, & Vasudeva Varma: Language-independent named entity identification using Wikipedia. [ACL 2012] Proceedings of the First Workshop on Multilingual Modeling, Jeju, Republic of Korea, 8-14 July 2012; pp.11-17. [PDF, 262KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Hindi and Marathi to English NE transliteration tool using phonology and stress analysis. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.111-118. [PDF, 301KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Optimizing transliteration for Hindi/Marathi to English using only two weights. COLING: Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology, December 2012; pp. 31-47. [PDF, 293KB]

(2012) Ryuichiro Higashinaka, Kugatsu Sadamitsu, Kuniko Saito, Toshiro Makino, & Yoshihiro Matsuo: Creating an extended named entity dictionary from Wikipedia. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1163-1178. [PDF, 176KB]

(2012) Brant N.Kay & Brian C.Rineer: Approaches to Arabic name transliteration and matching in the DataFlux quality knowledge base. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp. 32-37. [PDF, 607KB]

(2012) Sungchul Kim, Kristina Toutanova, & Hwanjo Yu: Multilingual named entitity recognition using parallel data and metadata from Wikipedia. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.694-702. [PDF, 417KB]

(2012) Dawn Lawrie, James Mayfield, Paul McNamee, & Douglas W.Oard: Creating and curating a cross-language person-entity linking collection.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3106-3110. [PDF, 312KB]

(2012) Xuansong Li, Stephanie M.Strassel, Heng Ji, Kira Griffitt, & Joe Ellis: Linguistic resources for entity linking evaluation: from monolingual to cross-lingual.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3098-3105. [PDF, 537KB]

(2012) Keith J.Miller, Elizabeth Schroeder Richerson, Sarah McLeod, James Finley, & Aaron Schein: International multicultural name matching competition: design, execution, results, and lessons learned.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3111-3117. [PDF, 638KB]

(2012) Vassilina Nikoulina, Agnes Sandor, & Marc Dymetman: Hybrid adaptation of named entity recognition for statistical machine translation. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT), Mumbai, December 2012; pp.1-15. [PDF, 198KB]

(2012) Thi Thanh Thao Phan, & Izabella Thomas: E-V MT of proper names: error analysis and some proposed solutions. TSD 2012: 15th International Conference on Text, Speech and Dialogue, Brno, Czech Republic, September 3-7, 2012; abstract #530, 1p. [HTML]

(2012) Mārcis Pinnis: Latvian and Lithuanian named entity recognition with TildeNER. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1258-1265. [PDF, 955KB]

(2012) Feiliang Ren: A practical Chinese-English ON translation method based on ON’s distribution characteristics on the web. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp. 239-246. [PDF, 102KB]

(2012) Carolin Shihadeh & Günter Neumann: ARNE – a tool for named entity recognition from Arabic text. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.24-31. [PDF, 1190KB]

(2012) Oscar Täckström: Nudging the envelope of direct transfer methods for multilingual named entity recognition. NAACL-HLT Workshop on the Induction of Linguistic Structure, Montréal, Canada, June 3-8, 2012; pp.55-63. [PDF, 188KB]

(2012) Katherine M.Young, Jeremy Gwinnup, & Joshua Reinhart: Reversing the Palladius mapping of Chinese names in Russian text. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 202KB]; abstract, 1p. [PDF, 74KB]; presentation, 37 slides [PDF of PPT, 1386KB]

(2011) see  Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011

(2011) Yufeng Chen & Chengqing Zong: A semantic-specific model for Chinese named entity translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.138-146. [PDF, 282KB]

(2011) Thoudam Doren Singh & Sivaji Bandyopadhyay: Integration of reduplicated multiword expressions and named entities in a phrase based statistical machine translation system. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1304-1312. [PDF, 658KB]

(2011) Héla Fehri, Kais Haddar, & Abdelmajid Ben Hamadou: A new representation model for the automatic recognition and translation of Arabic named entities with NooJ.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.545-550. [PDF, 214KB]

(2011) Héla Fehri, Kais Haddar, & Abdelmajid Ben Hamadou: Recognition and translation of Arabic named entities with NooJ using a new representation model. Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, Blois (France), July 12-15; pp.134-142. [PDF, 298KB]

(2011) Ruiji Fu, Bing Qin, & Ting Liu: Generating Chinese named entity data from a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.264-272. [PDF, 450KB]

(2011) Deirdre Hogan, Jennifer Foster, & Josef van Genabith: Decreasing lexical data sparsity in statistical syntactic parsing – experiments with named entities. Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, USA, 23 June 2011; pp.14-19. [PDF, 97KB]

(2011) Tomoya Iwakura: A named entity recognition method using rules acquired from unlabeled data. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.170-177. [PDF, 306KB]

(2011) Johannes Knopp: Extending a multilingual lexical resource by bootstrapping named entity classification using Wikipedia’s category system. [IJCNLP 2011] Proceedings of the 5th Workshop on Cross Lingual Information Access, Chiang Mai, Thailand, November 13, 2011; pp.35-43. [PDF, 326KB]

(2011) Kiran Kumar N, Santosh GSK, & Vasudeva Varma: A language-independent approach to identify the named entities in under-resourced languages, and clustering multilingual documents. CLEF 2011: Conference on Multilingual and Multimodal Iinformation Access Evaluation, 19-22 September 2011, Amsterdam; 29slides [PDF of PPT, 185KB]

(2011) Emeline Lecuit, Denis Maurel, & Duško Vitas: A tagged and aligned corpus for the study of proper names in translation. AEPC 2011: proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora, associated with the 8th International Conference on Recent Advances in Natural Language Processing (RANLP 2011), 15th September 2011, Hissar, Bulgaria; pp.11-18. [PDF, 244KB]

(2011) Wang Ling, Pável Calado, Bruno Martins, Isabel Trancoso, Alan Black, & Luísa Coheur: Named entity translation using anchor texts. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.206-213. [PDF, 442KB]

(2011) Paul McNamee, James Mayfield, Dawn Lawrie, Douglas W.Oard, & David Doerman: Cross-language entity linking. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.255-263. [PDF, 409KB]

(2011) Santanu Pal, Tanmoy Chakraborty, & Sivaji Bandyopadhyay: Handling multiword expressions in phrase-based statistical machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.215-224. [PDF, 161KB]

(2011) Antonio Toral & Andy Way: Automatic acquisition of named entities for rule-based machine translation.  Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, Barcelona, Spain, January 20-21, 2011, ed. F.Sánchez-Martínez and J.A.Pérez-Ortiz; pp.37-43. [PDF, 151KB]

(2011) Marco Turchi: Multilingual text mining and machine translation activities carried out at the EC’s Joint Research Center. Invited talk at: Sixth Machine Translation Marathon, 5-10 September 2011, Trento, Italy; 52 slides [PDF of PPT, 4286KB]

(2010) Yassine Benajiba, Imed Zitouni, Mona Diab, & Paolo Rosso: Arabic named entity recognition: using features extracted from noisy data. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.281-285. [PDF, 122KB]

(2010) Gosse Bouma: Cross-lingual ontology alignment using EuroWordNet and Wikipedia. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1023-1028. [PDF, 336KB]

(2010) David Burkett, Slav Petrov, John Blitzer, & Dan Klein: Learning better monolingual models with unannotated bilingual text. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.46-54. [PDF, 431KB]

(2010) Yufeng Chen, Chenqing Zong, & Keh-Yih Su: On jointly recognizing and aligning bilingual named entities. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.631-639. [PDF, 151KB]

(2010) William McIntyre: Translation of Chinese entities in Russian text. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Brian Roberson: Multi-language desktop suite. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Ling-Xiang Tang, Shlomo Geva, Andrew Trotman, & Yue Xu: A voting mechanism for named entity translation in English-Chinese question answering. [Coling 2010] Proceedings of the 4th Workshop on Cross Lingual Information Access, Beijing, China, 28 August 2010; pp.43-51. [PDF, 554KB]

(2010) Raghavendra Udupa & Mitesh Khapra: Improving the multilingual user experience of Wikipedia using cross-language name search. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.492-500. [PDF, 246KB]

(2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

Negation

(2012) Kathryn Baker, Michael Bloodgood, Bonnie J.Dorr, Chris Callison-Burch, Nathaniel W.Filardo, Christine Piatko, Lori Levin, & Scott Miller: Use of modality and negation in semantically-informed syntactic MT. Computational Linguistics 38 (2), pp. 411-438. [PDF, 3140KB]

(2012) Danielle L.Mowery, Sumithra Velupillai, & Wendy W.Chapman: Medical diagnosis lost in translation – analysis of uncertainty and negation expressions in English and Swedish clinical texts. Proceedings of the 2012 Workshop on Biomedical Natural Language Processing (BioNLP 2012), Montréal, Canada, June 8, 2012; pp.56-64. [PDF, 288KB]

(2012) Dominikus Wetzel & Francis Bond: Enriching parallel corpora for statistical machine translation with semantic negation rephrasing. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.20-29. [PDF, 163KB]

Neologisms see Unknown and rare words

Non-contiguous relations see Discontiguous relations

Normalization

(2013) Wei Xu, Alan Ritter, & Ralph Grishman: Gathering and generating paraphrases from Twitter with application to normalization. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.121-128. [PDF, 274KB]

Nouns and noun phrases

(2014) Marcin Junczys-Dowmunt & Bruno Pouliquen: SMT of German patents at WIPO: decompounding and verb structure pre-reordering.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.217-220. [PDF, 369KB]

(2014) Carla Parra Escartin: Chasing the perfect splitter: a comparison of different compound splitting tools. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2990-2994. [PDF, 3340-3347B]

(2014) Philip Williams, Rico Sennrich, Maria Nadejde, Matthias Huck, Eva Hasler, & Philipp Koehn: Edinburgh’s syntax-based systems at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.207-214. [PDF, 222KB]

(2013) İlknur Durgar El-Kahlout & Coşkun Mermer: TÜBİTAK-BİLGEM German-English machine translation systems for W13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.109-113. [PDF, 100KB]

(2013) Masato Hagiwara & Satoshi Sekine: Accurate word segmentation using transliteration and language model projection. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics,  Short papers, Sofia, Bulgaria, August 4-9 2013; pp.183-189. [PDF, 548KB]

(2013) Sara Stymne, Nicola Cancedda, & Lars Ahrenberg: Generation of compound words in statistical machine translation into compounding languages. Computational Linguistics 39 (4); pp.1067-1108. [PDF, 615KB]

(2013) Marion Weller, Max Kisselew, Svetlana Smekalova, Alexander Fraser, Helmut Schmid, Nadir Dur­rani, Hassan Sajjad & Richárd Farkas: Munich-Edinburgh-Stuttgart submissions at WMT13: morphological and syntactic processing for SMT.  WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.232-239. [PDF, 160KB]

(2012) Hiroyuki Kaji, Takashi Tsunakawa, & Yoshihiro Komatsubara: Improving compositional translation with comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.134-142. [PDF, 449KB]

(2011) Shane Bergsma, David Yarowsky, & Kenneth Church: Using large monolingual and bilingual corpora to improve coordination disambiguation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1346-1355. [PDF, 157KB]

(2011) Nathan Green: Effects of noun phrase bracketing in dependency parsing and machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Student Session, Portland, Oregon, June 19-24, 2011; pp.69-74. [PDF, 288KB]

(2011) Els Lefever & Véronique Hoste: An evaluation and possible improvement path for current SMT behavior on ambiguous nouns. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.52-60. [PDF, 164KB]

(2011) Klaus Macherey, Andrew M.Dai, David Talbot, Ashok C.Popat, & Franz Och: Language-independent compound splitting with morphological operations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1395-1404. [PDF, 562KB]

(2011) Sara Stymne: Definite noun phrases in statistical machine translation into Scandinavian languages. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.289-296. [PDF, 336KB]; presentation, 50 slides [PDF]

(2011) Sara Stymne & Nicola Cancedda: Productive generation of compound words in statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.250-260. [PDF, 167KB]

(2011) Philip Williams & Philipp Koehn: Agreement constraints for statistical machine translation into German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.217-226. [PDF, 146KB]

(2010) Mohamed Altantawy, Nizar Habash, Owen Rambow, & Ibrahim Saleh: Morphological analysis and generation of Arabic nouns: a morphemic functional approach. LREC 2010: proceedings of the seventh international conference on Language Resources and Evaluation, organised by ELRA, 17-23 May 2010, Valletta, Malta; pp.851-858. [PDF,               899KB]

(2010) Hiroshi Echizen-ya & Kenji Araki: Automatic evaluation method for machine translation using noun-phrase chunking. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.108-117. [PDF, 209KB]

 (2010) İlknur Durgar El-Kahlout & François Yvon: The pay-offs of preprocessing for German-English statistical machine translation. [PDF, 301KB]

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

(2010) Fabienne Fritzinger & Alexander Fraser: How to avoid burning ducks: combining linguistic analysis and corpus statistics for German compound processing.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 224-234. [PDF, 178KB]

(2010) Gábor Recski, András Rung, Attila Zséder, & András Kornai: NP alignment in bilingual corpora. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3379-3382. [PDF, 305KB]

 (2010) Maria Rosenberg: Lexical representation of agentive nominal compounds in French and Swedish [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 59KB]

Number (singular and plural)

(2011) Elizabeth Baran & Nianwen Xue: Singular or plural? Exploiting parallel corpora for Chinese number prediction. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.207-214. [PDF, 187KB]

(2011) David Mareček, Rudolf Rosa, Petra Galuščáková, & Ondřej Bojar: Two-step translation with grammatical post-processing. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.426-432. [PDF, 119KB]

Numbers and numerals

(2012) Mei Tu, Yu Zhou & Chengqing Zong: A universal approach to translating numerical and time expressions. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.209-216. [PDF, 756KB]

Out of vocabulary see Unknown and rare words

Paraphrases

(2014) Marianna Apidianaki, Emilia Verzeni, & Diana McCarthy: Semantic clustering of pivot paraphrases. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4270-4275. [PDF, 243KB]

(2014) Petra Barančiková, Rudolf Rosa, & Aleš Tamchyna: Improving evaluation of English-Czech MT through paraphrasing.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2696-2700. [PDF, 127KB]

(2014) Petra Barančíková: Parmesan: Meteor without paraphrases with paraphrased references. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.355-361. [PDF, 305KB]

(2014) Mercedes García-Martínez, Karan Singla, Aniruddha Tammewar, Bartolomé Mesa-Lao, Ankita Thakur, Anusuya M.A., Srinivas Bangalore, & Michael Carl: SEECAT: ASR & Eye-tracking enabled computer-assisted translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; pp.81-88. [PDF, 624KB]

(2014) Michaela Regneri, Rui Wang, & Manfred Pinkal: Aligning predicate-argument structures for paraphrase fragment extraction. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.4300-4307. [PDF, 216KB]

(2013) Wilker Aziz & Lucia Specia: Multilingual WSD-like constraints for paraphrase extraction. Proceedings of the Seventeenth Conference on Computational Natural Language Learning, Sofia, Bulgaria, 8-9 August 2013; pp.202-211. [PDF, 248KB]

(2013) Juri Ganitkevitch, Benjamin Van Durme, & Chris Callison-Burch: PPDB: the paraphrase database. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.758-764. [PDF, 217KB]

(2013) Majid Razmara, Maryam Siahbani, Gholamreza Haffari, & Anoop Sarkar: Graph propagation for paraphrasing out-of-vocabulary words in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1105-1115. [PDF, 325KB]

(2013) Wei Xu, Alan Ritter, & Ralph Grishman: Gathering and generating paraphrases from Twitter with application to normalization. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.121-128. [PDF, 274KB]

(2013) Yulan Yan, Chikara Hashimoto, Kentaro Torisawa, Takao Kawai, Jun’ichi Kazama, & Stijn de Saeger: Minimally supervised method for multilingual paraphrase extraction from definition sentences on the web. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.63-73. [PDF, 381KB]

(2012) Marianna Apidianaki: Measuring the adequacy of cross-lingual paraphrases in a machine translation setting. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.63-72. [PDF, 137KB]

(2012) Emma Barker & Rob Gaizauskas: Assessing the comparability of news texts. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3996-4003. [PDF, 348KB]

(2012) Houda Bouamor, Aurélien Max, Gabriel Illouz, & Anne Vilnat: A contrastive review of paraphrase acquisition techniques.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2653-2658. [PDF, 352KB]

(2012) Houda Bouamor, Aurélien Max, & Anne Vilnat: Validation of sub-sentential paraphrases acquired from parallel monolingual corpora. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 716-725. [PDF, 173KB]

(2012) Chris Brockett: Diverse words, shared meanings: statistical machine translation for paraphrase, grounding, and intent: invited talk [abstract]. ALTA 2012: Proceedings of Australasian Language Technology Association Workshop, 4-6 December 2012, Otago University, Dunedin, New Zealand; 1p. [PDF, 69KB]

(2012) Christian Buck, Daniel Zeman, & Eva Hasler: Multiple reference translations for European languages. Machine Translation Marathon 2012 September 3-8, Edinburgh, UK; 12 slides [PDF of PPT, 83KB]

(2012) Juri Ganitkevitch, Yuan Cao, Jonathan Weese, Matt Post, & Chris Callison-Burch: Joshua 4.0: packing, PRO, and paraphrases. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.283-291. [PDF, 195KB]

(2012) Wei He, Hua Wu, Haifeng Wang, & Ting Liu: Improve SMT quality with automatically extracted paraphrase rules. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.979-987. [PDF, 330KB]

(2012) Georgi Iliev & Angel Genov: Expanding parallel resources for medium-density languages for free.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3937-3943. [PDF, 321KB]

(2012) Dennis N.Mehay & Michael White: Shallow and deep paraphrasing for improved machine translation parameter optimization. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.27-30. [PDF, 1267KB]

(2012) Tsuyoshi Okita: Neural probabilistic language model for system combination. COLING 2012: Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT (ML4HMT, Mumbai, December 2012; pp.65-75. [PDF, 150KB]

(2012) Hong Sun & Ming Zhou: Joint learning of a dual SMT system for paraphrase generation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.38-42. [PDF, 121KB]

(2012) Pidong Wang, Preslav Nakov, & Hwee Tou Ng: Source language adaptation for resource-poor machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.286-296. [PDF, 171KB]

(2010) Sander Wubben, Antal van den Bosch, & Emiel Krahmer: Paraphrase generation as monolingual translation: data and evaluation. INLG 2010: Proceedings of the 6th International Natural Language Generation conference, July 7-9, 2010, Trim, Co.Meath, Ireland; pp.203-208. [PDF, 139KB]

(2012) Sander Wubben, Antal van den Bosch, & Emiel Krahmer: Sentence simplification by monolingual machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.1015-1024. [PDF, 139KB]

(2011) Kfir Bar & Nachum Dershowitz: Using verb paraphrases for Arabic-to-English example-based translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 94KB];  presentation: 27 slides [PDF of PPT, 1356KB]

(2011) Houda Bouamor, Aurélien Max, & Anne Vilnat: Monolingual alignment by edit rate computation on sentential paraphrase pairs. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.395-400. [PDF, 138KB]

(2011) Boxing Chen, Roland Kuhn, & George Foster: Semantic smoothing and fabrication of phrase pairs for SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.144-150. [PDF, 335KB]

(2011) Ido Dagan & Shachar Mirkin: Textual entailment inference in machine translation [abstract]. Machine Translation and Morphologically-rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011; presentation: 39 slides [PDF of PPT, 663KB]

(2011) Juri Ganitkevitch, Chris Callison-Burch, Courtney Napoles, & Benjamin Van Durme: Learning sentential paraphrases from bilingual parallel corpora for text-to-text generation. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1168-1179. [PDF, 311KB]

(2011) Wei He, Shiqi Zhao, Haifeng Wang, & Ting Liu: Enriching SMT training data via paraphrasing. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.803-810. [PDF, 157KB]

(2011) Jie Jiang, Jinhua Du, & Andy Way: Incorporating source-language paraphrases into phrase-based SMT with confusion networks. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.31-40. [PDF, 1325KB]

(2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

(2011) Yuval Marton, Ahmed El Kholy, & Nizar Habash: Filtering antonymous, trend-contrasting, and polarity-dissimilar distributional paraphrases for improving statistical machine translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.237-249. [PDF, 177KB]

(2011) Preslav Nakov & Hwee Tou Ng: Translating from morphologically complex languages: a paraphrase-based approach. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1298-1307. [PDF, 156KB]

(2011) Masao Utiyama, Graham Neubig, Takashi Onishi, & Eiichiro Sumita: Searching translation memories for paraphrases.  MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.325-331. [PDF, 86KB]

 (2011) Rui Wang & Chris Callison-Burch: Paraphrase fragment extraction from monolingual comparable corpora. ACL 2011: Proceedings of the Fourth Workshop on Building and Using Comparable Corpora, Portland, Oregon, USA, 24 June 2011; pp.52-60. [PDF, 303KB]

(2010) Hala Almaghout, Jie Jiang, & Andy Way: The DCU machine translation systems for IWSLT 2010. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.37-44. [PDF, 621KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Michael Denkowski & Alon Lavie: METEOR-NEXT and the METEOR paraphrase tables: improved evaluation support for five target languages. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 339-342. [PDF, 104KB]

(2010) Michael Denkowski, Hassan Al-Haj, & Alon Lavie: Turker-assisted paraphrasing for English-Arabic machine translation. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, Los Angeles, CA, June 2010; pp.66-70. [PDF, 237KB]

 (2010) Jinhua Du, Jie Jiang, & Andy Way: Facilitating translation using source language paraphrase lattices.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.420-429. [PDF, 766KB]

 (2010) Yakov Kronrod, Philip Resnik, Olivia Buzek, Chang Hu, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. Contribution to workshop of ‘Collaborative translation’ at AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31, 2010; 4pp. [PDF, 79KB]

(2010) Roland Kuhn, Boxing Chen, George Foster, & Evan Stratford: Phrase clustering for smoothing TM probabilities – or, how to extract paraphrases from phrase tables. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.608-616. [PDF, 328KB]

(2010) Yuval Marton: Improved statistical machine translation with hybrid phrasal paraphrases derived from monolingual text and shallow lexical resource. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 293KB]

 (2010) Aurélien Max: Example-based paraphrasing for improved phrase-based statistical machine translation.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.656-666. [PDF, 234KB]

(2010) Takashi Onishi, Masao Utiyama & Eiichiro Sumita: Paraphrase lattice for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.1-5. [PDF, 419KB]

(2010) Soma Paul, Prashant Mathur, & Sushant Kishore: Syntactic construct: an aid for translating English nominal compound into Hindi. Proceedings of the NAACL HLT 2010 Workshop on Extracting and Using Constructions in Computational Linguistics, Los Angeles, CA, June 2010; pp.32-38. [PDF, 107KB]

 (2010) Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronrod, Alex Quinn, & Benjamin B.Bederson: Improving translation via targeted paraphrasing. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.127-137. [PDF, 548KB]

(2010) Shiqi Zhao, Haifeng Wang, Xiang Lan, & Ting Liu: Leveraging multiple MT engines for paraphrase generation. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.1326-1334. [PDF, 597KB]

Parentheses

(2010) Filip Graliński: Mining parenthetical translations for Polish-English lexica [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 58KB]

Parsing see Syntactic analysis, Tagging

Particles

(2011) Ma Jianjun, Huang Degen, Liu Haixia, & Sheng Wenfeng: POS tagging of English particles for machine translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.57-63. [PDF, 222KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2010) Jinhua Du & Andy Way: A discriminative latent variable-based “DE” classifier for Chinese-English SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.286-294. [PDF, 698KB]

Parts of speech see Tagging

Passives

(2012) Benjamin Gottesman: Statistical machine translation for depassivizing German part-of-speech sequences. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.55-60. [PDF, 441KB]

Personal names see Names [named entities]

Phonology and phonetics

(2015) Oliver Adams, Graham Neubig, Trevor Cohn & Steven Bird: Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.248-255. [PDF, 2.8MB]

(2012) Zeeshan Ahmed, Jie Jiang, Julie Carson-Berndsen, Peter Cahill, & Andy Way: Hierarchical phrase-based MT for phonetic representation-based speech translation. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 246KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Hindi and Marathi to English NE transliteration tool using phonology and stress analysis. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.111-118. [PDF, 301KB]

(2012) Jagadeesh Jagarlamudi & Hal Daumé III: Regularized interlingual projections: evaluation on multilingual transliteration. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.12-23. [PDF, 700KB]

(2010) Kareem Darwish: Transliteration mining with phonetic conflation and iterative training. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.39-47. [PDF, 465KB]

Possessives

(2014) Irina Burukina: Translating implicit elements in RBMT. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.182-193. [PDF, 209KB]

Postpositions

 (2012) Jayan V, Sunil R, & Bhadran V K: Disambiguation of pre/post positions in English – Malayalam text translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 93-101. [PDF, 249KB]

(2010) Jin-Ji Li, Jungi Kim & Jong-Hyeok Lee: Transferring syntactic relations of subject-verb-object pattern in Chinese-to-Korean SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 774KB]

(2010) Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, & Suguru Matsuyoshi: Utilizing semantic equivalence classes of Japanese functional expressions in translation rule acquisition from parallel patent sentences. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1777-1785. [PDF, 942KB]

Pragmatics in MT

(2013) Marco Turchi, Matteo Negri & Marcello Federico: Coping with the subjectivity of human judgements in MT quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.240-251. [PDF, 472KB]

(2012) David Farwell: Pragmatics as the ultimate challenge to high quality MT [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.10-11. [PDF]

(2012) Takatomo Kano, Sakriani Sakti, Shinnosuke Takamichi, Graham Neubig, Tomoki Toda, & Satoshi Nakamura: A method for translation of paralinguistic information. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.158-163. [PDF, 759KB]

(2010) Carmen Banea, Rada Mihalcea, & Janyce Wiebe: Multilingual subjectivity: are more languages better? Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.28-36. [PDF, 128KB]

(2010) Jungi Kim, Jin-Ji Li, & Jong-Hyeok Lee: Evaluating multilanguage-comparability of subjectivity analysis systems. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.595-603. [PDF, 459KB]

(2010) Fangzhong Su & Katja Markert: Word sense subjectivity for cross-lingual lexical substitution. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.357-360. [PDF, 109KB]

Predicates and arguments see Dependency relations

Prepositions

(2014) Marion Weller, Sabine Schulte im Walde, & Alexander Fraser: Using noun class information to model selectional preferences for translating prepositions in SMT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.275-287. [PDF, 581KB]

(2012) Jayan V, Sunil R, & Bhadran V K: Disambiguation of pre/post positions in English – Malayalam text translation. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp. 93-101. [PDF, 249KB]

(2012) Reshef Shilon, Hanna Fadida, & Shuly Wintner: Incorporating linguistic knowledge in statistical machine translation: translating prepositions. EACL Workshop on Innovative Hybrid Approaches to the Processing of Textual Data,  23 April 2012, Avignon, France; pp.106-114. [PDF, 137KB]

(2010) Md.Zahurul Islam, Jörg Tiedemann & Andreas Eisele: English to Bangla phrase-based machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 601KB]

Pronouns

(2015) Dong Zhan & Hiromi Nakaiwa: Automatic detection of antecedents of Japanese zero pronouns using a Japanese-English bilingual corpus. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.66-79. [PDF, 863KB]

(2014) Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann, & Bonnie Webber: ParCor 1.0: a parallel pronoun-coreference corpus to support statistical MT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3191-3198. [PDF, 126KB]

(2014) Christian Hardmeier: Discourse in statistical machine translation [abstract]. PhD thesis, Uppsala University, 2014; 2pp. [PDF, 15KB]

(2013) Christian Hardmeier, Jörg Tiedemann & Joakim Nivre: Latent anaphora resolution for cross-lingual pronoun prediction. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.380-391. [PDF, 274KB]

(2013) Michal Novák, Anna Nedoluzhko, & Zdeněk Žabokrtský: Translation of “it” in a deep syntax framework. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.51-59. [PDF, 384KB]

(2013) Michal Novák, Zdeněk Žabokrtský & Anna Nedoluzhko: Two case studies on translating pronouns in a deep syntax framework. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1037-1041. [PDF, 348KB]

(2013) Bing Xiang, Xiaoqiang Luo, & Bowen Zhou: Enlisting the ghost: modeling empty categories for machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.822-831. [PDF, 294KB]

(2012) Liane Guillou: Improving pronoun translation for statistical machine translation. [EACL 2012] Proceedings of the EACL Student Research Workshop, Avignon, France, 26 April 2012; pp. 1-10. [PDF, 373KB]

(2012) Lorenza Russo, Sharid Loáiciga, & Asheesh Gulati: Italian and Spanish null subjects: a case study evaluation in an MT perspective.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1779-1784. [PDF, 372KB]

(2012) Lorenza Russo, Sharid Loáiciga, & Asheesh Gulati: Improving machine translation of null subjects in Italian and Spanish.  [EACL 2012] Proceedings of the EACL Student Research Workshop, Avignon, France, 26 April 2012; pp. 81-88. [PDF, 116KB]

(2012) Hirotoshi Taira, Katsuhito Sudoh, & Masaaki Nagata: Zero pronoun resolution can improve the quality of J-E translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.111-118. [PDF, 249KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2010) Christian Hardmeier & Marcello Federico: Modelling pronominal anaphora in statistical machine translation. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.283-289. [PDF, 353KB]

(2010) Ronan Le Nagard & Philipp Koehn: Aiding pronouns translation with co-reference resolution. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 252-261. [PDF, 146KB]

Pronunciation

 (2013) Michael Heck, Sebastian Stüker, Sakriani  Sakti, Alex Waibel, & Satoshi Nakamura: Incremental unsupervised training for university lecture recognition. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 5pp. [PDF, 162KB]

(2011) Jun Hatori & Hisami Suzuki: Japanese pronunciation prediction as phrasal statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.120-128. [PDF, 580KB]

Punctuation

(2015) Eunah Cho, Jan Niehues, Kevin Kilgour, & Alex Waibel: Punctuation insertion for real-time spoken language translation. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.173-179. [PDF, 2.5MB]

(2013) Alexandra Birch, Nadir Durrani, & Philipp Koehn: English SLT and MT system description for the IWSLT 2013 evaluation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 9pp. [PDF, 225KB]

(2012) Eunah Cho, Jan Niehues, & Alex Waibel: Segmentation and punctuation prediction in speech language translation using a monolingual translation system. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.252-259. [PDF, 603KB]

(2011) Stephan Peitz, Markus Freitag, Arne Mauser, & Hermann Ney: Modeling punctuation prediction as machine translation. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.238-245. [PDF, 214KB]

(2010) Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez, & Francisco Casacuberta: UPV-PRHLT English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 172-176. [PDF, 90KB]

Questions see Interrogatives and questions

Rare words see Unknown and rare words

Reduplicatives

(2011) Thoudam Doren Singh & Sivaji Bandyopadhyay: Integration of reduplicated multiword expressions and named entities in a phrase based statistical machine translation system. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1304-1312. [PDF, 658KB]

Reference

(2014) Liane Guillou, Christian Hardmeier, Aaron Smith, Jörg Tiedemann, & Bonnie Webber: ParCor 1.0: a parallel pronoun-coreference corpus to support statistical MT. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.3191-3198. [PDF, 126KB]

(2013) Guisheng Ben, Deyi Xiong, Zhiyang Teng, Yajuan Lü, & Qun Liu: Bilingual lexical cohesion trigger model for document-level machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.382-386. [PDF, 155KB]

(2013) Beata Beigman Klebanov & Michael Flor: Associative texture is lost in translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.27-32. [PDF, 309KB]

(2013) Deyi Xiong, Yang Ding, Min Zhang, & Chew Lim Tan: Lexical chain based cohesion models for document-level statistical machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1563-1573. [PDF, 254KB]

(2012) Minwei Feng, Weiwei Sun, & Hermann Ney: Semantic cohesion model for phrase-based SMT. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.867-878. [PDF, 227KB]

(2012) Baoli Li: Learning to model multilingual unrestricted coreference in OntoNotes. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.129-135. [PDF, 130KB]

(2012) Ziheng Lin, Chang Liu, Hwee Tou Ng, & Min-Yen Kan: Combining coherence models and machine translation evaluation metrics for summarization evaluation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.1006-1014. [PDF, 183KB]

(2012) Kateřina Veselovská, Nguy Giang Linh, & Michal Novák: Using Czech-English parallel corpora in automatic identification of it.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.112-120. [PDF, 306KB]

(2012) Rob Voigt & Dan Jurafsky: Towards a literary machine translation: the role of referential cohesion. NAACL-HLT Workshop on Computational Linguistics for Literature, Montréal, Canada, June 8, 2012; pp.18-25. [PDF, 222KB]

(2012) Billy T.M.Wong & Chunyu Kit: Extending machine translation evaluation metrics with lexical cohesion to document level. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.1060-1068. [PDF, 125KB]

(2012) Hao Xiong & Qun Liu: ICT: system description for CoNLL-2012. Proceedings of the Joint Conference on EMNLP and CoNLL: Shared Task, July 13, 2012, Jeju Island, Korea; pp.71-75. [PDF, 557KB]

(2011) Andrei Popescu-Belis, Bruno Cartoni, Andrea Gesmundo, James Henderson, Cristina Grisot, Paola Merlo, Thomas Meyer, Jacques Moeschler & Sandrine Zufferey: Improving MT coherence through text-level processing of input texts: the COMTIS projectTralogy, Paris, 3-4 March 2011;  17pp. [PDF, 169KB]

Registers see Dialects and registers

Rhetorical Structure Theory see Discourse representation

Sentiment

(2013) Alexandra Balahur & Marco Turchi: Improving sentiment analysis in Twitter using multilingual machine translated data. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.49-55. [PDF, 99KB]

(2013) Motaz Saad, David Langlois, & Kamel Smaili: Comparing multilingual comparable articles based on opnions. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.105-111. [PDF, 512KB]

(2013) Svitlana Volkova, Theresa Wilson, & David Yarowsky: Exploring sentiment in social media: bootstrapping subjectivity clues from multilingual Twitter streams.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.505-510. [PDF, 317KB]

(2012) Alexandra Balahur & Marco Turchi: Multilingual sentiment analysis using machine translation. [ACL 2012] Proceedings of the 3rd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, Jeju, Republic of Korea, 12 July 2012; pp.52-60. [PDF, 133KB]

(2012) Guo-Hau Lai, Ying-Mei Guo, & Richard Tzong-Han Tsai: Unsupervised Japanese-Chinese opinion word translation using dependency distance and feature-opinion association weight. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1503-1517. [PDF, 254KB]

(2012) Xinfan Meng, Furu Wei, Ge Xu, Longkai Zhang, Xiaohua Liu, Ming Zhou, & Houfeng Wang: Lost in translations? Building sentiment lexicons using context based machine translation. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.829-838. [PDF, 232KB]

(2012) Smruthi Mukund & Rohini K.Srihari: Analyzing Urdu social media for sentiments using transfer learning with controlled translations. Proceedings of the 2012 Workshop on Language in Social Media (LSM 2012), Montréal, Canada, June 7, 2012; pp.1-8. [PDF, 393KB]

(2012) Amira Shoukry & Ahmed Rafea: Preprocessing Egyptian dialect tweets for sentiment mining. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.47-56. [PDF, 1193KB]

(2011) Kevin Duh, Akinori Fujino, & Masaaki Nagata: Is machine translation ripe for cross-lingual sentiment classification?  ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.429-433. [PDF, 95KB]

 (2011) Sergey Kulikov: What is web-based machine translation up to? Tralogy, Paris, 3-4 March 2011; 11pp. [PDF, 131KB]

 (2011) Mohammed Rushdi-Saleh, M.Teresa Martín-Valdivia, L.Alfonso Ureña-López, & José M.Perea-Ortega: Bilingual experiments with an Arabic-English corpus for opinion mining.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.740-745. [PDF, 390KB]

(2010) Carmen Banea, Rada Mihalcea, & Janyce Wiebe: Multilingual subjectivity: are more languages better? Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.28-36. [PDF, 128KB]

(2010) Christian Scheible: Sentiment translation through lexicon induction. Proceedings of the ACL 2010 Student Research Workshop, Uppsala, Sweden, 13 July 2010; pp.25-30. [PDF, 119KB]

(2010) Christian Scheible, Florian Laws, Lukas Michelbacher, & Hinrich Schütze: Sentiment translation through multi-edge graphs. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1104-1112. [PDF, 236KB]

(2010) Bin Wei & Christopher Pal: Cross lingual adaptation: an experiment in sentiment classifications. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.258-262. [PDF, 109KB]

Sign languages

(2013) Jens Forster, Christoph Schmidt, & Hermann Ney: SIGNSPEAK: scientific understanding and vision-based technological development for continuous sign language recognition and translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.441. [PDF, 227KB]

(2013) Camillo Lugaresi & Barbara Di Eugenio: Translating Italian connectives into Italian sign language. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.270-280. [PDF, 192KB]

(2013) Christoph Schmidt, Oscar Koller, Hermann Ney, Thomas Hoyoux, & Justus Piater: Using viseme recognition to improve a sign language translation system. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 727KB]

(2012) ATLAS: automatic translation into sign languages. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.91. [PDF, 85KB]

(2011) Jin-Woo Chung & Jong C.Park: Text parsing for sign language generation with combinatory categorial grammar. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 11pp. [PDF, 420KB]

(2011) Kirsty J.W.Crombie Smith: Sign language – are we making information accessible?  Translating and the Computer 33, 17-18 November 2011, London; 11pp. [PDF, 571KB]

(2011) M.Filhol & A.Braffort: Text to SL translation.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 13 slides [PDF of PPT, 1044KB]

(2011) Michael Filhol, Line Patris, & Pierre Guitteny: Text-sign parallel corpus study to start designing an automatic translation system.  International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 23 October 2011, Dundee, UK; 7pp. [PDF, 351KB]

(2011) Matt Huenerfauth: Cyclic data-driven research on American sign language automation. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 18 slides [PDF of PPT, 7886KB]

(2011) Leonardo Lesmo, Alessandro Mazzei, & Daniele Radicioni: Linguistic processing in the ATLAS project. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 25 slides [PDF of PPT, 5367KB]

(2011) Leonardo Lesmo, Alessandro Mazzei & Daniele P.Radicioni: An ontology based architecture for translation. Proceedings of the Ninth International Conference on Computational Semantics (IWCS 2011), ed. Johan Bos and Stephen Pulman, January 12-14, Oxford, UK; pp.345-349. [PDF, 199KB]

(2011) Sara Morrissey: Assessing three representation methods for sign language machine translation and evaluation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.137-144. [PDF, 343KB]

(2011) Sara Morrissey: Body at work: using corpora in sign language machine translation. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 20 slides [PDF of PPT, 2584KB]

(2011) Eva Safar: Machine translation with corpus linguistics and HPSG. International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 12 slides [PDF of PPT, 288KB]

(2011) Christoph Schmidt, Daniel Stein, & Hermann Ney: Challenges in statistical sign language translation . International Workshop on Sign Language Translation and Avatar Technology (SLTAT), 10-11 January 2011, Federal Ministry of Labour and Social Affairs, Berlin, Germany; 21 slides [PDF of PPT, 360KB]

(2010) Philippe Dreuw, Jens Forster, Yannick Gweth, Daniel Stein, Hermann Ney, Gregorio Martinez, Jaume Verges Llahi, Onno Crasborn, Ellen Ormel, Wei Du, Thomas Hoyoux, Justus Piater, Jose Miguel Moya, & Mark Wheatley: SignSpeak – understanding, recognition, and translation of sign languages. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.65-72. [PDF, 6730KB]

(2010) Guillem Massó & Toni Badia: Dealing with sign language morphemes in statistical machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.154-157. [PDF, 543KB]

(2010) Sara Morrissey, Harold Somers, Robert Smith, Shane Gilchrist & Sandipan Dandapat: Building a sign language corpus for use in machine translation. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.172-177. [PDF, 588KB]

(2010) Daniel Stein, Jens Forster, Uwe Zelle, Philippe Dreuw, & Hermann Ney: RWTH-Phoenix: analysis of the German sign language corpus. [LREC 2010] 4th Workshop on the Representation and Processing of Sign Languages: Corpora and Sign Language  Technologies, Malta, May 2010; pp.225-230. [PDF, 2678KB]

(2010) Daniel Stein, Christoph Schmidt, & Hermann Ney: Sign language machine translation overkill. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.337-344. [PDF, 2052KB]

Simplification

(2015) Isao Goto, Hideki Tanaka, & Tadashi Kumano: Japanese news simplification: tak design, data set construction, and analysis of simplified text. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.17-31. [PDF, 569KB]

(2013) Maxine Eskenazi, Yibin Lin, & Oscar Saz: Tools for non-native readers: the case for translation and simplification. [NAACL HLT 2013] Proceedings of the 2nd Workshop of Natural Language Processing for Improving Textual Accessibility (NLP4ITA), Atlanta, Georgia, 14 June, 2013; pp.20-28. [PDF, 297KB]

(2013) David Kauchak: Improving text simplification language modeling using unsimplified text data. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1537-1546. [PDF, 302KB]

(2012) Han-Bin Chen, Hen-Hsen Huang, Hsin-His Chen, & Ching-Ting Tan: A simplification-translation-restoration framework for cross-domain SMT applications. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.545-560. [PDF, 744KB]

(2012) Sander Wubben, Antal van den Bosch, & Emiel Krahmer: Sentence simplification by monolingual machine translation. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.1015-1024. [PDF, 139KB]

Slang

(2015) Nikhil Bojja, Arun Nedunchezhian, & Pidong Wang: Machine translation in mobile games: augmenting social media text normalization with incentivized feedback. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; pp.11-16. [PDF, 465KB]  

Slot grammar

(2012) Kurt Eberle, Bogdan Babych, Johanna Geiß, Mireia Ginestí-Rosell, Anthony Hartley, Reinhard Rapp, Serge Sharoff, & Martin Thomas: Design of a hybrid high quality machine translation system. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp.101-112. [PDF, 381KB]

Spatial expressions

(2010) Ineke Schuurm an & Vincent Vandeghinste: Cultural aspects of spatiotemporal analysis in multilingual applications. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3308-3313. [PDF, 318KB]

Spelling checkers and correction

(2014) Irina Galinskaya, Valentin Gusev, Elena Meshcheryakova, & Mariya Shmatova: Measuring the impact of spelling errors on the quality of machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2683-2689. [PDF, 448KB]

(2013) Alexey Borisov,Jacob Dlougach & Irina Galinskaya: Yandex School of Data Analysis machine translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.99-103. [PDF, 125KB]

(2013) Behzad Mirzababaei, Heshaam Faili, & Nava Ehsan: Discourse-aware statistical machine translation as a context-sensitive spell checker. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.475-482. [PDF, 1025KB]

(2012) Hisami Suzuki & Jianfeng Gao: A unified approach to transliteration-based text input with online spelling correction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.609-618. [PDF, 432KB]

(2011) Sara Stymne: Spell checking techniques for replacment of unknown words and data cleaning for Haitian Creole SMS translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.470-477. [PDF, 107KB]

Stratificational grammar

(2011) Igor Boguslavsky: Processing of Russian by the ETAP-3 linguistic processor. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 110KB]; presentation: 48 slides [PDF of PPT, 227KB]

Stylistic aspects

(2013) Erwan Moreau & Raphael Rubino: An approach using style classification features for quality estimation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.429-434. [PDF, 133KB]

(2012) Arianna Bisazza & Marcello Federico: Cutting the long tail: hybrid language models for translation style adaptation. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 439-448. [PDF, 266KB]

(2012) Monica Gavrila, Walther v.Hahn, & Cristina Vertan: Same domain different discourse style: a case study on language resources for data-driven machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3441-3446. [PDF, 344KB]

Subjectivity see Pragmatics and MT

Sublanguage [see also Domain restriction and specification]

(2015) Masaru Fuji, Atsushi Fujita, Masao Utiyama, Eiichiro Sumita, & Yuji Matsumoto: Patent claim translation based on sublanguage-specific sentence structure. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.1-16. [PDF, 700KB]

(2014) Najeh Hajlaoui: SMT for restricted sublanguage in CAT tool context at the European Parliament. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.228-234. [PDF, 163KB]

(2013) Lingxiao Wang & Christian Boitet: Online production of HQ parallel corpora and permanent task-based evaluation of multiple MT systems: both can be obtained through iMAGs with no added cost. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), Nice, France, 2 September 2013; pp. 103-110. [PDF, 2109KB]

(2010) Iustina Ilisei, Diana Inkpen, Gloria Corpas Pastor, & Ruslan Mitkov: Identification of translational sublanguage: a machine learning approach [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 57KB]

(2010) Graham Russell: Moz: translation of structured terminology-rich text. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 157KB]

Syllables

(2012) Chunyue Zhang, Tingting Li, & Tiejun Zhao: Syllable-based machine transliteration with extra phrase features.  [ACL 2012] Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea; pp.52-56. [PDF, 163KB]

(2011) Oi Yee Kwong: English-Chinese personal name transliteration by syllable-based maximum matching. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.96-100. [PDF, 256KB]

(2010) Chai Wutiwiwatchai & Ausdang Thangthai: Syllable-based Thai-English machine transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.66-70. [PDF, 189KB]

Synchronous context-free grammar see Context-free grammar

Synchronous dependency grammar see Dependency relations

Synonyms

(2014) Ann Bies, Justin Mott, Seth Kulick, Jennifer Garland, & Colin Warner: Incorporating alternate translations into English Translation Treebank.  LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1863-1868. [PDF, 339KB]

(2013) Daniel Andrade, Masaaki Tsuchida, Takashi Onishi, & Kai Ishikawa: Synonym acquisition using bilingual comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1077-1081. [PDF, 402KB]

(2013) Daniel Andrade, Masaaki Tsuchida, Takashi Onishi, & Kai Ishikawa: Translation acquisition using synonym sets. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.655-660. [PDF, 500KB]

(2012) Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama, & Akiko Aizawa: Automatic translation of scholarly terms into patent terms using synonym extraction techniques.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3447-3451. [PDF, 393KB]

(2010) Kfir Bar & Nachum Dershowitz: Using synonyms for Arabic-to-English example-based translation. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 83KB]

(2010) Hiroyuki Shindo, Akinori Fujino, & Masaaki Nagata: Word alignment with synonym regularization. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.137-141. [PDF, 492KB]

Taxonomies

(2013) Mihael Arcan, Susan Marie Thomas, Derek de Brandt, & Paul Buitelaar: Translating the FINREP taxonomy using a domain-specific corpus. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.199-206. [PDF, 473KB]

(2011) John McCrae, Maurizio Espinoza, Elena Monteil-Ponsoda, Guadalupe Aguado-de-Cea, & Philipp Cimiano: Combining statistical and semantic approaches to the translation of ontologies and taxonomies. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.116-125. [PDF, 576KB]

Temporal expressions

 (2013) Gae-won You, Young-rok Cha, Jinhan Kim, & Seung-won Hwang: Enriching entity translation discovery using selective temporality. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.201-205. [PDF, 651KB]

(2012) Mei Tu, Yu Zhou & Chengqing Zong: A universal approach to translating numerical and time expressions. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.209-216. [PDF, 756KB]

(2010) Ineke Schuurm an & Vincent Vandeghinste: Cultural aspects of spatiotemporal analysis in multilingual applications. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.3308-3313. [PDF, 318KB]

Tense

(2014) Sharid Loáiciga, Thomas Meyer, & Andrei Popescu-Belis: English-French verb phrase alignment in Europarl for tense translation modeling. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 674-681. [PDF, 179KB]

(2013) Thomas Meyer, Cristina Grisot, & Andrei Popescu-Belis: Detecting narrativity to improve English to French translation of simple past verbs. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.33-42. [PDF, 307KB]

(2012) Gong Zheng Xian, Zhang Min, Tan ChewLim, & Zhou GuoDong: Classifier-based tense model for SMT. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.411-420. [PDF, 190KB]

(2012) Zhengxian Gong, Min Zhang, Chewlim Tan, & Guodong Zhou: N-gram-based tense models for statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.276-285. [PDF, 265KB]

(2011) Feifan Liu, Fei Liu, & Yang Liu: Learning from Chinese-English parallel data for Chinese tense prediction. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1116-1124. [PDF, 556KB]

(2010) Lucia Silva: Fine-tuning in Brazilian Portuguese-English statistical transfer machine translation: verbal tenses.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.58-63. [PDF, 251KB]

Terminology management

(2015) Jennifer DeCamp: Machine translation and terminology management [abstract]. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.2: MT Users’ Track; p.19. [PDF, 579KB]  

(2014) Mihael Arcan, Marco Turchi, Sara Topelli, & Paul Buitelaar: Enhancing statistical machine translation with bilingual terminology in a CAT environment. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.54-68. [PDF, 561KB]

(2014) Michael Farrell: Solving terminology problems more quickly with ‘IntelliWebSearch (Almost) Unlimited’. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.211-216. [PDF, 114KB]

(2014) Nizar Ghoula, Jacques Guyot, & Gilles Falquet: Terminology management revisited. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.56-65. [PDF, 916KB]

(2014) Koen Kerremans: Representing intra- and interlingual terminological variation in a new type of translation resource: a prototype proposal. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.45-50. [PDF, 377KB]

(2014) Adam Kilgarriff: Terminology finding in the Sketch Engine: an evaluation. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.130-132. [PDF, 286KB]

(2014) Tsuyoshi Okita, Ali Hosseinzadeh Vahid, Andy Way, & Qun Liu: The DCU terminology translation system for the medical query subtask at WMT14. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.239-245. [PDF, 352KB]

(2014) Marion Weller, Alexander Fraser, & Ulrich Heid: Combining bilingual terminology mining and morphological modeling for domain adaptation in SMT. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.11-18. [PDF, 387KB]

(2014) Ventsislav Zhechev: Term translation central: up-to-date MT without frequent retraining. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Commercial MT Users, pp.152-159. [PDF, 432KB]

(2014) SCATE: Smart Computer Aided Translation Environment. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.135. [PDF, 311KB]

 (2014) TaaS Terminology as a Service. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014; p.60. [PDF, 337KB]

(2013) Ahmet Aker, Monica Paramita, & Robert Gaizauskas: Extracting bilingual terminologies from comparable corpora.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.402-411. [PDF, 204KB]

(2013) Mihael Arcan, Susan Marie Thomas, Derek de Brandt, & Paul Buitelaar: Translating the FINREP taxonomy using a domain-specific corpus. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.199-206. [PDF, 473KB]

(2013) Dhouha Bouamor, Nasredine Semmar, & Pierre Zweigenbaum: Using WordNet and semantic similarity for bilingual terminology mining from comparable corpora. Proceedings of the 6th Workshop on Building and Using Comparable Corpora, Sofia, Bulgaria, August 8, 2013; pp.16-23. [PDF, 628KB]

(2013) Béatrice Daille: TTC: terminology extraction, translation tools and comparable corpora. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.449. [PDF, 272KB]

(2013) Klaus Fleischmann: Collaborative terminology in the age of social web 2.0. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 4pp. [PDF, 246KB]; presentation, 31 slides [PDF of PPT, 3384KB]

(2013) Amir Hazem & Emmanuel Morin: Word co-occurrence counts prediction for bilingual terminology extraction from comparable corpora. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1392-1400. [PDF, 162KB]

(2013) Adam Kilgarriff: Terminology finding, parallel corpora and bilingual word sketches in the Sketch Engine. [Aslib 2013] Translating and the Computer 35, 28-29 November 2013, etc.venues, Paddington, London, UK; 6pp. [PDF, 1018KB]; presentation, 23 slides [PDF of PPT, 957KB]

(2013) Horst Liebscher & Thomas Senf: Streamlining terminology management in an RBMT context. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.391-396. [PDF, 483KB]

(2013) Clara López Rodríguez, Juan Antonio Prieto Velasco, & Maribel Tercedor Sánchez: Multimodal representation of specialised knowledge in ontology-based terminological databases: the case of EcoLexicon. Journal of Specialised Translation 20, July 2013; pp.49-67. [PDF, 336KB]

(2013) Olatz Perez-de-Viñaspre, Maite Oronoz, Manex Agirrezabal & Mikel Lersundi: A finite state approach to translate SNOMED CT terms into Basque using medical prefixes and suffixes. Proceedings of the 11th International Conference on Finite State Methods and Natural Language Processing, St Andrews, Scotalnd, 15-17 July 2013; pp.99-103. [PDF, 169KB]

(2013) Koichi Sato, Koichi Takeuchi, & Kyo Kageura: Terminology-driven augmentation of bilingual terminologies. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.3-10. [PDF, 847KB]

(2013) Raivis Skadiņš, Mārcis Pinnis, Tatiana Gornostay, & Andrejs Vasiļjevs: Application of online terminology services in statistical machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.281-286. [PDF, 573KB]

(2013) Itsuki Toyota, Zi Long, Lijuan Dong, Takehito Utsuro, & Mikio Yamamoto: Compositional translation of technical terms by integrating patent families as a parallel corpus and a comparable corpus. [MT Summit XIV] Proceedings of the 5th Workshop on Patent Translation, Nice, September 2, 2013; pp.16-23. [PDF, 2058KB]

(2013) Andrejs Vasiljevs & Tatiana Gornostay: TaaS – terminology as a service. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.445. [PDF, 288KB]

(2013) Petra Wolf & Ulrike Bernardi: Hybrid domain adaptation for a rule based MT system. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.321-328. [PDF, 415KB]

(2012) Mihael Arcan, Christian Federmann, & Paul Buitelaar: Experiments with term translation. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.67-82. [PDF, 178KB]

(2012) Mihael Arcan, Paul Buitelaar, & Christian Federmann: Using domain-specific and collaborative resources for term translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.86-94. [PDF, 122KB]

(2012) Gerhard Budin: Terminological ontologies in multi-lingual cross-domain communities of practice [abstract]. In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; p.19. [PDF]

(2012) Joseph Z.Chang, Jason S.Chang, & Jyh-Shing Roger Jang: Learning to find translations and transliterations on the web. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, Short Papers; pp.130-134. [PDF, 270KB]

(2012) Elena Chiocchetti: Integration and harmonisation of multilingual terminological data sets [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.20-21. [PDF]

(2012) Béatrice Daille: Building bilingual terminologies from comparable corpora: the TTC TermSuite.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.29-32. [PDF, 317KB]

(2012) Estelle Delpech, Béatrice Daille, Emmanuel Morin, & Claire Lemaire: Identification of fertile translations in medical comparable corpora: a morpho-compositional approach. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 154KB]

(2012) Isabel Durán Muñoz: Meeting translators’ needs: translation-oriented terminological management and applications. Journal of Specialised Translation 18 (July 2012); pp.77-92. [PDF, 586KB]

(2012) Isabel Durán Muñoz, Gloria Corpas Pastor & Le An Ha: ProTermino: a comprehensive web-based terminological management tool based on knowledge representation. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 4pp. [PDF, 97KB]

(2012) Hiroyuki Kaji, Takashi Tsunakawa, & Yoshihiro Komatsubara: Improving compositional translation with comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.134-142. [PDF, 449KB]

(2012) Nikola Ljubešić, Špela Vintar, & Darja Fišer: Multi-word term extraction from comparable corpora by combining contextual and constituent clues.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.143-147. [PDF, 376KB]

(2012) Clara Inés López Rodríguez, Miriam Buendía Castro & Alejandro García Aragón: User needs to the test: evaluating a terminological knowledge base on the environment by trainee translators. Journal of Specialised Translation 18 (July 2012); pp.57-76. [PDF, 785KB]

(2012) Vesna Lušicky & Tanja Wissik: Terminology: don’t only collect it, use it!  [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 9pp. [PDF, 1257KB], presentation: 27 slides [PDF, 2179KB]

(2012) Alan K.Melby: Terminology in the age of multilingual corpora. Journal of Specialised Translation 18 (July 2012); pp.7-29. [PDF, 329KB]

(2012) Hidetsugu Nanba, Toshiyuki Takezawa, Kiyoko Uchiyama, & Akiko Aizawa: Automatic translation of scholarly terms into patent terms using synonym extraction techniques.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3447-3451. [PDF, 393KB]

(2012) Sergio Penkale & Andy Way: SmartMATE: an online end-to-end MT post-editing framework. AMTA-2012: Workshop on post-editing technology and practice. Proceedings, San Diego, October 28, 2012; 10pp. [PDF, 366KB]

(2012) Serge Sharoff: Beyond translation memories: finding similar documents in comparable corpora. [Aslib 2012] Translating and the Computer 34, 29-30 November 2012, One Birdcage Walk, London, UK; 7pp. [PDF, 145KB], presentation: 47 slides [PDF, 849KB]

(2012) Dan Ştefănescu: Mining for term translations in comparable corpora.  [BUCC 2012] The 5th Workshop on Building and Using Comparable Corpora: “Language Resources for Machine Translation in Less-Resourced Languages and Domains”,  LREC 2012 Workshop, 26 May 2012, Istanbul, Turkey; pp.98-103. [PDF, 616KB]

(2012) Andrejs Vasiļjevs, Markus Forsberg, Tatiana Gornostay, Dorte H.Hansen, Kristin M.Jóhannsdóttir, Krister Lindén, Gunn I.Lyse, Lene Offersgaard, Ville Oksanen, Sussi Olsen, Bolette S.Pedersen, Eiríkur Rögnvaldsson, Roberts Rozis, Inguna Skadiņa, & Koenraad De Smet: Creation of an open shared language resource repository in the Nordic and Baltic countries.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.1076-1083. [PDF, 1336KB]

(2012) Andrejs Vasiļjevs, Tatiana Gornostay, Inguna Skadiņa, Daiga Deksne, Raivis Skadiņš, & Mārcis Pinnis: Recent advances in the development and sharing of language resources and tools for Latvian [abstract].  In: Crosslingual Language Technology in service of an integrated multilingual Europe, 4-5 May 2012, University of Hamburg, Germany; pp.24-25. [PDF]

(2012) Cristina Vertan: Two approaches for integrating translation and retrieval in real applications. EACL Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra): Proceedings of the workshop, 23-24 April 2012, Avignon, France; pp. 59-64. [PDF, 226KB]

(2012) TTC: terminology extraction, translation tools and comparable corpora.  [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.97. [PDF, 74KB]

(2011) Takeshi Abekawa & Kyo Kageura: Using seed terms for crawling bilingual terminology lists on the Web. Translating and the Computer 33, 17-18 November 2011, London; 12pp. [PDF, 68KB]

(2011) Caroline Barrière & Pierre Isabelle: Searching parallel corpora for contextually equivalent terms. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.105-112. [PDF, 149KB]

(2011) Helena Blancafort, Ulrich Heid, Tatiana Gornostay, Claude Méchoulam, Béatrice Daille & Serge Sharoff. User-centred views on terminology extraction tools: usage scenarios and integration into MT and CAT tools. Tralogy, Paris, 3-4 March 2011;  15pp. [PDF, 201KB]

(2011) Fumiko Kano Glückstad: Application of classical psychology theory to terminological ontology alignment. Proceedings of the 8th international NLPSC workshop. Special theme: Human-machine interaction in translation, Copenhagen Business School, 20-21 August 2011; ed.Bernadette Sharp, Michael Zock, Michael Carl, Arnt Lykke Jakobsen (Copenhagen Studies in Language 41), Frederiksberg: Samfundslitteratur, 2011; pp.227-238. [PDF, 1187KB]

(2011) Masamichi Ideue, Kazahide Yamamoto, Masao Utiyama, & Eiichiro Sumita: A comparison of unsupervised bilingual term extraction methods using phrase tables. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.346-351. [PDF, 160KB]

(2011) Ian Jones: The NATO terminology programme and NATOTerm. Translating and the Computer 33, 17-18 November 2011, London; 22pp. [PDF, 1159KB]

(2011) Jan Niehues & Alex Waibel: Using Wikipedia to translate domain-specific terms in SMT. IWSLT 2011: Proceedings of the International Workshop on Spoken Language Translation, San Francisco, December 8-9, 2011, ed. Marcello Federico, Mei-Yuh Hwang, Margit Rödder, Sebastian Stüker; pp.230-237. [PDF, 271KB]

(2011) Seiji Okura, Yuji Yamamoto, Hajime Ito, Michael Kato, Miwako Shimazu, & Francis Bond: UTX 1.11, a simple and open user dictionary/terminology standard, and its effectiveness with multiple MT systems. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.587-594. [PDF, 95KB]

(2011) Špela Vintar & Darja Fišer: Enriching Slovene WordNet with domain-specific terms. Translation: Computation, Corpora,  Cognition 1 (1), December 2011; pp.29-44. [PDF, 631KB]

(2011) Petra Wolf, Ulrike Bernardi, Christian Federmann, & Sabine Hunsicker: From statistical term extraction to hybrid machine translation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.225-232. [PDF, 329KB]; presentation, 24 slides [PDF]

(2011) TTC: terminology extraction, translation tools and comparable corpora. (European Machine Translation Projects.) [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; p.357. [PDF, 47KB]

 (2010) Doris Albisser: Born global – how to communicate effectively in an international environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 2pp. [PDF, 476KB]; presentation: 27 slides [PDF, 553KB]

(2010) José João Almeida & Alberto Simões: Automatic parallel corpora and bilingual terminology extraction from parallel websites. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.50-55. [PDF, 257KB]

(2010) Sean Colbath: Terminology management for web monitoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; abstract

(2010) Tatiana Gornostay, Andrejs Vasiljevs, Signe Rirdance, & Roberts Rozis: Bridging the gap – EuroTermBank terminology delivered to users’ environment. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 7pp. [PDF, 794KB]

(2010) Tatiana Gornostay: TTC:terminology extraction, translation tools, comparable corpora. (European Community supported project.) Presented at EAMT 2010: 14th Annual conference of the European Association for Machine Translation, 28 May 2010, Saint-Raphaël, France. 1p. [PDF, 103KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

(2010) Lianhau Lee, Aiti Aw, Min Zhang, & Haizhou Li: EM-based hybrid model for bilingual terminology extraction from comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.639-646. [PDF, 114KB]

(2010) Juliet Macan: Translation and technology: is it a jungle? Translating and the Computer 32, 18-19 November 2010, London; 6pp. [PDF, 111KB]; presentation, 21 slides [PDF of PPT, 1866KB]

(2010) Christoph Rösener: Computational linguistics in the translator’s workflow—combining authoring tools and translation memory systems. Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing, Los Angeles, CA, June 2010; pp.1-6. [PDF, 2233KB]

(2010) Graham Russell: Moz: translation of structured terminology-rich text. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 4pp. [PDF, 157KB]

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

Thesauri

(2014) Lars Borin, Jens Allwood, & Gerard de Melo: Bring vs.MTRoget: evaluating automatic thesaurus translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.2115-2121. [PDF, 252KB]

(2011) Shoichi Yokoyama & Yuichi Takano: Investigation of translation disambiguation of verbs in patent sentences using word grouping. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.60-63. [PDF, 152KB]

Tokenization see Segmentation

Topic and comment see Discourse representation

Transcription [see also Transliteration]

(2015) Oliver Adams, Graham Neubig, Trevor Cohn & Steven Bird: Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. [IWSLT 2015] Proceedings of the International Workshop on Spoken Language Translation, December 3-4, 2015, Da Nang, Vietnam; pp.248-255. [PDF, 2.8MB]

(2014) ALST: linguistic and sensorial accessibility: technologies for voice-over and audio description. Ministerio de Economía y Competetitividad (Spain) Proyectos de investigación fundamental no orientada. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; p.128. [PDF, 227KB]

(2013) Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Nadir Durrani, Matthias Huck, Philipp Koehn, Thanh-Le Ha, Jan Niehues, Mohammed Mediani, Teresa Herrmann, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 128KB]

(2013) Kevin Kilgour, Christian Mohr, Michael Heck, Quoc Bao Nguyen, Van Huy Nguyen, Evgeniy Shin, Igor Tseyzer, Jonas Gehring, Markus Müller, Matthias Sperber, Sebastian Stüker & Alex Waibel: The 2013 KIT IWSLT speech-to-text systems for German and English. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 6pp. [PDF, 86KB]

(2013) Matt Post, Gaurav Kumar, Adam Lopez, Damianos Karakos, Chris Callison-Burch, & Sanjeev Khudanpur: Improved speech-to-text translation with the Fisher and Callhome Spanish-English speech translation corpus. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 192KB]

(2013) Xiaodong Shi, Yidong Chen, & Xiuping Huang: Key problems in conversion from simplified to traditional Chinese characters. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.287-293. [PDF, 273KB]

(2013) Thoudam Doren Singh: Taste of two different flavors: which Manipuri script works better for English-Manipuri language pair SMT stsrem? [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 11-18. [PDF, 162KB]

(2013) Joshua Winebarger, Bao Nguyen, Jonas Gehring, Sebastian Stüker, & Alex Waibel: The 2013  KIT Quaero speech-to-text system for French. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 297KB]

(2013) transLectures: transcription and translation of video lectures. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; p.447. [PDF, 248KB]

(2012) Mauro Cettolo, Christian Girardi, & Marcello Federico: WIT3: web inventory of transcribed and translated talks. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.261-268. [PDF, 197KB]

(2012) Heinrich Kolkhorst, Kevin Kilgour, Sebastian Stüker, & Alex Waibel: Evaluation of interactive user corrections for lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.217-221. [PDF, 635KB]

(2012) Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunach Cho, Teresa Herrmann, Rainer Kärgel & Alexander Waibel: The KIT translation systems for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.38-45. [PDF, 640KB]; presentation, 23 slides [PDF of PPT, 469KB]

(2012) Stephan Peitz, Simon Wiesler, Markus Nußbaum-Thom, & Hermann Ney: Spoken language translation using automatically transcribed text in training. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.276-283. [PDF, 623KB]

(2012) Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, & Hideki Kashioka: Factored recurrent neural network language model in TED lecture transcription. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.222-228. [PDF, 645KB]

(2012) Bridges across the language divide – EU-BRIDGE. [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.93. [PDF, 177KB]

(2012) transLectures: transcription and translation of video lectures.  [Project paper at] EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; p.204. [PDF, 142KB]

(2011) Sara Morrissey: Assessing three representation methods for sign language machine translation and evaluation. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.137-144. [PDF, 343KB]

(2010) Jalal Maleki: Syllable based transcription of English words into Perso-Arabic writing system. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 62-65. [PDF, 341KB]

(2010) M.G.Abbas Malik, Christian Boitet, & Pushpak Bhattacharyya: Finite-state scriptural translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.791-800. [PDF, 580KB]

(2010) M.G.Abbas Malik, Christian Boitet, Pushpak Bhattacharyya & Laurent Besacier: Weak translation problems – a case study of scriptural translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 6pp. [PDF, 241KB]

Transduction grammars

(2015) Meriem Beloucif, Markus Saers, & Dekai Wu: Improving semantic SMT via soft semantic role label constraints on ITG alignmens. MT Summit XV, October 30 – November 3, 2015, Miami, Florida, USA. Proceedings of MT Summit XV: vol.1: MT Researchers’ Track; pp.333-345. [PDF, 647KB]  

(2013) Trevor Cohn & Gholamreza Haffari: An infinite hierarchical Bayesian model of phrasal translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.780-790. [PDF, 432KB]

(2013) Katsuhiko Hayashi, Katsuhito Sudoh, Hajime Tsukada, Jun Suzuki, & Masaaki Nagata: Shift-reduce word reordering for machine translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1382-1386. [PDF, 424KB]

(2013) Peng Li, Yang Liu & Maosong Sun: Recursive autoencoders for ITG-based translation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.567-577. [PDF, 304KB]

(2013) Gideon Maillette de Buy Wenniger & Khalil Sima’an: A formal characterization of parsing word alignments by synchronous grammars with empirical evidence to the ITG hypothesis. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 58-67. [PDF, 150KB]

(2013) Markus Saers, Karteek Addanki & Dekai Wu: Combining top-down and bottom-up search for unsupervised induction of transduction grammars. [SSST-7]: Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Atlanta, Georgia, 13 June 2013; pp. 48-57. [PDF, 431KB]

(2013) Markus Saers, Karteek Addanki & Dekai Wu: Segmenting vs.chunking rules: unsupervised ITG induction via minim conditional description length. Proceedings of Recent Advances in Natural  Language Processing, Hissar, Bulgaria, 7-13 September 2013; pp.584-592. [PDF, 388KB]

(2013) Markus Saers & Dekai Wu: Unsupervised learning of bilingual categories in inversion transduction grammar induction. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 8pp. [PDF, 223KB]

(2013) Markus Saers, Karteek Addanki, & Dekai Wu: Unsupervised transduction grammar induction via minimum description length. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.67-73. [PDF, 418KB]

(2013) Markus Saers & Dekai Wu: Bayesian induction of bracketing inversion transduction grammars. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1158-1166. [PDF, 483KB]

(2013) Dekai Wu, Karteek Addanki, Markus Saers, & Meriem Beloucif: Learning to Freestyle: hip hop challenge-response induction via transduction rule segmentation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.102-112. [PDF, 398KB]

(2013) Dekai Wu, Karteek Addanki, & Markus Saers: Modeling hip hop challenge-response lyrics as machine translation. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.109-116. [PDF, 408KB]

(2013) Conghui Zhu, Taro Watanabe, Eiichiro Sumita, & Tiejun Zhao: Hierarchical phrase table combination for machine translation. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.802-810. [PDF, 1464KB]

(2012) Karteek Addanki, Chi-kui Lo, Markus Saers, & Dekai Wu: LTG vs. ITG coverage of cross-lingual verb frame alternations. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.295-302. [PDF, 283KB]

(2012) Colin Cherry, Robert C.Moore, & Chris Quirk: On hierarchical re-ordering and permutation parsing for phrase-based decoding.  WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.200-209. [PDF, 284KB]

(2012) Nan Duan, Mu Li, & Ming Zhou: Forced derivation tree based model training to statistical machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.445-454. [PDF, 761KB]

(2012) Peng Li, Yang Liu, & Maosung Sun: A beam search algorithm for ITG word alignment. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.673-682. [PDF, 471KB]

(2012) Hwidong Na & Jong-Hyeok Lee: Forest-to-string translation using binarized dependency forest for IWSLT 2012 OLYMPICS task. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.130-135. [PDF, 787KB]

(2012) Graham Neubig, Taro Watanabe, & Shinsuke Mori: Inducing a discriminative parser to optimize machine translation reordering. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.843-853. [PDF, 644KB]

(2012) Graham Neubig, Taro Watanabe, Shinsuke Mori, & Tatsuya Kawahara: Machine translation without words through substring alignment. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.165-174. [PDF, 596KB]

(2012) Markus Saers, Karteek Addanki, & Dekai Wu: From finite-state to inversion transductions: toward unsupervised bilingual grammar induction. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.2325-2339. [PDF, 222KB]

(2012) Ping Xu & Pascale Fung: Cross-lingual language modelling with syntactic reordering for low-resource speech recognition. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.766-776. [PDF, 275KB]

(2011) Marcello Federico: Aspects of tree-based statistical machine translation. Lecture at: Sixth Machine Translation Marathon, 5-10 September, Trento, Italy; 66 slides [PDF of PPT, 2112KB]

(2011) Shujian Huang, Stephan Vogel, & Jiajun Chen: Dealing with spurious ambiguity in learning ITG-based word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.379-383. [PDF, 182KB]

(2011) Yupeng Liu, Tiejun Zhao, & Sheng Li: Hypergraph training and decoding of system combination in SMT. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.570-577 [PDF, 235KB]

(2011) Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, & Tatsuya Kawahara: An unsupervised model for joint phrase alignment and extraction. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.632-641. [PDF, 782KB]

(2011) Markus Saers & Dekai Wu: Linear transduction grammars and zipper finite-state transducers.  [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.640-647. [PDF, 144KB]

(2011) Markus Saers, Dekai Wu, & Chris Quirk: On the expressivity of linear transductions. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.431-438. [PDF, 184KB]

(2011) Markus Saers & Dekai Wu: Principled induction of phrasal bilexica. [EAMT 2011]: proceedings of the 15th conference of the European Association for Machine Translation, 30-31 May 2011, Leuven, Belgium; eds. Mikel L.Forcada, Heidi Depraetere, Vincent Vandeghinste; pp.313-320. [PDF, 373KB]

(2011) Dekai Wu: Syntactic SMT and semantic SMT. Tutorial at MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.12-13. [PDF, 75KB]

(2011) Yinggong Zhao, Shujie Liu, Yangsheng Ji, Jiajun Chen, & Guodong Zhou: Transductive minimum error rate training for statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.641-648. [PDF, 214KB]

(2010) Hailong Cao & Eiichiro Sumita: Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21. [PDF, 65KB]

(2010) Yang Feng, Haitao Mi, Yang Liu, & Qun Liu: An efficient shift-reduce decoding algorithm for phrase-based machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.285-293. [PDF, 175KB]

(2010) Guillem Gascó Mora & Joan Andreu Sánchez Peiró: Syntax augmented inversion transduction grammars for machine translation [abstract]. CICLING 2010: 11th International Conference on Intelligent Text Processing and Computational Linguistics, March 21-27, 2010, Iaşi, Romania; 1p. [PDF, 62KB]

(2010) Zhongjun He, Yao Meng, & Hao Yu: Extending the hierachical phrase based model with maximum entropy based BTG. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 104KB]

(2010) Shujie Liu, Chi-Ho Li, & Ming Zhou: Discriminative pruning for discriminative ITG alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.316-324. [PDF, 972KB]

(2010) Gideon Maillette de Buy Wenniger, Maxim Khalilov, & Khalil Sima’an: A toolkit for visualizing the coherence of tree-based reordering with word-alignments. Fifth Machine Translation Marathon, 13-18 September, University of Le Mans; Prague Bulletin of Mathematical Linguistics, no.94, September 2010; pp.97-106. [PDF, 138KB]; presentation [PDF, 762KB]

(2010) Markos Mylonakis & Khalil Sima’an: Learning probabilistic synchronous CFGs for phrase-based translation. CoNLL-2010: Fourteenth Conference on Computational Natural Language Learning, Proceedings of the conference, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp.117-125. [PDF, 151KB]

(2010) Adam Pauls, Dan Klein, David Chiang, & Kevin Knight: Unsupervised syntactic alignment with inversion traduction grammars. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.118-126. [PDF, 766KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Linear inversion transduction grammar alignments as a second translation path. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 167-171. [PDF, 111KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: A systematic comparison between inversion transduction grammar and linear transduction grammar for word alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.10-18. [PDF, 279KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: Word alignment with stochastic bracketing linear inversion transduction grammar.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.341-344. [PDF, 97KB]

(2010) Markus Saers, Joakim Nivre, & Dekai Wu: A systematic comparison between inversion transduction grammar and linear transduction grammar for word alignment. SSST-4: Proceedings of Fourth Workshop on Syntax and Structure in Statistical Translation, ed. Dekai Wu. COLING 2010/SIGMT Workshop, 23rd International Conference on Computational Linguistics, Beijing, China, 28 August 2010; pp.10-18. [PDF, 279KB]

(2010) Anders Søgaard: Can inversion transduction grammars generate hand alignments? EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 6pp. [PDF, 602KB]; presentation by Michael Carl: 19 slides [PDF, 122KB]

(2010) Jinsong Su, Yang Liu, Haitao Mi, Hongmei Zhao, Yajuan Lü, & Qun Liu: Dependency-based bracketing transduction grammar for statistical machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1185-1193. [PDF, 226KB]

Transliteration

(2014) Nadir Durrani & Philipp Koehn: Improving machine translation via triangulation and transliteration. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.71-78. [PDF, 452KB]

(2014) Mitesh M.Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah, & Pushpak Bhattacharyya: When transliteration met crowdsourcing: an empirical study of transliteration via crowdsourcing using efficient, non-redundant and fair quality control. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.196-202. [PDF, 180KB]

(2014) Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah, & Pushpak Bhattacharyya: Śata-Anuvādak: tackling multiway translation of Indian languages. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 1781-1787. [PDF, 233KB]

(2014) Keith Miller, Linda Moreau, & Sherri Condon: Handling entities in MT/CAT/HLT. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; Tutorials, 88 slides

(2014) Mircea Petic & Daniela Gîfu: Transliteration and alignment of parallel texts from Cyrillic to Latin. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1819-1823. [PDF, 462KB]

(2014) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Bilingual dictionary construction with transliteration filtering. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1013-1017. [PDF, 164KB]

(2013) Mahmoud Azab, Houda Bouamor, Behrang Mohit, & Kemal Oflazer: Dudley North visits North London: learning when to transliterate to Arabic. [NAACL-HLT 2013] The 2013 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 9-14 June 2013, Atlanta, Georgia; pp.439-444. [PDF, 148KB]

(2013) Alexey Borisov,Jacob Dlougach & Irina Galinskaya: Yandex School of Data Analysis machine translation systems for WMT13. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.99-103. [PDF, 125KB]

(2013) Masato Hagiwara & Satoshi Sekine: Accurate word segmentation using transliteration and language model projection. ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics,  Short papers, Sofia, Bulgaria, August 4-9 2013; pp.183-189. [PDF, 548KB]

(2013) Philippe Langlais: Mapping source to target strings without alignment by analogical learning: a case study with transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.684-689. [PDF, 221KB]

(2013) Tingting Li, Tiejun Zhao, Andrew Finch, & Chunyue Zhang: A tightly-coupled unsupervised clustering and bilingual alignment model for transliteration.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Short papers, Sofia, Bulgaria, August 4-9 2013; pp.393-398. [PDF, 698KB]

(2013) John Richardson, Toshiaki Nakazawa, & Sadao Kurohashi: Robust transliteration mining from comparable corpora with bilingual topic models. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.261-269. [PDF, 460KB]

(2013) Hassan Sajjad, Svetlana Smekalova, Nadir Durrani, Alexander Fraser & Helmut Schmid: QCRI-MES submission at WMT13: using transliteration mining to improve statistical machine trans­lation. WMT 2013: 8th Workshop on Statistical Machine Translation, Proceedings of the Workshop, August 8-9, 2013, Sofia, Bulgaria; pp.219-224. [PDF, 205KB]

(2013) Nasredine Semmar & Houda Saadane: Using transliteration of proper names from Arabic to Latin script to improve English-Arabic word alignment. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1022-1026. [PDF, 92KB]

(2013) Katsuhito Sudoh, Shinsuke Mori & Masaaki Nagata: Noise-aware character alignment for bootstrapping statistical machine transliteration from bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.204-209. [PDF, 617KB]

(2013) Keiko Taguchi, Andrew Finch, Seiichi Yamamoto, & Eiichiro Sumita: Inducing Romanization systems. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.19-26. [PDF, 700KB]

(2012) see Proceedings of NEWS 2012 Named Entities Workshop, July 12, 2012, Jeju, Republic of Korea.

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) Walid Aransa, Holger Schwenk, & Loic Barrault: Semi-supervised transliteration mining from parallel and comparable corpora. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp. 185-192. [PDF, 650KB]

(2012) Tina Bögel: Urdu-Roman transliteration via finite state transducers. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.25-29. [PDF, 147KB]

(2012) Chris Irwin Davis: Tajik-Farsi Persian transliteration using statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3988-3995. [PDF, 590KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Hindi and Marathi to English NE transliteration tool using phonology and stress analysis. Proceedings of COLING 2012: Demonstration Papers, Mumbai, December 2012; pp.111-118. [PDF, 301KB]

(2012) M.L.Dhore, S.K.Dixit, & R.M.Dhore: Optimizing transliteration for Hindi/Marathi to English using only two weights. COLING: Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology, December 2012; pp. 31-47. [PDF, 293KB]

(2012) Thoudam Doren Singh: Bidirectional Bengali script and Meetei Mayek: transliteration of web based Manipuri news corpus. COLING 2012: Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language processing (SANLP), Mumbai, December 2012; pp.181-189. [PDF, 591KB]

(2012) Andrew Finch, Ohnmar Htun, & Eiichiro Sumita: The NICT translation system for IWSLT 2012. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.121-125. [PDF, 621KB]

(2012) Jennifer Flather: Sharpening the claws on CAT tools: increase quality & production, maximize limited resources [abstract]. TA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; p. [PDF, 78B] ; presentation, 24 slides [PDF of PPT, 1680KB]

(2012) Utpal Garain, Arjun Das, David S.Doermann, Douglas D.Oard: Leveraging statistical transliteration for dictionary-based English-Bengali CLIR of OCR’d text. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.339-348. [PDF, 235KB]

(2012) Jagadeesh Jagarlamudi & Hal Daumé III: Regularized interlingual projections: evaluation on multilingual transliteration. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.12-23. [PDF, 700KB]

(2012) Brant N.Kay & Brian C.Rineer: Approaches to Arabic name transliteration and matching in the DataFlux quality knowledge base. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp. 32-37. [PDF, 607KB]

(2012) Gurpreet Singh Lehal & Tejinder Singh Saini: Development of a complete Urdu-Hindi transliteration system. Proceedings of COLING 2012: Posters, Mumbai, December 2012; pp.643-652. [PDF,357KB]

(2012) Ram Prakash H: Quillpad multilingual predictive transliteration system. COLING 2012: Proceedings of the Second Workshop on Advances in Text Input Methods (WTIM 2), Mumbai, December 2012; pp.107-113. [PDF, 310KB]

(2012) Joel Ross: The conundrum of new online languages: translating Arabic chat [abstract]. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 1p. [PDF, 113KB]; presentation, 24 slides [PDF of PPT, 2189KB]

(2012) Houda Saadane, Ouafa Benterki, Nasredine Semmar, & Christian Fluhr: Using Arabic transliteration to improve word alignment from French-Arabic parallel corpora. AMTA-2012: Fourth workshop on computational approaches to Arabic script-based languages. Proceedings, San Diego, November 1, 2012; pp.38-46. [PDF,

(2012) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: A statistical model for unsupervised and semi-supervised transliteration mining. [ACL 2012] Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012; pp.469-477. [PDF, 257KB]

(2012) Hisami Suzuki & Jianfeng Gao: A unified approach to transliteration-based text input with online spelling correction. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.609-618. [PDF, 432KB]

(2012) Katherine M.Young, Jeremy Gwinnup, & Joshua Reinhart: Reversing the Palladius mapping of Chinese names in Russian text. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 5pp. [PDF, 202KB]; abstract, 1p. [PDF, 74KB]; presentation, 37 slides [PDF of PPT, 1386KB]

(2011) see  Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011

(2011) Rafael E.Banchs: Comparative evaluation of Spanish segmentation strategies for Spanish-Chinese transliteration. [IJCNLP 2011] Proceedings of the 2011 Named Entities Workshop, Chiang Mai, Thailand, November 12, 2011; pp.41-48. [PDF, 198KB]

(2011) Aditya Bhargava & Grzegorz Kondrak: How do you pronounce your name? Improving G2P with transliterations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.399-408. [PDF, 271KB]

(2011) Raphael Cohen, Yoav Goldberg, & Michael Elhadad: Transliterated pairs acquisition in medical Hebrew. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 36KB];  presentation: 37 slides [PDF of PPT, 315KB]

(2011) Manaal Faruqui, Prasenjit Majumder, & Sebastian Padó: Soundex-based translation correction in Urdu-English cross-language information retrieval. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.25-29. [PDF, 326KB]

(2011) Ali El-Kahky, Kareem Darwish, Ahmed Saad Aldein, Mohamed Abd El-Wahab, Ahmed Hefny, & Waleed Ammar: Improved transliteration mining using graph reinforcement. [EMNLP 2011] Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, Edinburgh, Scotland, UK, July 27-31, 2011; pp.1384-1393. [PDF, 939KB]

(2011) Masato Hagiwara & Satoshi Sekine: Latent class transliteration based on source language origin. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.53-57. [PDF, 330KB]

(2011) Yun Huang, Min Zhang, & Chew Lim Tan: Nonparametric Bayesian machine transliteration with asynchronous adaptor grammars. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.534-539. [PDF, 162KB]

(2011) Ea-Ee Jan, Niyu Ge, Shih-Hsiang Lin, & Berlin Chen: An effective and robust framework for transliteration exploration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1332-1340. [PDF, 204KB]

(2011) Peter Nabende: Mining transliterations from Wikipedia using dynamic Bayesian networks. [RANLP 2011] Proceedings of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 12-14 September 2011; pp.385-391. [PDF, 580KB]

(2011) Michael Paul, Andrew Finch, Paul R.Dixon, & Eiichiro Sumita: Dialect translation: integrating Bayesian co-segmentation models with pivot-based SMT. [EM NLP 2011] DIALECTS2011: Proceedings of the First Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, Edinburgh, Scotland, UK, July 31, 2011; pp.1-9. [PDF, 210KB]

(2011) Hassan Sajjad, Alexander Fraser, & Helmut Schmid: An algorithm for unsupervised transliteration mining with an application to word alignment. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.430-439. [PDF, 171KB]

(2011) Hassan Sajjad, Nadir Durrani, Helmut Schmid, & Alexander Fraser: Comparing two techniques for learning transliteration models using a parallel corpus. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.129-137. [PDF, 605KB]

(2011) Min Zhang, Xiangyu Duan, Ming Liu, Yunqing Xia, & Haizhou Li: Joint alignment and artificial data generation: an empirical study of pivot-based machine transliteration. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.1207-1215. [PDF, 319KB]

(2010) Niraj Aswani & Robert Gaizauskas: English-Hindi transliteration using similarity metrics. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1786-1793. [PDF, 530KB]

(2010) Avinesh PVS & Ankur Parikh: Phrase-based transliteration system with simple heuristics. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.81-84. [PDF, 67KB]

(2010) Kareem Darwish: Transliteration mining with phonetic conflation and iterative training. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.39-47. [PDF, 465KB]

(2010) Amitava Das, Tanik Saikh, Tapabrata Mondal, Asif Ekbal, Sivaji Bandyopadhyay: English to Indian languages machine transliteration system at NEWS 2010. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.71-75. [PDF, 120KB]

(2010) Nadir Durrani, Hassan Sajjad, Alexander Fraser, & Helmut Schmid: Hindi-to-Urdu machine translation through transliteration. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.465-474. [PDF, 313KB]

(2010) Andrew Finch & Eiichiro Sumita: A Bayesian model of bilingual segmentation for transliteration. Proceedings of the 7th International Workshop on Spoken Language Translation, 2-3 December 2010, Paris, France; pp.259-266. [PDF, 511KB]

(2010) Andrew Finch & Eiichiro Sumita: Transliteration using a phrase-based statistical machine translation system to re-score the output of a joint multigram model. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.48-52. [PDF, 197KB]

(2010) Ann Irvine, Mike Kayser, Zhifei Li, Wren Thornton, & Chris Callison-Burch: Integrating output from specialized modules in machine translation: transliterations in Joshua. Fourth Machine Translation Marathon “Open Source Tools for Machine Translation”, 25-30 January, Dublin, Ireland; Prague Bulletin of Mathematical Linguistics, no.93, January 2010; pp.107-116. [PDF, 152KB]

(2010) Ann Irvine, Chris Callison-Burch, & Alexandre Klementiev: Transliterating from all languages. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 8pp. [PDF, 282KB]

(2010) Md.Zahurul Islam, Jörg Tiedemann & Andreas Eisele: English to Bangla phrase-based machine translation.  EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 601KB]

(2010) Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma, Aditya Bhargava, Qing Dou, Mi-Young Kim, & Grzegorz Kondrak: Transliteration generation and mining with limited training resources. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.39-47. [PDF, 143KB]

(2010) Mitesh M.Khapra, A.Kumaran, & Pushpak Bhattacharyya: Everbody loves a rich cousin: an empirical study of transliteration through bridge languages.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.420-428. [PDF, 157KB]

(2010) Amit Kirschenbaum & Shuly Wintner: A general method for creating a bilingual transliteration dictionary. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.273-276. [PDF, 361KB]

(2010) A.Kumaran, Mitesh M.Khapra, & Haizhou Li: Report of NEWS 2010 transliteration mining shared task. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.21-28. [PDF, 598KB]

(2010) A.Kumaran, Mitesh M.Khapra, & Haizhou Li: Whitepaper of NEWS 2010 shared task on transliteration mining. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.29-38. [PDF, 677KB]

(2010) Haizhou Li, A.Kumaran, Min Zhang, & Vladimir Pervouchine: Report of NEWS 2010 transliteration generation shared task. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.1-11. [PDF, 411KB]

(2010) Haizhou Li, A.Kumaran, Min Zhang, & Vladimir Pervouchine: Whitepaper of NEWS 2010 shared task on transliteration generation. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.12-20. [PDF,  182KB]

(2010) Peter Nabende: Mining transliterations from Wikipedia using pair HMMs. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.76-80. [PDF, 145KB]

(2010) Sara Noeman & Amgad Madkour: Language independent transliteration mining system using finite state automata framework. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.57-61. [PDF, 120KB]

(2010) Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim, & Richard Sproat: A Python toolkit for universal transliteration. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2897-2901. [PDF, 576KB]

(2010) Yan Song, Chunyu Kit, & Hai Zhao: Reranking with multiple features for better transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.62-65. [PDF, 395KB]

(2010) Karthik Visweswariah, Vijil Chenthamarakshan, & Nandakishore Kambhatla: Urdu and Hindi: translation and sharing of linguistic resources. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1283-1291. [PDF, 217KB]

(2010) Chai Wutiwiwatchai & Ausdang Thangthai: Syllable-based Thai-English machine transliteration. NEWS 2010: Proceedings of the 2010 Named Entities Workshop, ACL 2010, Uppsala, Sweden, 16 July 2010; pp.66-70. [PDF, 189KB]

(2010) Dong Yang, Paul Dixon, & Sadaoki Furui: Jointly optimizing a two-step conditional random field model for machine transliteration and its fast decoding algorithm. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.275-280. [PDF, 115KB]

 (2010) Gae-won You, Seung-won Hwang, Young-In Song, Long Jiang, & Zaiqing Nie: Mining name translations from entity graph mapping.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.430-439. [PDF, 1542KB]

(2010) Min Zhang, Xiangyu Duan, Vladimir Pervouchine, & Haizhou Li: Machine transliteration: leveraging on third languages. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1444-1452. [PDF, 259KB]

Tree Adjoining Grammar (TAG)

 (2014) Piyush Dungarwal, Rajen Chatterjee, Abhijit Mishra, Anoop Kunchukuttan, Ritesh Shah, & Pushpak Bhattacharyya: The IIT Bombay HindiÛEnglish translation system at WMT 2014. [WMT 2014] Proceedings of the Ninth Workshop on Statistical Machine Translation, Baltimore, Maryland USA, June 26-27, 2014; pp.90-96. [PDF, 222KB]

(2011) Yang Liu, Qun Liu, & Yajuan Lü: Adjoining tree-to-string translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1278-1287. [PDF, 159KB]

(2011) Wei-Yun Ma & Kathleen McKeown: System combination for machine translation based on text-to-text generation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.546-553. [PDF,

(2010) Rejwanul Haque, Sudip Kumar Naskar, Antal van den Bosch, & Andy Way: Supertags as source language context in hierarchical phrase-based SMT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 196KB]

(2010) Andreas Maletti: A tree transducer model for synchronous tree-adjoing grammars. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Conference proceedings; pp.1067-1076. [PDF, 170KB]

 (2010) Libin Shen, Bing Zhang, Spyros Matsoukas, Jinxi Xu, & Ralph Weischedel: Statistical machine translation with a factorized grammar.  [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.616-625. [PDF, 147KB]

Unification grammar

(2011) Philip Williams & Philipp Koehn: Agreement constraints for statistical machine translation into German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.217-226. [PDF, 146KB]

Universals see Language universals, Semantic analysis and representation

Unknown and rare words (see also Names and named entities)

(2014) Burak Aydın & Arzucan Özgür: Expanding machine translation training data with an out-of-domain corpus using language modeling based vocabulary saturation. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.180-192. [PDF, 523KB]

(2014) Kurt Eberle: AutoLearn<Word>. Translating and the Computer 36: proceedings. Asling: International Society for Advancement in Language Technology, 27-28 November 2014; pp.145-154. [PDF, 446KB]

(2014) Miquel Esplà-Gomis, Víctor M.Sánchez-Cartegna, Felipe Sánchez-Martínez, Rafael C.Carrasco, Mikel L.Forcada, & Juan Antonio Pérez-Ortiz: An efficient method to assist non-expert users in extending dictionaries by assigning stems and inflectional paradigms to unknknown words. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.19-26. [PDF, 311KB]

(2014) Mark Fishel & Rico Sennrich: Handling technical OOVs in SMT.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.159-162. [PDF, 318KB]

(2014) Joern Wuebker, Hermann Ney, Adrià Martínez-Villaronga, Adrià Giménez, Alfons Juan, Christophe Servan, Marc Dymetman, Shachar Mirkin: Comparison of data selection techniques for the translation of video lectures. AMTA 2014: proceedings of the eleventh conference of the Association for Machine Translation in the Americas, Vancouver, BC, October 22-26; pp.193-207. [PDF, 627KB]

(2013) Sankaranarayanan Ananthakrishnan, Wei Chen, Rohit Kumar, & Dennis Mehay: Source aware phrase-based decoding for robust conversational spoken language translation. [IWSLT 2013] Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, Dec.5-6, 2013; 7pp. [PDF, 222KB]

(2013) Ann Irvine, Chris Quirk, & Hal Daumé III: Monolingual marginal matching for translation model adaptation. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.1077-1088. [PDF, 247KB]

(2013) Majid Razmara, Maryam Siahbani, Gholamreza Haffari, & Anoop Sarkar: Graph propagation for paraphrasing out-of-vocabulary words in statistical machine translation.  ACL-2013: Proceedings of the 51st Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 4-9 2013; pp.1105-1115. [PDF, 325KB]

(2012) Khan Md.Anwarus Salam, Setsuo Yamada, & Tetsuro Nishino: Sublexical translations for low-resource language. COLING 2012: Proceedings of the Workshop on Machine Translation and Parsing in Indian Languages (MTPIL-2012), Mumbai, December 2012; pp.39-51. [PDF, 514KB]

(2012) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Domain adaptation in SMT of user-generated forum content guided by OOV word reduction: normalization and/or supplementary data? EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.169-176. [PDF, 160KB]

(2012) Pratyush Banerjee, Sudip Kumar Naskar, Johann Roturier, Andy Way, & Josef van Genabith: Translation quality-based supplementary data selection by incremental update of translation models. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.149-165. [PDF, 149KB]

(2012) Julien Bourdaillet & Philippe Langlais: Identifying infrequent translations by aligning non parallel sentences. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 331KB]

(2012) Qing Dou & Kevin Knight: Large scale decipherment for out-of-domain machine translation. EMNLP-CoNLL 2012: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the conference, July 12-14, Jeju Island, Korea; pp.266-275. [PDF, 578KB]

(2012) Lluís Formiga, Adolfo Hernández, José B.Mariño, & Enric Monte: Improving English to Spanish out-of-domain translations by morphology generalization and generation. AMTA-2012: Monolingual machine translation-2012 workshop. Proceedings, San Diego, November 1, 2012; pp.6-16. [PDF, 274KB]

(2012) Barry Haddow & Philipp Koehn: Analysing the effect of out-of-domain data on SMT systems. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.422-432. [PDF, 155KB]

(2012) Philipp Koehn & Barry Haddow: Interpolated backoff for factored translation models. AMTA-2012: the Tenth Biennial Conference of the Association for Machine Translation in the Americas. Proceedings, San Diego, CA, October 28 – November 1, 2012; 10pp. [PDF, 239KB]

(2012) Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, & Alex Waibel: The Karlsruhe Institute of Technology translation systems for the WMT 2012. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.349-355. [PDF, 149KB]

(2012) Rohit Prasad, Rohit Kumar, Sankaranarayanan Ananthakrishnan, Wei Chen, Sanjika Hewavitharana, Matthew Roy, Frederick Choi, Aaron Challenner, Enoch Kan, Arvid Neelakantan, & Prem Natarajan: Active error detection and resolution for speech-to-speech translation. IWSLT-2012: 9th International Workshop on Spoken Language Translation, Hong Kong, December 6th-7th, 2012; pp.150-157. [PDF, 1156KB]

(2012) V.M.Sánchez-Cartagena, M.Esplà-Gomis, F.Sánchez-Martínez, & J.A.Pérez-Ortiz: Choosing the correct paradigm for unknown words in rule-based machine translation systems.  In: Free/Open-Source Rule-Based Machine Translation, ed.Cristina España-Bonet and Aarne  Ranta. Proceedings of a Workshop held in Gothenburg, 14-15 June, 2012; pp.27-39. [PDF, 502KB]

(2012) João Silva, Luísa Coheur, Ângela Costa, & Isabel Trancoso: Dealing with unknown words in statistical machine translation.  LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3977-3981. [PDF, 406KB]

(2012) Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, & Ondřej Bojar: Selecting data for English-to-Czech machine translation. WMT 2012: 7th Workshop on Statistical Machine Translation. Proceedings of the workshop, June 7-8, 2012, Montréal, Canada; pp.374-381. [PDF, 110KB]

(2011) Hal Daumé & Jagadeesh Jagarlamudi: Domain adaptation for machine translation by mining unseen words. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.407-412. [PDF, 80KB]

(2011) Chris Dyer, Kevin Gimpel, Jonathan H.Clark, & Noah A.Smith: The CMU-ARK German-English translation system. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.337-343. [PDF, 134KB]

(2011) Jun Hatori & Hisami Suzuki: Japanese pronunciation prediction as phrasal statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.120-128. [PDF, 580KB]

(2011) Shujie Liu, Chi-Ho Li & Ming Zhou: Statistic machine translation boosted with spurious word deletion. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.72-79. [PDF, 239KB]

(2011) Paul Maergner, Ian Lane, & Alex Waibel: Unsupervised vocabulary selection for domain-independent simultaneous lecture translation. MT Summit XIII: the Thirteenth Machine Translation Summit [organized by the] Asia-Pacific Association for Machine Translation (AAMT), 19-23 September 2011, Xiamen, China; pp.89-96. [PDF, 926KB]

 (2011) Emmanuel Prochasson & Pascale Fung: Rare word translation extraction from aligned comparable documents. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1327-1335. [PDF, 157KB]

 (2011) Sara Stymne: Spell checking techniques for replacment of unknown words and data cleaning for Haitian Creole SMS translation. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.470-477. [PDF, 107KB]

(2010) Wilker Aziz, Marc Dymetman, Shachar Mirkin, Lucia Specia, Nicola Cancedda, & Ido Dagan: Learning an expert from human annotations in statistical machine translation: the case of out-of-vocabulary words. EAMT 2010: Proceedings of the 14th Annual conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France. Proceedings ed.Viggo Hansen and François Yvon; 8pp. [PDF, 585KB]; presentation: 21 slides [PDF, 777KB]

(2010) Nicola Bertoldi, Mauro Cettolo, & Marcello Federico: Statistical machine translation of texts with misspelled words. NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Proceedings… June 2-4, 2010, Los Angeles, California; pp.412-419. [PDF, 160KB]

(2010) Rashmi Gangadharaiah, Ralf D.Brown, & Jaime Carbonell: Monolingual distributional profiles for word substitution in machine translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.320-328. [PDF, 605KB]

(2010) Chung-chi Huang, Ho-ching Yen, Shih-ting Huang, & Jason S.Chang: Using sublexical translations to handle the OOV problem in MT. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp. [PDF, 132KB]

 (2010) Laura Elisabeth Jehl: Machine translation for Twitter. Master of Science, University of Edinburgh, 2010. iv, 53pp. [PDF, 398KB]

(2010) Marion Potet, Laurent Besacier, & Hervé Blanchon: The LIG machine translation system for WMT 2010.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 161-166. [PDF, 81KB]

(2010) Germán Sanchis-Trilles, Jesús Andrés-Ferrer, Guillem Gascó, Jesús González-Rubio, Pascual Martínez-Gómez, Martha-Alicia Rocha, Joan-Andreu Sánchez, & Francisco Casacuberta: UPV-PRHLT English-Spanish system for WMT10. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 172-176. [PDF, 90KB]

(2010) Xabier Saralegi & Maddalen Lopez de Lacalle: Dictionary and monlingual corpus-based query translation for Basque-English CLIR. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.1353-1358. [PDF, 284KB]

 (2010) Sara Stymne, Maria Holmqvist, & Lars Ahrenberg: Vs and OOVs: two problems for translation between German and English. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 183-188. [PDF, 105KB]

(2010) Yuejie Zhang, Yang Wang, Lei Cen, Yanxia Su, Cheng Jin, Xiangyang Xue, & Jianping Fan: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. Coling 2010: 23rd International Conference on Computational Linguistics, 23-27 August 2010, Beijing International Convention Center, Beijing, China, Posters volume; pp.1435-1443. [PDF, 551KB]

Valency see Case grammar

Verbs and verb phrases

(2014) Marcin Junczys-Dowmunt & Bruno Pouliquen: SMT of German patents at WIPO: decompounding and verb structure pre-reordering.  Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.217-220. [PDF, 369KB]

(2014) Valia Kordoni & Iliana Simova: Multiword expressions in machine translation. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp.1208-1211. [PDF, 110KB]

(2014) Sharid Loáiciga, Thomas Meyer, & Andrei Popescu-Belis: English-French verb phrase alignment in Europarl for tense translation modeling. LREC 2014: Ninth International Conference on Language Resources and Evaluation, May 26-31, 2014 Harpa Concert Hall and Conference Center, Reykjavik, Iceland; ed. Nicoletta Calzolari et al.; pp. 674-681. [PDF, 179KB]

(2013) Ahmed Hamdi, Rahma Boujelbane, Nizar Habash, & Alexis Nasr: The effects of factorizing root and pattern mapping in translating between Tunisian Arabic and standard Arabic. Proceedings of the XIV Machine Translation Summit, Nice, September 2-6, 2013; ed. K.Sima’an, M.L.Forcada, D.Grasmick, H.Depraetere, A.Way; pp.125-134. [PDF, 672KB]

(2013) Thomas Meyer, Cristina Grisot, & Andrei Popescu-Belis: Detecting narrativity to improve English to French translation of simple past verbs. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.33-42. [PDF, 307KB]

(2013) Karl Pichotta & John DeNero: Identifying phrasal verbs using many bilingual corpora. [EMNLP 2013] Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA, 18-21 October 2013; pp.636-646. [PDF, 267KB]

(2013) Carlos Ramish, Laurent Besacier, & Alexander Kobzar: How hard is it to automatically translate phrasal verbs from English to French?  [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.53-61. [PDF, 303KB]

(2013) Annette Rios & Anne Göhring: Machine learning disambiguation of Quechua verb morphology. Proceedings of the Second Workshop on Hybrid Approaches to Translation, Sofia, Bulgaria, August 8, 2013; pp.13-18. [PDF, 226KB]

(2013) Iliana Simova & Valia Kordoni: Improving English-Bulgarian statistical machine translation by phrasal verb treatment. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.62-71. [PDF, 279KB]

(2013) Zdenka Uresova, Jana Sindlerova, Eva Fucikova, & Jan Hajic: An analysis of annotation of verb-noun idiomatic combinations in a parallel dependency corpus. [NAACL HLT 2013] Proceedings of the 9th Workshop on Multiword Expressions (MWE 2013), Atlanta, Georgia, 13-14 June, 2013; pp.58-63. [PDF, 115KB]

(2013) Eric Wehrli & Luka Nerima: Anaphora resolution, collocations and translation. [MT Summit XIV] Proceedings of the Workshop on Multi-word Units in Machine Translation and Translation Technology, Nice, September 3, 2013; pp.12-17. [PDF, 202KB]

(2012) Karteek Addanki, Chi-kui Lo, Markus Saers, & Dekai Wu: LTG vs. ITG coverage of cross-lingual verb frame alternations. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.295-302. [PDF, 283KB]

(2012) Karunesh Kumar Arora & R.Mahesh K.Sinha: Improving statistical machine translation through co-joining parts of verbal constructs in English-Hindi translation. SSST-6, Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, Jeju, Republic of Korea, 12 July 2012; pp.95-101. [PDF, 164KB]

(2012) Sanjay Chatterji, Nabanita Datta, Arnab Dhar, Biswanath Barik, Sudeshna Sarkar, & Anupam Basu: Repairing Bengali verb chunks for improved Bengali to Hindi machine translation. COLING 2012: Proceedings of the 10th Workshop on Asian Language Resources, Mumbai, December 2012; pp.65-74. [PDF, 610KB]

(2012) Anita Gojun & Alexander Fraser: Determining the placement of German verbs in English-to-German SMT. [EACL 2012] Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, Avignon, France, April 23-27, 2012; pp. 726-735. [PDF, 138KB]

(2012) Aingeru Mayor, Mans Hulden, & Gorka Labaka: Developing an open-source FST grammar for verb chain transfer in a Spanish-Basque MT system. Proceedings of the 10th International Workshop on Finite State Methods and Natural Language Processing, Donostia-San Sebastián, July 23-25, 2012; pp.65-69. [PDF, 157KB]

(2012) Christian M.Meyer & Iryna Gurevych: To exhibit is not to loiter: a multilingual, sense-disambiguated Wiktionary for measuring verb similarity. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1763-1780. [PDF, 771KB]

(2012) Veronika Vincze: Light verb constructions in the SzegedParallelFX English-Hungarian parallel corpus. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.2381-2388. [PDF, 349KB]

(2011) Kfir Bar & Nachum Dershowitz: Using verb paraphrases for Arabic-to-English example-based translation. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 23 January, 2011; 3pp. [PDF, 94KB];  presentation: 27 slides [PDF of PPT, 1356KB]

(2011) Ankur Gandhe, Rashmi Gangadharaiah, Karthik Visweswariah, & Ananthakrishnan Ramanathan: Handling verb phrase morphology in highly inflected Indian languages for machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-13, 2011; pp.111-119. [PDF, 256KB]

(2011) Alex Rudnick: A resource-light approach to learning verb valencies. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 26 January, 2011; 2pp. [PDF, 54KB]; presentation: 31 slides [PDF of PPT, 697KB]

(2011) Philip Williams & Philipp Koehn: Agreement constraints for statistical machine translation into German. [WMT 2011] Proceedings of the 6th Workshop on Statistical Machine Translation, Edinburgh, Scotland, UK, July 30-31, 2011; pp.217-226. [PDF, 146KB]

(2011) Xianchao Wu, Takuya Matsuzaki, Jun’ichi Tsujii: Effective use of function words for rule generalization in forest-based translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.22-31. [PDF, 857KB]

(2011) Shoichi Yokoyama & Yuichi Takano: Investigation of translation disambiguation of verbs in patent sentences using word grouping. [MT Summit XIII] 4th Workshop on Patent Translation, Shoichi Yokoyama (ed,), Xiamen, China, September 23, 2011; pp.60-63. [PDF, 152KB]

(2010) Arianna Bisazza & Marcello Federico: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation.  ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 235-243. [PDF, 372KB]

(2010) Lynne Cahill: A syllable-based approach to verbal morphology in Arabic. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 19-26. [PDF, 747KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Improving Arabic-to-English statistical machine translation by reordering post-verbal subjects for alignment. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.178-183. [PDF, 108KB]

(2010) Marine Carpuat, Yuval Marton, & Nizar Habash: Reordering matrix post-verbal subjects for Arabic-to-English SMT. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 287KB] 

(2010) Dana Dannélls & John J.Camilleri: Verb morphology of Hebrew and Maltese – towards an open source type theoretical resource grammar in GF. LREC 2010: Workshop on Language Resources and Human Language Technology for Semitic Languages, Valletta, Malta, 17 May 2010; pp. 57-61. [PDF, 326KB]

(2010) Lucia Silva: Fine-tuning in Brazilian Portuguese-English statistical transfer machine translation: verbal tenses.  NAACL HLT 2010: Human Language Technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Student Research Workshop… June 2, 2010, Los Angeles, California; pp.58-63. [PDF, 251KB]

(2010) Lin Sun, Anna Korhonen, Thierry Poibeau, & Cédric Messiant: Investigating the cross-linguistic potential of VerbNet-style classification. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp. 1056-1064. [PDF, 227KB]

(2010) Shumin Wu, Jinho D.Choi, & Martha Palmer: Detecting cross-lingual semantic similarity using parallel PropBanks. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 9pp. [PDF, 274KB]

Word association

(2013) Beata Beigman Klebanov & Michael Flor: Associative texture is lost in translation. Proceedings of the Workshop on Discourse in Machine Translation (DiscoMT), Sofia, Bulgaria, August 9, 2013; pp.27-32. [PDF, 309KB]

(2012) Guo-Hau Lai, Ying-Mei Guo, & Richard Tzong-Han Tsai: Unsupervised Japanese-Chinese opinion word translation using dependency distance and feature-opinion association weight. Proceedings of COLING 2012: Technical Papers, Mumbai, December 2012; pp.1503-1517. [PDF, 254KB]

(2011) Ivan Vulić, Wim De Smet, & Marie-Francine Moens: Identifying word translations from comparable corpora using latent topic models. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.479-484. [PDF, 153KB]

(2010) Daniel Andrade, Tetsuya Nasukawa, & Jun’ichi Tsujii: Robust measurement and comparison of context similarity for finding translation pairs. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.19-27. [PDF, 709KB]

(2010) Audrey Laroche & Philippe Langlais: Revisiting context-based projection methods for term-translation spotting in comparable corpora. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.617-625. [PDF, 224KB]

(2010) Takashi Tsunakawa & Hiroyuki Kaji: Augmenting a bilingual lexicon with information for word translation disambiguation. [Coling 2010] Proceedings of the 8th Workshop on Asian Language Resources, Beijing, China, 21-22 August 2010; pp.30-37. [PDF, 175KB]

Word frequency

(2010) Adam Kilgarriff: Comparable corpora within and across languages, word frequency lists and the KELLY project. [LREC 2010] Proceedings of the 3rd Workshop on Building and Using Comparable Corpora, Malta, 22 May 2010; pp.1-5. [PDF, 113KB]

Written forms

 (2014) Nicholas Ruiz & Marcello Federico: Complexity of spoken versus written language for machine translation. Proceedings of the 17th annual conference of the European Association for Machine Translation, EAMT 2014, Dubrovnik, Croatia, 16th-18th June 2014, edited by Marko Tadić, Philipp Koehn, Johann Roturier, Andy Way; pp.173-180. [PDF, 479KB]

2013) Mohammad Sadegh Rasooli, Ahmed El Kholy, & Nizar Habash: Orthographic and morphological processing for Persian-to-English statistical machine translation. International Joint Conference on Natural Language Processing, Nagoya, Japan, 14-18 October 2013; pp.1047-1051. [PDF, 106KB]

(2013) Sara Stymne, Nicola Cancedda, & Lars Ahrenberg: Generation of compound words in statistical machine translation into compounding languages. Computational Linguistics 39 (4); pp.1067-1108. [PDF, 615KB]

(2011) Vicent Alabau, Alberto Sanchis, & Francisco Casacuberta: Improving on-line handwritten recognition using translation models in multimodal interactive machine translation. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland, Oregon, June 19-24, 2011; pp.389-394. [PDF, 417KB]

(2010) Ahmed El Kholy & Nizar Habash: Orthographic and morphological processing for English-Arabic statistical machine translation. TALN 2010. Proceedings of Traitement Automatique du Langage Naturel, 19-23 juillet 2010. Montréal, Canada. 10pp. [PDF, 147KB]

(2010) Ting Qian, Kristy Hollingshead, Su-youn Yoon, Kyoung-young Kim, & Richard Sproat: A Python toolkit for universal transliteration. LREC 2010: proceedings of the  seventh international conference on Language Resources and Evaluation, 17-23 May 2010, Valletta, Malta; pp.2897-2901. [PDF, 576KB]