Machine Translation Archive

Index to methods, techniques, processes

Publications 2000-2004

Alignment (see also Character alignment, Statistical analysis, Word alignment)

(2004) Oliver Bender, Richard Zens, Evgeny Matusov & Hermann Ney: Alignment templates: the RWTH SMT system. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 79-84 [PDF, 193KB]

(2004) Christophe Chenon: Dynamical visualization of nested correspondences. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 17pp. [PDF, 191KB]

(2004) Percy Cheung & Pascale Fung: Sentence alignment in parallel, comparable, and quasi-comparable corpora.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 30-33. [PDF, 323KB]

(2004) Thomas C.Chuang, Jian-Cheng Wu, Tracy Lin, Wen-chie Shei & Jason S.Chang: Bilingual sentence alignment based on punctuation statistics and lexicon. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.224-232. [abstract]

(2004) Lea Cyrus & Hendrik Feddes: A model for fine-grained alignment of multilingual texts. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.15-22. [PDF, 141KB]

(2004) Hal Daumé III & Daniel Marcu: A phrase-based HMM approach to document/abstract alignment.  EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 111KB]

(2004) Pascale Fung & Percy Cheung: Multi-level bootstrapping for extracting parallel sentences from a quasi-comparable corpus.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 245KB]

(2004) Pascale Fung & Percy Cheung: Mining very-non-parallel corpora: parallel sentence and lexicon extraction via bootstrapping and EM. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 276KB]

(2004) Daniel Gildea:  Dependencies vs constituents for tree-based alignment. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 97KB]

(2004) Adrià de Gispert, José B. Mariño & Josep M. Crego: Phrase-based alignment combining corpus cooccurrences and linguistic knowledge. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 107-114 [PDF, 178KB]

(2004) Declan Groves, Mary Hearne, & Andy Way: Robust sub-sentential alignment of phrase-structure trees. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 94KB]

(2004) Paul Kingsbury, Nianwen Xue, & Martha Palmer: Propbanking in parallel.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 34-37. [PDF, 305KB]

(2004) Patrick Lambert & Núria Castell: Alignment of parallel corpora exploiting asymmetrically aligned phrases.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 26-29. [PDF, 239KB]

(2004) Li Weigang, Liu Ting, Wang Zhen, & Li Sheng: Aligning bilingual corpora using sentences location information. [ACL-2004] Proceedings of the Third SIGHAN Workshop on Chinese Language Learning, 25 July 2004, Barcelona; 7pp. [PDF, 100KB]

(2004) Feifan Liu, Qianli Jin, Jun Zhao & Bo Xu: Bilingual chunk alignment based on interactional matching and probabilistic latent semantic indexing.  First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.416-425. [abstract]

(2004) Chris Pike & I.Dan Melamed: An automatic filter for non-parallel texts. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 128KB]

(2004) Kiyotaka Uchimoto, Yujie Zhang, Kiyoshi Sudo, Masaki Murata, Satoshi Sekine, & Hitoshi Isahara: Multilingual aligned parallel treebank corpus reflecting contextual information and its applications. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.63-70. [PDF, 162KB]

(2004) Jui-Feng Yeh, Chung-Hsien Wu, Ming-Jun Chen, & Liang-Chih Yu: Automated alignment and extraction of bilingual ontology for cross-language domain-specific applications. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 196KB]

(2004) Hao Zhang & Daniel Gildea: Syntax-based alignment: supervised or unsupervised? Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 87KB]

(2003) Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka, & Hideki Tanaka: Word selection for EBMT based on monolingual similarity and translation confidence. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 1565KB]

(2003) Regina Barzilay & Noemie Elhadad: Sentence alignment for monolingual comparable corpora. EMNLP-2003: proceedings of the 2003 conference on Empirical Methods in Natural Language Processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03, 11-12 July 2003, Sapporo, Japan; 8pp. [PDF, 94KB]

(2003) Daniel Gildea: Loosely tree-based alignment for machine translation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 109KB]

(2003) Hideki Kashioka, Takehiko Maruyama, & Hideki Tanaka: Building a parallel corpus for monologues with clause alignment. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.216-223. [PDF, 379KB]

(2003) Chun-Jen Lee & Jason S. Chang: Acquisition of English-Chinese transliterated word pairs from parallel-aligned texts using a statistical machine transliteration model. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 344KB]

(2003) Joel Martin, Howard Johnson, Benoit Farley, & Anna Maclachlan: Aligning and using an English-Inuktitut parallel corpus. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 30KB]

(2003) I.Dan Melamed: Multitext grammars and synchronous parsers. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 1126KB]

(2003) Francisco Nevado, Francisco Casacuberta, & Enrique Vidal: Parallel corpora segmentation using anchor words. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 33-40 [PDF, 382KB]

(2003) Stephen Nightingale & Hideki Tanaka: Comparing the sentence alignment yield from two news corpora using a dictionary-based alignment system. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 142KB]

(2003) Franz Josef Och & Hermann Ney: A systematic comparison of various statistical alignment models. Computational Linguistics 29 (1), pp.19-51 [PDF, 296KB]

(2003) Bo Pang, Kevin Knight, & Daniel Marcu: Syntax-based alignment of multiple translations: extracting paraphrases and generating new sentences. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 91KB]

(2003) Emmanuel Planas & Osamu Furuse: Formalizing translation memory. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 157-188.

(2003) Lee Schwartz, Takako Aikawa, & Chris Quirk: Disambiguation of English PP attachment using multilingual aligned data. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.330-337. [PDF, 98KB]

(2003) Michel Simard: Translation spotting for translation memories. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 120KB]

(2003) Michel Simard & Philippe Langlais: Statistical translation alignment with compositionality constraints. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 65KB]

(2003) Ashish Venugopal, Stephan Vogel, & Alex Waibel: Effective phrase translation extraction from aligned models. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 134KB]

(2003) Jian-Cheng Wu, Kevin C.Yeh, Thomas C.Chuang, Wen-Chi Shei, & Jason S.Chang: TotalRecall: a bilingual concordance for computer assisted translation and language learning. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 76KB]

(2003) Bing Zhao, Klaus Zechner, Stephan Vogel, & Alex Waibel: Efficient optimization for bilingual sentence alignment based on linear regression. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 93KB]

(2002) Mosleh H.Al-Adhaileh, Tang Enya Kong, & Zaharin Yusoff: A synchronization structure of SSTC and its applications in machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 8pp. [PDF, 272KB]

(2002) Michael Barlow: ParaConc: concordance software for multilingual parallel corpora. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.20-24. [PDF, 97KB]

(2002) Luisa Bentivogli & Emanuele Pianta: Opportunistic semantic tagging. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp. 1401-1406. [PDF, 61KB]

(2002) Thomas C.Chuang, G.N.You, & Jason Chang: Adaptive bilingual sentence alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 21-30. [go to publisher details]

(2002) George Foster, Philippe Langlais, & Guy Lapalme: Text prediction with fuzzy alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 44-53. [go to publisher details]

(2002) Ismael García Varea, Franz J. Och, Hermann Ney & Francisco Casacuberta: Improving alignment quality in statistical machine translation using context-dependent maximum entropy models. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 208KB]

(2002) Ismael Garcia Varea, Franz J.Och, Hermann Ney, & Francisco Casacuberta: Efficient integration of maximum entropy lexicon models within the training of statistical alignment models. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 54-63. [go to publisher details]

(2002) Mathieu Guidère: Towards a corpus-based machine translation for standard Arabic. Translation Journal 6 (1), January 2002; 12pp. [PDF, 173KB]

(2002) Kenji Imamura: Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 142KB]

(2002) Tz-Liang Kueng & Keh-Yih Su: A robust cross-style bilingual sentences alignment model. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 290KB]

(2002) Kenji Matsumoto & Hideki Tanaka: Automatic alignment of Japanese and English newspaper articles using an MT system and a bilingual company name dictionary. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.480-484. [PDF, 252KB]

(2002) Robert C. Moore: Fast and accurate sentence alignment of bilingual corpora. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 135-144. [go to publisher details]

(2002) R.Muñoz, R.Mitkov, M.Palomar, J.Peral, R.Evans, L.Moreno, C.Orasan, M.Saiz-Noeda, A.Ferrández, C.Barbu, P.Martínez-Barco, & A.Suárez: Bilingual alignment of anaphoric expressions. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.2088-2093. [PDF, 76KB]

(2002) Noah A.Smith: From words to corpora: recognizing translation. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.95-102 [PDF, 303KB]

(2002) Le Sun, Song Xue, Weimin Qu, Xiaofeng Wang, & Yufang Sun: Constructing a large-scale Chinese-English parallel corpus. Coling-2002: Third Workshop on Asian Language Resources and International Standardization, 31 August 2002, Taipei, Taiwan; 8pp. [PDF, 359KB]

(2002) Taro Watanabe, Kenji Imamura and Eiichiro Sumita: Statistical machine translation based on hierarchical phrase alignment. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.188-198. [PDF, 165KB]

(2002) Bing Zhao & Stephan Vogel: Full-text story alignment models for Chinese-English bilingual news corpora. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.517-520; abstract [PDF, 45KB]

(2001) Kirsty Macdonald: Improving automatic alignment for translation memory creation. Translating and the Computer 23: papers from the Aslib conference held on 29 & 30 November 2001 (London: Aslib, 2001); 11pp. [PDF, 114KB]

(2001) Pierluigi Melotto: The alignment tool: how to exploit previously translated texts. International Journal for Language and Documentation 10, August 2001; pp.17-18. [PDF, 1144KB]

(2001) Arul Menezes & Stephen D. Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Example-Based Machine Translation. [PDF, 72KB]

(2001) Arul Menezes & Stephen D.Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.39-46. [PDF, 84KB]

(2001) Philip Resnik: [review of] Parallel text processing: alignment and use of translation corpora [ed. by] Jean Véronis. Computational Linguistics 27 (4), pp.592-595. [PDF, 333KB]

(2001) António Ribeiro, Gaël Dias, Gabriel Lopes & João Mexia: Cognates alignment. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.287-292. [PDF, 70KB]

(2001) Mark Stevenson: [review of] Jean Véronis: Parallel text processing (Kluwer). In: Machine Translation Review, issue 12: December 2001; pp.75-76.

(2000) Michael Carl: Extracting invertible translations from pre-aligned texts.  LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 321KB]

 (2000) Christian Fluhr, Fréderique Bisson, & Faïza Elkateb: Parallel text alignment using crosslingual information retrieval techniques [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 187-200.

(2000) Seonho Kim, Juntae Yoon, & Mansuk Song: Structural feature selection for English-Korean statistical machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 439-445 [PDF, 635KB]

(2000) Fang Li & Wilhelm Weisweber: Bilingual lexicon extraction from Internet. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 7pp. [PDF, 59KB]

(2000) Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, & Stanley Peters: A bootstrapping method for extracting bilingual text pairs. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1066-1070 [PDF, 464KB]

 (2000) I.Dan Melamed: Pattern recognition for mapping bitext correspondence [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 25-47.

 (2000) Alan K.Melby: Sharing of translation memory databases from aligned parallel text [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 347-368.

(2000) Franz Josef Och & Hermann Ney: Improved statistical alignment models. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 199KB]

(2000) Franz Josef Och & Hermann Ney: A comparison of alignment models for statistical machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1086-1090 [PDF, 430KB]

 (2000) Stelios Piperidis, Harris Papageorgiou, & Sotiris Boutsis: From sentences to words and clauses [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 117-138.

(2000) António Ribeiro, Gabriel Lopes, & João Mexia: Using confidence bands for parallel texts alignment. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 260KB]

(2000) António Ribeiro, Gabriel Lopes, & João Mexia: A self-learning method of parallel texts alignment. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.30-39. [go to publisher details]

 (2000) Laurent Romary & Patrice Bonhomme: Parallel alignment of structured documents [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 201-217.

(2000) Sayori Shimohata: An empirical method for identifying and translating technical terminology. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 782-788 [PDF, 513KB]

 (2000) Michel Simard: Multilingual text alignment: aligning three or more versions of a text [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 49-67.

(2000) Noah A. Smith & Michael E. Jahr: Cairo: an alignment visualization tool. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 549-551. [PDF, 45KB]

(2000) Ioannis Triantafyllou, Iason Demiros, Christos Malavazos, & Stelios Piperidis: An alignment architecture for translation memory bootstrapping. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1579KB]

(2000) Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000.

 (2000) Jean Véronis, & Philippe Langlais: Evaluation of parallel text alignment systems: the ARCADE project [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 369-388.

(2000) Jean Véronis: From the Rosetta stone to the information society: a survey of parallel text processing [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp.1-24. 

(2000) S.Vogel & H.Ney: Construction of a hierarchical translation memory. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1131-1135 [PDF, 396KB]

 (2000) Dekai Wu: Bracketing and aligning words and constituents in parallel text using stochastic inversion transduction grammars [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 139-167.

(2000) Kaoru Yamamoto & Yuji Matsumoto: Acquisition of phrase-level bilingual correspondence using dependency structure. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 933-939 [PDF, 592KB]

Analogies and analogical modelling (see also Example-based methods)

(2004) Yves Lepage: Lower and higher estimates of “true analogies” between sentences contained in a large multilingual corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 999KB]

(2000) Christos Malavazos & Stelios Piperidis: Application of analogical modelling to example-based machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 516-522 [PDF, 665KB]

Case-based methods

(2003) Bróna Collins & Harold Somers: EBMT seen as case-based reasoning. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 115-153.

(2003) Harold Somers & Hermione Lovel: Computer-based support for patients with limited English. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 41-49 [PDF, 198KB]

(2001) Harold Somers: EBMT seen as case-based reasoning. MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Workshop on Example-Based Machine Translation. [PDF, 186KB]

Character alignment

(2000) Byung-Ju Kang & Key-Sun Choi: English-Korean automatic transliteration/back-transliteration system and character alignment. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.17-18 [PDF, 117KB]

Context-based methods

(2000) Stephan Koch, Uwe Küssner, & Manfred Stede: Contextual disambiguation. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.466-476. [abstract]

Disambiguation (see also Word sense disambiguation)

(2004) Bogdan Babych & Anthony Hartley: Disambiguating translation strategies in MT using automatic named entity recognition. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 18-25. [PDF, 243KB]

(2004) Sanae Fujita & Francis Bond: An automatic method of creating valency entries using plain bilingual dictionaries; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.55-64. [PDF, 147KB]

(2004) Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz & Mikel L. Forcada: Cooperative unsupervised training of the part-of-speech taggers in a bidirectional machine translation system; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.135-144. [PDF, 92KB]

(2004) Lee Schwartz & Takako Aikawa: Multilingual corpus-based approach to the resolution of English –ing.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.959-962. [PDF, 301KB]

(2003) Arendse Bernth & Michael C. McCord: A hybrid approach to deriving selectional preferences. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 61KB]

(2003) Lee Schwartz, Takako Aikawa, & Chris Quirk: Disambiguation of English PP attachment using multilingual aligned data. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.330-337. [PDF, 98KB]

(2003) Eric Wehrli: Translation of words in context. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.502-505. [PDF, 203KB]

(2002) Eneko Agirre, Mikel Lersundi, & David Martinez: A multi-lingual approach to disambiguate prepositions and case suffixes; ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", July 2002, Philadelphia, USA; pp. 1-8 [PDF, 166KB]

(2002) Paisarn Charoenpornsawat, Virach Sornlertlamvanich & Thatsanee Charoenporn: Improving translation quality of rule-based machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 6pp. [PDF, 355KB]

(2002) Mathieu Lafourcade & Christian Boitet: UNL lexical selection with conceptual vectors. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1958-1964. [PDF, 82KB]

(2002) Hyo-Kyung Lee: Classification approach to word selection in machine translation. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 114-123. [go to publisher details]

(2002) Li Tangqiu, Yang Xiaofeng, Hong Qingyang, & Li Shaozhi: A hybrid method for syntactic and semantic structure disambiguation of Chinese. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.43-55.

(2002) Arul Menezes: Better contextual translation using machine learning. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 124-134. [go to publisher details]

(2001) Ismael García-Varea, Franz J. Och, Hermann Ney, & Francisco Casacuberta: Refined lexicon models for statistical machine translation using a maximum entropy approach. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.204-211. [PDF, 72KB]

(2001) Keiko Horiguchi: Information-based machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF,  105KB]

 (2001) K.L.Kwok: NTCIR-2 Chinese, cross language retrieval experiment using PIRCS. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 8pp. [PDF, 202KB]

 (2000) Stephan Koch, Uwe Küssner, & Manfred Stede: Contextual disambiguation. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.466-476. [abstract]

(2000) Michael Schiehlen: Granularity effects in tense translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 712-718 [PDF, 568KB]

Generation (see also Text prediction)

(2004) Gábor Prószéky, László Tihanyi, & Gábor Ugray: Moose: a robust high-performance parser and generator. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 138-142. [PDF, 179KB]

(2004) Eric Ringger, Michael Gamon, Robert C.Moore, David Rojas, Martine Smets, & Simon Corston-Oliver: Linguistically informed statistical models of constituent structure for ordering in sentence realization.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 116KB]

(2004) Fei Xia & Michael McCord: Improving a statistical MT system with automatically learned rewrite patterns. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 1055KB]

(2004) Richard Zens, Hermann Ney, Taro Watanabe, & Eiichiro Sumita: Reordering constraints for phrase-based statistical machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 178KB]

(2004) Bing Zhao, Matthias Eck, & Stephan Vogel: Language model adaptation for statistical machine translation with structured query models. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 224KB]

(2003) Arendse Bernth: Controlled generation for speech-to-speech MT systems. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 28KB]

(2003) Charles Callaway: Multilingual revision. ENLG-2003: Proceedings of the 9th European Workshop on Natural Language Generation, April 13th-14th 2003, Budapest, Hungary; pp.15-22. [PDF, 1195KB]

(2003) Nano Gough &  Andy Way: Controlled generation in example-based machine translation MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 45KB]

(2003) Bo Pang, Kevin Knight, & Daniel Marcu: Syntax-based alignment of multiple translations: extracting paraphrases and generating new sentences. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 91KB]

(2003) Richard Power, Donia Scott & Anthony Hartley: Multilingual generation of controlled languages. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.115-123. [PDF, 325KB]

(2003) Martine Smets, Michael Gamon, Jessie Pinkham, Tom Reutter, & Martine Pettenaro: High quality machine translation using a machine-learned sentence realization component. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.362-369. [PDF, 76KB]

(2003) Martine Smets, Michael Gamon, Simon Corston-Oliver & Eric Ringger: The adaptation of a machine-learned sentence realization system to French. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.323-330 [PDF, 373KB]

(2003) Nicola Ueffing & Hermann Ney: Using POS information for statistical machine translation into morphologically rich languages. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.347-354 [PDF, 384KB]

(2003) Cristina Vertan & Walther von Hahn: Menu choice translation: a flexible menu-based controlled natural language system. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.194-199. [PDF, 62KB]

(2002) Guillermo Barrutieta, Joseba Abaitua, & JosuKa Díaz: Cascading XSL filters for content selection in multilingual document generation. Coling-2002: Second Workshop on NLP and XML (NLPXML-2002), August 2002, Taipei, Taiwan; 6pp. [PDF, 261KB]

(2002) Stephan Busemann: Issues in generating text from interlingua representations. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: First international workshop on UNL, other interlinguas and their applications, Las Palmas Canary Islands, 27 May 2002; pp.1-7. [PDF, 85KB]

(2002) T.Dhanabalan, K.Saravanan, & T.V.Geetha: Tamil to UNL enconverter. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of  Technology Bombay; 15pp. [PDF, 127KB]

(2002) Nizar Habash & Bonnie Dorr: Handling translation divergences: combining statistical and symbolic techniques in generation-heavy machine translation. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 84-93. [go to publisher details]

(2002) Ian Marshall and Éva Sáfár: Sign language generation using HPSG. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.105-114. [PDF, 229KB]

(2002) Yohei Seki & Ken'ichi Harada: Machine translation based on NLG from XML-DB. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 194KB]

(2002) Koichi Takeda: Sentence generation for pattern-based machine translation. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.168-177. [PDF, 224KB]

(2002) Bowen Zhou, Yuqing Gao, Jeffrey Sorensen, Zijian Diao, & Michael Picheny: Statistical natural language generation for speech-to-speech machine translation systems. ICSLP 2002, Interspeech 2002: 7th International Conference on Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1897-1900; abstract [PDF, 54KB]

(2001) Takako Aikawa, Maite Melero, Lee Schwartz & Andi Wu: Generation for multilingual MT. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 9-14. [PDF, 210KB]

(2001) Takako Aikawa, Maite Melero, Lee Schwartz & Andi Wu: Multilingual sentence generation. ACL 2001 Eighth European Workshop on Natural Language Generation, July 2001, Toulouse, France; 7pp. [PDF, 81KB]

(2001) Guillermo Barrutieta, Joseba Abaitua & JosuKa Díaz: Gross-grained RST through XML metadata for multilingual document generation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 39-42. [PDF, 151KB]

(2001) Hai Doan-Nguyen: Generation of Vietnamese for French-Vietnamese and English-Vietnamese machine translation. ACL 2001 Eighth European Workshop on Natural Language Generation, July 2001, Toulouse, France; 10pp. [PDF, 209KB]

(2001) Marcos Franco Sabarís, José Luis Rojas Alonso, C. Dafonte & B. Arcay: Multilingual authoring through an artificial language. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 99-102. [PDF, 138KB]

(2001) Ulrich Germann, Michael Jahr, Kevin Knight, Daniel Marcu, & Kenji Yamada: Fast decoding and optimal decoding for machine translation. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.228-235. [PDF, 928KB]

(2001) Anthony Hartley, Donia Scott, John Bateman & Danail Dochev: AGILE - a system for multilingual generation of technical instructions. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 145-150. [PDF, 384KB]

(2001) Marisa Jiménez: Generation of named entities. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.175-179. [PDF, 177KB]

(2000) Jan Alexandersson, Peter Poller, & Michael Kipp: Generating multilingual dialog summaries and minutes. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 507-518. [abstract]

(2000) Jan Alexandersson, Peter Poller, Michael Kipp, & Ralf Engel: Multilingual summary generation in a speech-to-speech translation system for multilingual dialogues. INLG'2000 Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.148-155. [PDF, 670KB]

(2000) Tilman Becker, Anne Kilger, Patrice Lopez, & Peter Poller: The Verbmobil generation component VM-GECO. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.481-496. [abstract]

(2000) Etienne Blanc: From the UNL hypergraph to GETA’s multilevel tree. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 9pp. [PDF, 1671KB]

(2000) Caroline Brun, Marc Dymetman, & Veronika Lux: Document structure and multilingual authoring. INLG’2000 : Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.24-31. [PDF, 729KB]

(2000) Arantza Casillas, Joseba Abaitua & Raquel Martínez: DTD-driven bilingual document generation. INLG’2000 : Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.32-38. [PDF, 586KB]

(2000) Damir Ćavar, Uwe Küssner, & Dan Tidhar: From human evaluation to automatic selection of good translations. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 29-33. [PDF, 158KB]

(2000) Eric Crestan: Improvement of French generation for the KANT machine translation system. Diplôme de Recherche Technologique, 10 novembre 2000, Laboratoire d’Informatique d’Avignon. 55pp. [PDF, 208KB]

(2000) Rodolfo Delmonte, Dario Bianchi, & Emanuele Pianta: Generating from a discourse model. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 10pp. [PDF, 2145KB]

(2000) Nizar Habash: Oxygen: a language independent linearization engine. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.68-79. [go to publisher details]

(2000) Kevin Knight & Irene Langkilde: Preserving ambiguities in generation via automata intersection. 17th National conference of the American Association for Artificial Intelligence (AAAI 2000), July 30 – August 3, 2000, Austin, Texas. [PDF, 149KB]

(2000) Philippe Langlais, George Foster, Guy Lapalme, & Sébastien Sauvé: TransType: a target-text mediated interactive machine translation system. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.36-37 [PDF, 176KB]

(2000) Benoit Lavoie, Richard Kittredge, Tanya Korelsky, & Owen Rambow: A framework for MT and multilingual NLG systems based on uniform lexico-structural processing. ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp. 60-67. [PDF, 527KB]

(2000) Christos Malavazos, Stelios Piperidis, & George Carayannis: Towards memory and template-based translation synthesis. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1929KB]

(2000) Jesús Peral & Antonio Ferrández: An application of the interlingua system ISS for Spanish-English pronominal anaphora generation. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 42-51 [PDF, 733KB]

(2000) Gilles Sérasset & Christian Boitet: On UNL as the future "html of the linguistic content" & reuse of existing NLP components in UNL-related applications with the example of a UNL-French deconverter. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 768-774 [PDF, 562KB]

(2000) Dan Tidhar & Uwe Küssner: Learning to select a good translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 843-849 [PDF, 659KB]

(2000) David Traum & Nizar Habash: Generation from lexical conceptual structures. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 52-59 [PDF, 629KB]

(2000) Hua Wu, Taiyi Huang, Chengqing Zong, & Bo Xu: Chinese generation in a spoken dialogue translation system. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1141-1145 [PDF, 398KB]

(2000) Tianfang Yao, Dongmo Zhang, & Qian Wang: MLWFA: a multilingual weather forecast text generation system.  ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.13-14 [PDF, 179KB]

Lexical transfer

(2004) Timothy Chklovski, Rada Mihalcea, Ted Pedersen, & Amruta Purandare: The SENSEVAL-3 multilingual English-Hindi lexical sample task. [ACL 2004] Proceedings of SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 25-26 July 2004, Barcelona, Spain; 4pp. [PDF, 35KB]

(2004) Hang Li & Cong Li: Word translation disambiguation using bilingual bootstrapping. Computational Linguistics 30 (1), pp. 1-22. [PDF, 2311KB]

(2003) Arul Menezes & Stephen D. Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp.421-442.

(2002) Etienne Blanc, Gilles Sérasset, & WangJu Tsai: Structural and lexical transfer from an UNL graph to an equivalent natural language dependency tree. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: First international workshop on UNL, other interlinguas and their applications, Las Palmas Canary Islands, 27 May 2002; pp.14-18. [PDF, 53KB]

(2002) Mathieu Lafourcade & Christian Boitet: UNL lexical selection with conceptual vectors. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1958-1964. [PDF, 82KB]

(2001) Srinivas Bangalore & Giuseppe Riccardi: A finite-state approach to machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 210KB]

(2001) Benoit Lavoie, Michael White, & Tanya Korelsky: Inducing lexico-structural transfer rules from parsed bi-texts. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.17-24. [PDF, 60KB]

(2000) Martin C. Emele, Michael Dorna, Anke Lüdeling, Heike Zinsmeister, & Christian Rohrer: Semantic-based transfer. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.359-376. [abstract]

(2000) Benoit Lavoie, Richard Kittredge, Tanya Korelsky, & Owen Rambow: A framework for MT and multilingual NLG systems based on uniform lexico-structural processing. ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp. 60-67. [PDF, 527KB]

(2000) Hiroshi Nakagawa: Disambiguation of lexical translations based on bilingual comparable corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 100KB]

Morphological analysis (see also Tagging)

(2004) Gabor Alberti & Judit Kleiber: The GeLexi MT project. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.1-10. [PDF, 258KB]

(2004) Simon Corston-Oliver & Michael Gamon: Normalizing German and English inflectional morphology to improve statistical word alignment. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 48-57. [go to publisher details]

(2004) Jiri Hana, Anna Feldman & Chris Brew: A resource-light approach to Russian morphology: tagging Russian using Czech resources. [EMNLP 2004] Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 91KB]

(2004) Edward A.Kool: Representing Dutch morphology in a machine translation system. International Journal of Translation 16 (1), Jan-June 2004; pp.109-120. [PDF, 108KB]

(2004) Young-Suk Lee: Morphological analysis for statistical machine translation. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA. Short Papers; pp. 57-60. [PDF, 150KB]

(2004) Sonja Nießen & Hermann Ney: Statistical machine translation with scarce resources using morpho-syntactic information. Computational Linguistics 30 (2), pp.181-204. [PDF, 147KB]

(2004) Maja Popović & Hermann Ney: Improving word alignment quality using morpho-syntactic information. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 5pp. [PDF, 141KB]

(2004) Maja Popović & Hermann Ney: Towards the use of word stems and suffixes for statistical machine translation.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1585-1588. [PDF, 534KB]

(2004) Gábor Prószéky, László Tihanyi, & Gábor Ugray: Moose: a robust high-performance parser and generator. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 138-142. [PDF, 179KB]

(2004) Bettina Schrader: Improving word alignment quality using linguistic knowledge. LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 46-49. [PDF, 233KB]

(2004) Stephan Vogel & Christian Monson: Augmenting manual dictionaries for statistical machine translation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1593-1596. [PDF, 264KB]

(2003) Paul R.Bowden: A Latin morphological processor.  In: Machine Translation Review, issue 14: December 2003; pp.6-16.

(2003) Joseph Dichy & Ali Farghaly: Roots & patterns vs. stems plus grammar-lexis specifications: on what basis should a multilingual database centred on Arabic be built? MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF]

(2003) Alon Itai & Erel Segal: A corpus based morphological analyzer for unvocalized modern Hebrew. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 250KB]

(2003) Kevin McTait: Translation patterns, linguistic knowledge and complexity in an approach to EBMT. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 307-338.

(2003) Nicola Ueffing & Hermann Ney: Using POS information for statistical machine translation into morphologically rich languages. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.347-354 [PDF, 384KB]

(2002) Kemal Altintas & Ilyas Cicekli: A machine translation system between a pair of closely related languages. Proceedings of the 17th International Symposium on Computer and Information Sciences (ISCIS 2002), Orlando, Florida, 2002 (CRC Press); pp.192-196.  [PDF, 204KB]

(2002) Alicia Garrido-Alenda, Mikel L. Forcada and Rafael C. Carrasco: Incremental construction and maintenance of morphological analysers based on augmented letter transducers. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 171KB]

(2002) Claudia Gdaniec & Esmé Manandise: Using word formation rules to extend MT lexicons. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 64-73. [go to publisher details]

(2002) Barbara Gawronska, Björn Erlendsson and Hanna Duczak: Extracting semantic classes and morphosyntactic features for English-Polish machine translation. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 252KB]

(2002) Sonja Nießen: Improving statistical machine translation using morpho-syntactic information. PhD thesis, Rheinisch-Westfälische Technische Hochschule Aachen, Dezember 2002. 123pp. [PDF, 1505KB]

(2002) Gábor Prószéky, Mátyás Naszódi & Balázs Kis: Recognition assistance: treating errors in text acquired from various recognition processes. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 63KB]

(2001) Juan A. Alonso, Ramón Fanlo & Albert Llorens: Collapsing morphological information in lexical databases for NLP applications. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 21-26. [PDF, 872KB]

(2001) Claudia Gdaniec, Esmé Manandise, & Michael C. McCord: Derivational morphology to the rescue: how it can help resolve unfound words in MT. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 127-131. [PDF, 167KB]

(2001) Muhtar Mahsut, Yasuhiro Ogawa, Kazue Sugino & Yasuyoshi Inagaki: Utilizing agglutinative features in Japanese-Uighur machine translation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.217-222. [PDF, 60KB]

(2001) Rajmund Piotrowski, Yuri Romanov, Yuri Tovmach, Natalia Zaitseva, & Michael Blekhman: Dictionary organization in Linguistic Automaton for oriental languages.  International Journal of Translation 13 (1-2), Jan-Dec 2001; pp.105-118. [PDF, 108KB]

(2001) Alberto Sanchis, David Picó, Joan Miquel del Val, Ferran Fabregat, Jesús Tomás, Moisés Pastor, Francisco Casacuberta & Enrique Vidal: A morphological analyser for machine translation based on finite-state transducers. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.305-309. [PDF, 48KB]

(2000) Mikel L. Forcada: Learning machine translation strategies using commercial systems: discovering word-reordering rules. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1379KB]

(2000) Jan Hajič: Morphological tagging: data vs.dictionaries. [NAACL 2000] 1st Meeting of the North American Chapter of the Association for Computational Linguistics: proceedings of the conference, April 29-May 4, 2000, Seattle, Washington, USA; pp.94-101. [PDF,

(2000) Hoda O.M.Mokhtar, Nevin M.Darwish, & Ahmed A.Rafea: An automated system for English-Arabic translation of scientific texts (SEATS). MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 5pp. [PDF, 1096KB]

(2000) Sonja Niessen & Hermann Ney: Improving SMT quality with morpho-syntactic analysis. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1081-1085 [PDF, 430KB]

(2000) Rémi Zajac, Steve Helmreich, & Karine Megerdoomian: Black-box/glass-box evaluation in Shiraz. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop proceedings: Evaluation of machine translation, Athens, Greece, 29 May 2000; pp. 13-20. [PDF, 693KB]

Post-editing

(2004) Jeff Allen: Mastering machine translation (MT) output. Tutorial notes, at AMTA 2004: Sixth biennial conference of the Association for Machine Translation in the Americas, September 28, 2004, Georgetown University, Washington, DC, USA [PDF of PPT presentation, 684KB]

(2004) Chris Callison-Burch, Colin Bannard, & Josh Schroeder: Improving statistical translation through editing. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp.26-32. [PDF, 227KB]

(2004) Ariadna Font Llitjós & Jaime Carbonell: The translation correction tool: English-Spanish user studies. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.347-350. [PDF, 606KB]

(2004) Lorena Guerra Martínez: PROMT Professional and the importance of building and updating machine translation dictionaries. International Journal of Translation 16 (1), Jan-June 2004; pp.121-139. [PDF, 254KB]

(2004) Lambros Kranias & Anna Samiotou: Automatic translation memory fuzzy match post-editing: a step beyond traditional TM/MT integration. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.331-334. [PDF, 346KB]

(2004) Ana Niño: Recycling MT: a course on foreign language writing via MT post-editing.  Proceedings of the 7th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics [CLUK-2004], Birmingham, UK, 6-7 January 2004; 9pp. [PDF, 170KB]

(2004) Sharon O’Brien: Machine translatability and post-editing effort: how do they relate? Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 31pp. [PDF, 38KB]

(2004) Celia Rico & Enrique Torrejón: Controlled translation as a new translation scenario: training the future user. Translating and the Computer 26: proceedings of the Twenty-sixth International Conference on Translating and the Computer, 18-19 November 2004, London. (London: Aslib, 2004); 24pp. [PDF, 141KB]

(2003) Jeffrey Allen: Post-editing. In: Harold Somers (ed.) Computers and translation: a translator’s guide (Amsterdam/Philadelphia: John Benjamins Publishing Company, 2003); pp.297-317.

(2003) Judith Belam: Buying up to falling down: a deductive approach to teaching post-editing. MT Summit IX, New Orleans, USA, Workshop on Teaching Translation Technologies and Tools, 27 September 2003. [PDF, 208KB]

(2003) Charles Callaway: Multilingual revision. ENLG-2003: Proceedings of the 9th European Workshop on Natural Language Generation, April 13th-14th 2003, Budapest, Hungary; pp.15-22. [PDF, 1195KB]

(2003) Catherine Hyland: Testing "Prompt": the development of a rapid post-editing service at CLS Corporate Language Services AG, Switzerland. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.189-193 [PDF, 201KB]

(2003) Antonio Sayáns Gómez & Elena Villar Conde: The functionality of a tool bar for postedition in machine translation between languages with linguistic interference: the Spanish-Galician case. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.118-124. [PDF, 354KB]

(2003) Falko Schäfer: MT post-editing: how to shed light on the "unknown task". Experiences at SAP. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.133-140. [PDF, 332KB]

(2002) Sharon O'Brien: Teaching post-editing: a proposal for course content. Sixth EAMT Workshop "Teaching machine translation", November 14-15, 2002, UMIST, Manchester, England; pp.99-106. [PDF, 170KB]

(2001) Jeff Allen: Post-editing or no post-editing? International Journal for Language and Documentation 8, December 2000/January 2001; pp.41-42. [PDF, 596KB]

(2001) Jeffrey Allen:  Postediting: an integrated part of a translation software program. Language International 13 (2), April 2001; pp.26-29. [PDF, 971KB]

(2001) Walter Hartmann: ATA Postediting Workshop. In: MT News International no.29, December 2001. [PDF]

(2001) Claus Povlsen & Annelise Bech: Ape: reducing the monkey business in post-editing by automating the task intelligently. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.283-286. [PDF, 134KB]

(2000) Gustavo A. Silva: The use of ENGSPAN at the Pan American Health Organization: a reviser’s perspective. AMTA 2000 pre-conference workshop “Machine translation in practice: from old guard to new guard”, Cuernavaca, Mexico, October 10, 2000. 8pp. [PDF, 123KB]

Pre-editing and pre-processing (see also Controlled languages)

(2004) Rita Nübel: Evaluation and adaptation of a specialised language checking tool for non-specialised machine translation and non-expert MT users for multi-lingual telecooperation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2027-2030. [PDF, 536KB]

(2002) Christian Boitet & Wang-Ju Tsai: Coedition to share text revision across languages and improve MT a posteriori. Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 116KB]

(2002) Christian Boitet & Tsai Wang-Ju: Coedition of texts and UNL graphs to share text revision across languages and improve MT a posteriori. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of  Technology Bombay; 46 slides [PDF of PPT presentation, 418KB]

(2002) Lorna Joy: Translating tagged text – imperfect matches and a good finished job.  Translating and the Computer 24: proceedings from the Aslib conference held on 21-22 November 2002 (London: Aslib, 2002); 15pp. [PDF, 222KB]

(2002) Chi-Chiang Shei: Teaching MT through pre-editing: three case studies. Sixth EAMT Workshop "Teaching machine translation", November 14-15, 2002, UMIST, Manchester, England; pp.89-98. [PDF, 58KB]

(2002) Hideo Watanabe, Katashi Nagao, Michael C. McCord & Arendse Bernth: An annotation system for enhancing quality of natural language processing. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 202KB]

(2001) Daniel J. Walker, David E. Clements, Maki Darwin and Jan W. Amtrup: Sentence boundary detection: a comparison of paradigms for improving MT quality. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 369-372. [PDF, 181KB]

(2000) Geoffrey Kingscott: Smart profile: John M.Smart – a profile of the man whose company wants to create a “global language” for the 21st century. International Journal for Language and Documentation 5, June 2000; pp.26-27. [PDF, 1629KB]

(2000) Daniel Marcu, Lynn Carlson, & Maki Watanabe: The automatic translation of discourse structures. [NAACL 2000] 1st Meeting of the North American Chapter of the Association for Computational Linguistics: proceedings of the conference, April 29-May 4, 2000, Seattle, Washington, USA; pp.9-17. [PDF,

Rapid development of MT systems

(2004) Tod Allman & Stephen Beale: An environment for quick ramp-up multi-lingual authoring. International Journal of Translation 16 (1), Jan-June 2004; pp.73-85. [PDF, 142KB]

(2004) Craig Kopris: Rapid MT experience in an LCTL (Pashto). Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 130-133. [go to publisher details]

(2004) Alon Lavie, Erik Peterson, Katharina Probst, Shuly Wintner & Yaniv Eytani: Rapid prototyping of a transfer-based Hebrew-to-English machine translation system. TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.1-10 [PDF, 158KB]

(2004) Alon Lavie, Katharina Probst, Erik Peterson, Stephan Vogel, Lori Levin, Ariadna Font-Llitjos, & Jaime Carbonell: A trainable transfer-based MT approach for languages with limited resources. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 116-123. [PDF, 265KB]

(2004) Franz Josef Och, Daniel Gildea, Sanjeev Khudanpur, Anoop Sarkar, Kenji Yamada, Alex Fraser, Shankar Kumar, Libin Shen, David Smith, Katherine Eng, Viren Jain, Zhen Jin, & Dragomir Radev: A smorgasbord of features for statistical machine translation. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 161-168. [PDF, 192KB]

(2003) George Foster, Simona Gandrabur, Philippe Langlais, Pierre Plamondon, Graham Russell, & Michel Simard: Statistical machine translation: rapid development with limited resources. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 82KB]

(2003) Viet Bac Le, Brigitte Bigi, Laurent Besacier, & Eric Castelli: Using the web for fast language model construction in minority languages. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.3117-3120; abstract [PDF, 34KB]

(2003) Douglas W.Oard, David Doermann, Bonnie Dorr, Daqing He, Philip Resnik, Amy Weinberg, William Byrne, Sanjeev Khudanpur, David Yarowsky, Anton Leuski, Philipp Koehn, & Kevin Knight: Desperately seeking Cebuano. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 21KB]

(2003) Douglas W. Oard & Franz Josef Och: Rapid-response machine translation for unexpected languages. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.277-283 [PDF, 84KB]

(2003) Minako O’Hagan: Can language technology respond to the subtitler’s dilemma? – A preliminary study. Translating and the Computer 25: proceedings of the Twenty-fifth International Conference on Translating and the Computer, 20-21 November 2003, London. (London: Aslib, 2003); 18pp. [PDF, 117KB]

(2002) Alan W. Black, Ralf Brown, Robert Frederking, Rita Singh, John Moody, & Eric Steinbrecher: TONGUES: rapid development of a speech-to-speech translation system. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 183-186. [PDF, 106KB]

(2002) Alan W.Black, Ralf D.Brown, Robert Frederking, Kevin Lenzo, John Moody, Alexander I.Rudnicky, Rita Singh, & Eric Steinbrecher: Rapid development of speech-to-speech translation. ICSLP 2002, Interspeech 2002:7th International Conference on  Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1709-1712; abstract [PDF, 47KB]

(2002) Jessie Pinkham & Martine Smets: Modular MT with a learned bilingual dictionary: rapid deployment of a new language pair. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 59KB]

(2002) Alicia Tribble, Alon Lavie and Lori Levin: Rapid adaptive development of semantic analysis grammars. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.178-187. [PDF, 159KB]

(2001) Jessie Pinkham, Monica Corston-Oliver, Martine Smets & Martine Pettenaro: Rapid assembly of a large-scale French-English MT system. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.277-281. [PDF, 180KB]

(2000) Jan W. Amtrup, Karine Megerdoomian, & Rémi Zajac: Rapid development of translation tools: application to Persian and Turkish. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp.982-986 [PDF, 508KB]

Re-ordering (see also Generation, Syntactic transfer)

(2004) Michel Galley, Mark Hopkins, Kevin Knight & Daniel Marcu: What’s in a translation rule? HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp.273-280. [PDF, 568KB]

(2004) Fei Xia & Michael McCord: Improving a statistical MT system with automatically learned rewrite patterns. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 1055KB]

(2001) Srinivas Bangalore & Giuseppe Riccardi: A finite-state approach to machine translation. [NAACL-2001] Language Technologies 2001: the Second meeting of the North American Chapter of the Association for Computational Linguistics, Carnegie Mellon University, Pittsburgh, PA, 2-7 June 2001; 8pp. [PDF, 210KB]

Reusability

(2004) Igor Boguslavsky, Leonid Iomdin, & Victor Sizov: Multilinguality in ETAP-3: reuse of lexical resources. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.7-14. [PDF, 582KB]

(2004) Catarina Ribeiro, Ricardo Santos, Rui Pedro Chaves, & Palmira Marrafa: Semi-automatic UNL dictionary generation using WordNet.PT.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.279-282. [PDF, 344KB]

(2003) Martine Smets, Michael Gamon, Simon Corston-Oliver & Eric Ringger: The adaptation of a machine-learned sentence realization system to French. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.323-330 [PDF, 373KB]

(2002) Natalie Kübler: Creating a term base to customise an MT system: reusability of resources and tools from the translator’s point of view. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: Language resources for translation work and research, Las Palmas Canary Islands, 27 May 2002; pp.44-48. [PDF, 31KB]

(2001) Michael Paul: Translation knowledge recycling for related languages. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.265-268. [PDF, 223KB]

 (2000) Lars Ahrenberg, Mikael Andersson, & Magnus Merkel: A knowledge-lite approach to word alignment [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 97-116.

(2000) Arantza Casillas, Joseba Abaitua, & Raquel Martínez: Recycling annotated parallel corpora for bilingual document composition. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.117-126. [go to publisher details]

(2000) Arantxa Diaz de Ilarraza, Aingeru Mayor, & Kepa Sarasola: Reusability of wide-coverage linguistic resources in the construction of a multilingual machine translation system. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 9pp. [PDF, 1599KB]

(2000) Gilles Sérasset & Christian Boitet: On UNL as the future "html of the linguistic content" & the reuse of existing NLP components in UNL-related applications with the example of a UNL-French deconverter. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp.768-774. [PDF, 564KB]

Reversibility

(2001) Michael Carl: Inducing probabilistic invertible translation grammars from aligned texts. ACL-EACL 2001 Workshop on Computational Natural Language Learning (CoNLL), July 2001, Toulouse, France; 7pp. [PDF, 160KB]

(2000) Michael Carl: Combining invertible example-based machine translation with translation memory technology. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.127-136. [go to publisher details]

Segmentation (see also Morphological analysis, Tagging)

(2004) Slaven Bilac & Hozumi Tanaka: A hybrid back-transliteration system for Japanese.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 999KB]

(2004) Takao Doi & Eiichiro Sumita: Splitting input sentences for machine translation using language model with sentence similarity.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 61KB]

(2004) Nano Gough & Andy Way: Robust large-scale EBMT with marker-based segmentation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.95-104. [PDF, 94KB]

(2004) Feifan Liu, Qianli Jin, Jun Zhao & Bo Xu: Bilingual chunk alignment based on interactional matching and probabilistic latent semantic indexing.  First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.416-425. [abstract]

(2004) Christoph Tillmann: A unigram orientation model for statistical machine translation.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA – Short Papers; pp.101-104. [PDF, 84KB]

(2004) Francisco Nevado, Francisco Casacuberta, & Josu Landa: Translation memories enrichment by statistical bilingual segmentation. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.335-338. [PDF, 354KB]

(2004) Jia Xu, Richard Zens & Hermann Ney: Do we need Chinese word segmentation for statistical machine translation? [ACL-2004] Proceedings of the Third SIGHAN Workshop on Chinese Language Learning, 25 July 2004, Barcelona; 7pp. [PDF, 513KB]

(2003) Limdin Du & Boxing Chen: Automatic extraction of bilingual chunk lexicon for spoken language translation. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2333-2336; abstract [PDF, 34KB]

(2003) Francisco Nevado, Francisco Casacuberta, & Enrique Vidal: Parallel corpora segmentation using anchor words. 7th EAMT Workshop, "Improving machine translation through other language technology tools", 13 April 2003, Budapest, Hungary; pp. 33-40 [PDF, 382KB]

(2003) Bo Xu, Shuwu Zhang, & Chengqing Zong: Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2329-2332; abstract [PDF, 34KB]

(2002) Taro Watanabe & Eiichiro Sumita: Statistical machine translation decoder based on phrase. ICSLP 2002, Interspeech 2002:7th International Conference on  Spoken Language Processing, September 16-20, 2002, Denver, Colorado, USA; pp.1889-1902; abstract [PDF, 47KB]

(2001) Aitao Chen, Frederic C.Gey & Hailing Jiang: Berkeley at NTCIR-2: Chinese, Japanese, and English IR experiments. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 9pp. [PDF, 99KB]

(2000) Timothy Baldwin & Hozumi Tanaka: The effects of word order and segmentation on translation retrieval performance. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 35-41 [PDF, 733KB]

(2000) I.Dan Melamed: Pattern recognition for mapping bitext correspondence [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 25-47.

(2000) Tanapong Potipiti, Virach Sornlertlamvanich, & Thatsanee Charoenporn: Towards building a corpus-based dictionary for non-word-boundary languages. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 5pp. [PDF, 56KB]

Semantic analysis and representation (see also Disambiguation, Treebanks)

(2004) Eneko Agirre & David Martinez: The Basque Country University system: English and Basque tasks. [ACL 2004] Proceedings of SENSEVAL-3 Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, 25-26 July 2004, Barcelona, Spain; 5pp. [PDF, 94KB]

 (2004) Tod Allman & Stephen Beale: An environment for quick ramp-up multi-lingual authoring. International Journal of Translation 16 (1), Jan-June 2004; pp.73-85. [PDF, 142KB]

(2004) Anthony Aue, Arul Menezes, Bob Moore, Chris Quirk & Eric Ringger: Statistical machine translation using labeled semantic dependency graphs; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.125-134. [PDF, 127KB]

(2004) Luisa Bentivogli, Pamela Forner, Bernardo Magnini, & Emanuele Pianta: Revising the Wordnet Domains hierarchy: semantics, coverage and balancing. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.101-108. [PDF, 80KB]

(2004) Key-Sun Choi, Hee-Sook Bae, Wonseok Kang, Juho Lee, Eunhe Kim, Hekyeong Kim, Donghee Kim, Youngbin Song, & Hyosik Shin: Korean-Chinese-Japanese multilingual wordnet with shared semantic hierarchy.  LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1131-1134. [PDF, 518KB]

(2004) Pascale Fung & Benfeng Chen: BiFrameNet: bilingual frame semantics resource construction by cross-lingual induction. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 432KB]

(2004) Liang Gu & Yuqing Gao: On feature selection in maximum entropy approach to statistical concept-based speech-to-speech translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 115-121 [PDF, 213KB]

(2004) Fei Huang, Stephan Vogel, & Alex Waibel: Improving name entity translation combining phonetic and semantic similarities. HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA;

(2004) Hiroshi Kanayama, Tetsuya Nasukawa, & Hideo Watanabe: Deeper sentiment analysis using machine translation technology. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 323KB]

(2004) Paul Kingsbury, Nianwen Xue, & Martha Palmer: Propbanking in parallel.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 34-37. [PDF, 305KB]

(2004) Yves Lepage: Lower and higher estimates of “true analogies” between sentences contained in a large multilingual corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 999KB]

(2004) Stephan Oepen, Helge Dyvik, Jan Tore Lønning, Erik Velldal, Dorothee Beerman, John Carroll, Dan Flickinger, Lars Hellan, Janne Bondi Johannessen, Paul Meurer, Torbjørn Nordgård & Victoria Rosén: Som å kapp-ete med trollet? -- Towards MRS-based Norwegian-English machine translation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.11-20. [PDF, 227KB]

(2004) Florence Reeder, Bonnie Dorr, David Farwell, Nizar Habash, Stephen Helmreich, Eduard Hovy, Lori Levin, Teruko Mitamura, Keith Miller, Owen Rambow, & Advaith Siddharthan: Interlingual annotation for MT development. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 236-245. [go to publisher details]

(2003) Kurt Eberle: Coordination, incorporation and dynamic semantic representation in transfer. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003 [PDF, 176KB]

(2003) Bassam Haddad & Mustafa Yaseen: Towards semantic composition of Arabic: a λ-DRT based approach. MT Summit IX -- workshop: Machine translation for Semitic languages, New Orleans, USA, 23 September 2003 [PDF, 162KB]

(2003) Hiroyuki Kaji: Word sense acquisition from bilingual comparable corpora. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; pp.32-39 [PDF, 339KB]

(2003) Qing Ma, Yujie Zhang, Masaki Murata, & Hitoshi Isahara: Semantic maps for word alignment in bilingual parallel corpora. ACL-2003: Second SIGHAN Workshop on Chinese Language Processing, July 11-12, 2003, Sapporo, Japan; 6pp. [PDF, 199KB]

(2003) Sabine Ploux & Hyungsuk Ji: A model for matching semantic maps between languages (French/English, English/French). Computational Linguistics 29 (2), pp.155-178. [PDF, 4506KB]

(2003) Hua Wu & Ming Zhou: Synonymous collocation extraction using translation information. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 139KB]

 (2003) Natalia Zaitseva: Matrix-and-frame methods as a means of structural and semantic systematization of terminological vocabulary. International Journal of Translation 15 (1), Jan-June 2003; pp.87-105. [PDF, 360KB]

(2002) Howard J.Bender: Natural intelligence in a machine translation system. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 224-228. [go to publisher details]

(2002) Luisa Bentivogli & Emanuele Pianta: Opportunistic semantic tagging. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp. 1401-1406. [PDF, 61KB]

(2002) Hans C. Boas: Bilingual FrameNet dictionaries for machine translation. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1364-1371. [PDF, 123KB]

(2002) Igor Boguslavsky: Some lexical issues of UNL. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: First international workshop on UNL, other interlinguas and their applications, Las Palmas Canary Islands, 27 May 2002; pp.19-22. [PDF, 146KB]

(2002) Chris Brockett, Takako Aikawa, Anthony Aue, Arul Menezes, Chris Quirk, & Hisami Suzuki: English-Japanese example-based machine translation using abstract linguistic representation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 127KB]

(2002) Barbara Gawronska, Björn Erlendsson and Hanna Duczak: Extracting semantic classes and morphosyntactic features for English-Polish machine translation. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002. [PDF, 252KB]

(2002) Chu-Ren Huang, I-Ju E.Tseng, & Dylan B.S.Tsai: Translating lexical semantic relations: the first step towards multilingual wordnets. Coling-2002: Workshop SEMANET: Building and using Semantic Networks, August 2002, Taipei, Taiwan; 7pp. [PDF, 129KB]

(2002) Mathieu Lafourcade & Christian Boitet: UNL lexical selection with conceptual vectors. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.1958-1964. [PDF, 82KB]

(2002) Li Tangqiu, Yang Xiaofeng, Hong Qingyang, & Li Shaozhi: A hybrid method for syntactic and semantic structure disambiguation of Chinese. In: Chan Sin-wai (ed.) Translation and Information Technology (Hong Kong: Chinese University Press, 2002); pp.43-55.

(2002) Eric Nyberg, Teruko Mitamura, Kathryn Baker, David Svoboda, Brian Peterson, & Jennifer Williams: Deriving semantic knowledge from descriptive texts using an MT system. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp.145-154. [go to publisher details]

(2002) Deepak Sharma, K.Vikram, Manav Ratan Mital, Amitabha Mukerjee & Achla M.Raina: An integrated discourse semantic model for bilingual corpora. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of Technology Bombay; 16pp. [PDF, 645KB]

(2002) Alicia Tribble, Alon Lavie and Lori Levin: Rapid adaptive development of semantic analysis grammars. TMI-2002 conference, Keihanna, Japan, March 13-17, 2002; pp.178-187. [PDF, 159KB]

 (2001) Igor Bolshakov & Alexander Gelbukh: A large database of collocations and semantic references: interlingual applications.  International Journal of Translation 13 (1-2), Jan-Dec 2001; pp.167-187. [PDF, 292KB]

(2001) Kurt Eberle: FUDR-based MT, head switching and the lexicon. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp. 93-98. [PDF, 210KB]

(2001) Alon Lavie, Lori Levin, Tanja Schultz, Chad Langley, Benjamin Han, Alicia Tribble, Donna Gates, Dorcas Wallace, & Kay Peterson: Domain portability in speech-to-speech translation. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 5pp. [PDF, 58KB]

(2001) Mari Olsen, David Traum, Carol van Ess-Dykema & Amy Weinberg: Implicit cues for explicit generation: using telicity as a cue for tense structure in a Chinese to English MT system. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.259-264. [PDF, 185KB]

(2000) Hans Ulrich Block & Tobias Ruland: Integrated shallow linguistic processing. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.143-146. [abstract]

(2000) Johan Bos & Julia Heine: Discourse and dialog semantics for translation. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.336-347. [abstract]

(2000) Caroline Brun, Marc Dymetman, & Veronika Lux: Document structure and multilingual authoring. INLG’2000: Proceedings of the First International Conference on Natural Language Generation, Mitzpe Ramon, Israel, 12-16 June 2000; pp.24-31. [PDF, 729KB]

(2000) Bonnie J. Dorr, Gina-Anne Levow, Dekang Lin, & Scott Thomas: Chinese-English semantic resource construction. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 757-760. [PDF, 31KB]

(2000) Martin C. Emele, Michael Dorna, Anke Lüdeling, Heike Zinsmeister, & Christian Rohrer: Semantic-based transfer. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.359-376. [abstract]

(2000) Barbara Gawronska & Hannah Duczak: Understanding politics by studying weather: a cognitive approach to representation of Polish verbs of motion, appearance, and existence. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.147-157. [go to publisher details]

(2000) Chung-hye Han, Benoit Lavoie, Martha Palmer, Owen Rambow, Richard Kittredge, Tanya Korelsky, Nari Kim, & Myunghee Kim: Handling structural divergences and recovering dropped arguments in a Korean/English machine translation system. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.40-53. [go to publisher details]

(2000) Stephen Helmreich & David Farwell: Text meaning representation as a basis for representation of text interpretation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.179-188. [go to publisher details]

(2000) Walter Kasper: Multilingual semantic databases. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.348-356. [abstract]

(2000) Karin Kipper and Martha Palmer: Representation of actions as an interlingua. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 12-17 [PDF, 747KB]

(2000) R. Muñoz, M. Saiz-Noeda, A. Suárez, & M. Palomar: Semantic approach to bridging reference resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1750KB]

(2000) Uzzi Ornan & Israel Gutter: Machine translation by semantic features. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1471KB]

 (2000) Manfred Pinkal, C.J. Rupp, & Karsten Worm: Robust semantic processing of spoken language. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 321-335. [abstract]

(2000) C.J. Rupp, Jörg Spilker, Martin Klarner, & Karsten L. Worm: Combining analyses from various parsers. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 311-320. [abstract]

(2000) Maximiliano Saiz-Noeda, Manuel Palomar, & David Farwell: NLP system oriented to anaphora resolution. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 7pp. [PDF, 1564KB]

 (2000) Michael Schiehlen: Semantic construction.  In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 200-215. [abstract]

 (2000) Michael Schiehlen, Johan Bos, & Michael Dorna: Verbmobil Interface Terms (VITs). In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.183-199. [abstract]

(2000) David Traum & Nizar Habash: Generation from lexical conceptual structures. NAACL-ANLP 2000 Workshop: Applied Interlinguas: Practical Applications of Interlingual Approaches to NLP, Seattle, May 2000; pp. 52-59 [PDF, 629KB]

 (2000) Hans Uszkoreit, Dan Flickinger, Walter Kasper, & Ivan A. Sag: Deep linguistic analysis with HPSG. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 216-237. [abstract]

Speech recognition

(2004) Hanne Fersøe, Elviira Hartikainen, Henk van den Heuvel, Giulio Maltese, Asuncion Moreno, Shaunie Shammass, & Ute Ziegenhain: Creation and validation of large lexica for speech-to-speech translation purposes. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1431-1434. [PDF, 701KB]

(2004) Satoshi Nakamura, Konstantin Markov, Takatoshi Jitsuhiro, Jin-Song Zhang, Hirofumi Yamamoto & Genichiro Kikui: Multi-lingual speech recognition system for speech-to-speech translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 146-154 [PDF, 181KB]

(2004) Tanja Schultz, Szu-Chen Jou, Stephan Vogel, & Shirin Saleem: Using word lattice information for a tighter coupling in speech translation systems. Interspeech 2004 – ICSLP 8th International  Conference on  Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004; pp.41-44; abstract [PDF, 61KB]

(2004) Ruiqiang Zhang, Genichiro Kikui, Hirofumi Yamamoto, Taro Watanabe, Frank Soong, & Wai Kit Lo: A unified approach in speech-to-speech translation: integrating features of speech recognition and machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 116KB]

(2004) Ruiqiang Zhang, Genichiro Kikui, Hirofumi Yamamoto, Frank K.Soong, Taro Watanabe, Eiichiro Sumita, & Wai-Kit Lo: Improved spoken language translation using n-best speech recognition hypotheses. Interspeech 2004 – ICSLP 8th International  Conference on  Spoken Language Processing, Jeju Island, Korea, October 4-8, 2004; pp.1629-1632; abstract [PDF, 61KB]

(2003) Yves Champollion: Convergence in CAT: blending MT, TM, OCR & SR to boost productivity. Translating and the Computer 25: proceedings of the Twenty-fifth International Conference on Translating and the Computer, 20-21 November 2003, London. (London: Aslib, 2003); 13pp. [PDF, 51KB]

(2003) Woosung Kim & Sanjeev Khudanpur: Cross-lingual lexical triggers in statistical language modeling. EMNLP-2003: proceedings of the 2003 conference on Empirical Methods in Natural Language Processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03, 11-12 July 2003, Sapporo, Japan; 8pp. [PDF, 96KB]

(2003) Fu-Hua Liu, Yuqing Gao, Liang Gu, & Michael Picheny: Noise robustness in speech to speech translation.  Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2797-2800; abstract [PDF, 34KB]

(2003) David Stallard, John Makhoul, Frederick Choi, Ehry Macrostie, Premkumar Natarajan, Richard Schwartz, & Bushra Zawaydeh: Design and evaluation of a limited two-way speech translator. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2221-2224; abstract [PDF, 34KB]

(2003) Bo Xu, Shuwu Zhang, & Chengqing Zong: Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2329-2332; abstract [PDF, 34KB]

(2002) Michael Benis: Softly spoken or hard of hearing?  Language International 14 (3), June 2002; pp.26-29. [PDF, 1061KB]

(2002) Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble, & Alex Waibel: Improvements in non-verbal cue identification using multilingual phone strings; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp. 101-108 [PDF, 107KB]

(2002) Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble, & Alex Waibel: Speaker, accent, and language identification using multilingual phone strings. HLT 2002: Human Language Technology Conference: proceedings of the second international conference on human language technology research, March 24-27, 2002, San Diego, California; ed. Mitchell Marcus [San Francisco, CA: Morgan Kaufmann for DARPA]; pp. 125-131. [PDF, 130KB]

(2002) Fumiaki Sugaya, Toshiyuki Takezawa, Genichiro Kikui, & Seiichi Yamamoto: Proposal of a very-large-corpus acquisition method by cell-formed registration. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.326-328. [PDF, 32KB]

(2001) Jeff Allen: Speech technology after the hoax. International Journal for Language and Documentation 9, May 2001/June 2001; pp.41-42. [PDF, 403KB]

(2001) Helen Meng, Berlin Chen, Sanjeev Khudanpur, Gina-Anne Levow, Wai-Kit Lo, Douglas Oard, Patrick Schone, Karen Tang, Hsin-Min Wang, & Jianqiang Wang: Mandarin-English information (MEI): investigating translingual speech retrieval. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 7pp. [PDF, 187KB]

(2001) Franz Josef Och & Hermann Ney: What can machine translation learn from speech recognition? MT Summit VIII, Santiago de Compostela, Spain, 18-22 September 2001. Towards a Road Map for MT [PDF, 61KB]

(2000) Matthias Eichner, Matthias Wolff, & Rüdiger Hoffmann: Data driven generation of pronunciation dictionaries. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.95-105. [abstract]

(2000) Lode Goukens: CEBIT overview: voice processing and MT get popular. International Journal for Language and Documentation 4, April 2000; pp.22-25. [PDF, 1054KB]

(2000) Udo Haiber, Helmut Mangold, Thilo Pfau, Peter Regel-Brietzmann, Günther Ruske, & Volker Schleß: Robust recognition of spontaneous speech. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.46-62. [abstract]

(2000) Erhard W. Hinrichs, Sandra Kübler, Valia Kordoni, & Frank H. Müller: Robust chunk parsing for spontaneous speech. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.163-182. [abstract]

(2000) Gareth J.F.Jones: Applied machine translation resources for cross-language information access from spoken documents. MT2000: machine translation and multilingual applications in the new millennium: international conference at the  University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 9pp. [PDF, 2044KB]

(2000) Stephan Kanthak, Achim Sixtus, Sirko Molau, Ralf Schlüter, & Hermann Ney: Fast search for large vocabulary speech recognition. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.63-78. [abstract]

(2000) Yevgeny Ludovik & Ron Zacharski: MT and topic-based techniques to enhance speech recognition systems for professional translators. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1061-1065 [PDF, 382KB]

(2000) Michael Malenke, Marcus Bäumler, & Erwin Paulus: Speech recognition performance assessment. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 583-591. [abstract]

(2000) Jochen Peters & Dietrich Klakow: Capturing long range correlations using log-linear language models. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.79-94. [abstract]

(2000) Jörg Spilker, Martin Klarner, & Günther Görz: Processing self corrections in a speech to speech system. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July -4 August 2000; pp.1116-1120. [PDF, 456KB]

(2000) Jörg Spilker, Martin Klarner, & Günther Görz: Processing self-corrections in a speech-to-speech system. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.131-140. [abstract]

(2000) Jay Tucker & Ace Sarich: A voice-enabled phrase-based translation system. Translating and the Computer 22: proceedings of the Twenty-second International Conference… 16-17 November 2000 (London: Aslib, 2000); 4pp.  [PDF, 32KB]

(2000) Alex Waibel, Hagen Soltau, Tanja Schultz, Thomas Schaaf & Florian Metze: Multilingual speech recognition. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.33-45. [abstract]

Speech synthesis

(2001) Jeff Allen: Speech technology after the hoax. International Journal for Language and Documentation 9, May 2001/June 2001; pp.41-42. [PDF, 403KB]

(2000) Jochen Steffens & Erwin Paulus: Speech synthesis quality assessment. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 592-596.  [abstract]

(2000) Karlheinz Stöber, Petra Wagner, Jörg Helbig, Stefanie Köster, David Stall, Matthias Thomae, Jens Blauert, Wolfgang Hess, Rüdiger Hoffmann, & Helmut Mangold: Speech synthesis using multilevel selection and concatenation of units from large speech corpora. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 519-534. [abstract]

Syntactic analysis and representation (see also Case grammar and case frames, Chart parsing, Dependency relations, Discontiguous relations, Tagging)

(2004) Elliott Franco Drábek & David Yarowsky: Improving bitext word alignments via syntax-based reordering of English.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 199KB]

(2004) Mark Dras & Chung-hye Han: Non-contiguous tree parsing; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.30-39. [PDF, 82KB]

(2004) Daniel Gildea:  Dependencies vs constituents for tree-based alignment. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 97KB]

(2004) Declan Groves, Mary Hearne, & Andy Way: Robust sub-sentential alignment of phrase-structure trees. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 94KB]

(2004) Jonas Kuhn: Experiments in parallel-text based grammar induction. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.470-477. [PDF, 99KB]

(2004) Sonja Nießen & Hermann Ney: Statistical machine translation with scarce resources using morpho-syntactic information. Computational Linguistics 30 (2), pp.181-204. [PDF, 147KB]

(2004) Sylwia Ozdowska: Identifying correspondences between words: an approach based on a bilingual syntactic analysis of French/English parallel corpora. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.55-62. [PDF, 213KB]

(2004) Katharina Probst & Alon Lavie: A structurally diverse minimal corpus for eliciting structural mappings between languages. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 217-226. [go to publisher details]

(2004) I.Dan Melamed: Algorithms for syntax-aware statistical machine translation; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.40-54. [PDF, 425KB]

(2004) I. Dan Melamed: Statistical machine translation by parsing.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.653-660. [PDF, 134KB]

(2004) Franz Josef Och, Daniel Gildea, Sanjeev Khudanpur, Anoop Sarkar, Kenji Yamada, Alex Fraser, Shankar Kumar, Libin Shen, David Smith, Katherine Eng, Viren Jain, Zhen Jin, & Dragomir Radev: A smorgasbord of features for statistical machine translation.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 161-168. [PDF, 192KB]

(2004) Maja Popović & Hermann Ney: Improving word alignment quality using morpho-syntactic information. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 5pp. [PDF, 141KB]

(2004) Gábor Prószéky, László Tihanyi, & Gábor Ugray: Moose: a robust high-performance parser and generator. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 138-142. [PDF, 179KB]

(2004) Eric Ringger, Michael Gamon, Robert C.Moore, David Rojas, Martine Smets, & Simon Corston-Oliver: Linguistically informed statistical models of constituent structure for ordering in sentence realization.  Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 116KB]

(2004) Bettina Schrader: Improving word alignment quality using linguistic knowledge. LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 46-49. [PDF, 233KB]

(2004) David A.Smith & Noah A.Smith: Bilingual parsing with factored estimation: using English to parse Korean. [EMNLP 2004] Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 90KB]

(2004) Wong Fai, Hu Dong Cheng, Mao Yu Hang, Tang Chi Wai, & Dong Ming Chui: Application of translation corresponding tree (TCT) annotation schema in example-based machine translation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 42-45. [PDF, 474KB]

(2003) Eugene Charniak, Kevin Knight, & Kenji Yamada: Syntax-based language models for statistical machine translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 71KB]

(2003) Dinh Dien, Nguyen Luu Thuy Ngan, Do Xuan Quang, & Van Chi Nam: A hybrid approach to word order transfer in the English-to-Vietnamese machine translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 184KB]

(2003) Dinh Dien, Kiem Hoang, & Eduard Hovy: BTL: a hybrid model for English-Vietnamese machine translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 199KB]

(2003) Jason Eisner: Learning non-isomorphic tree mappings for machine translation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 89KB]

(2003) Daniel Gildea: Loosely tree-based alignment for machine translation. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 109KB]

(2003) Mary Hearne & Andy Way: Seeing the wood for the trees: data-oriented translation. MT Summit IX, New Orleans, USA, 23-27 September 2003 [PDF, 88KB]

(2003) Dekang Lin & Colin Cherry: Word alignment with cohesion constraint. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 77KB]

(2003) I.Dan Melamed: Multitext grammars and synchronous parsers. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 1126KB]

(2003) Margrethe Møller: Grammatical metaphor, controlled language and machine translation. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.95-104. [PDF, 66KB]

(2003) Eman Othman, Khaled Shaalan & Ahmed Rafea: A chart parser for analyzing modern standard Arabic sentence. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 186KB]

(2003) Bo Pang, Kevin Knight, & Daniel Marcu: Syntax-based alignment of multiple translations: extracting paraphrases and generating new sentences. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 91KB]

(2003) Remedios Ruiz Cascales & Richard F.E.Sutcliffe: A specification and validating parser for simplified technical Spanish. Controlled language translation, EAMT-CLAW-03, Dublin City University, 15-17 May 2003; pp.35-44. [PDF, 133KB]

(2003) Charles Schafer & David Yarowsky: A two-level syntax-based approach to Arabic-English statistical machine translation. MT Summit IX -- workshop: Machine translation for semitic languages, New Orleans, USA, 23 September 2003 [PDF, 403KB]

(2003) Charles Schafer & David Yarowsky: Statistical machine translation using coercive two-level syntactic transduction. EMNLP-2003: proceedings of the 2003 conference on Empirical Methods in Natural Language Processing, a meeting of SIGDAT, a special interest group of the ACL, held in conjunction with ACL-03, 11-12 July 2003, Sapporo, Japan; 8pp. [PDF, 363KB]

(2003) Bo Xu, Shuwu Zhang, & Chengqing Zong: Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing. Eurospeech 2003 - Interspeech 2003 8th European  Conference on  Speech Communication and Technology, Geneva, Switzerland, September 1-4, 2003; pp.2329-2332; abstract [PDF, 34KB]

(2003) Nianwen Xue & Seth Kulick: Automatic predicate argument structure analysis of the Penn Chinese Treebank. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.418-425. [PDF, 53KB]

(2003) Natalia Zaitseva: Matrix-and-frame methods as a means of structural and semantic systematization of terminological vocabulary. International Journal of Translation 15 (1), Jan-June 2003; pp.87-105. [PDF, 360KB]

(2003) Bing Zhao & Stephan Vogel: Word alignment based on bilingual bracketing. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 56KB]

(2002) Prachya Boonkwan & Asanee Kawtrakul: Plaesarn: machine-aided translation tool for English-to-Thai; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei, Taiwan; 7pp. [PDF, 218KB]

(2002) Heidi J. Fox: Phrasal cohesion and statistical machine translation. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.304-311 [PDF, 106KB]

(2002) Nizar Habash & Bonnie Dorr: Handling translation divergences: combining statistical and symbolic techniques in generation-heavy machine translation. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 84-93. [go to publisher details]

(2002) Chad Langley, Alon Lavie, Lori Levin, Dorcas Wallace, Donna Gates, & Kay Peterson: Spoken language parsing using phrase-level grammars and trainable classifiers; ACL-2002 workshop "Speech-to-speech translation", 11 July 2002, Philadelphia, USA; pp.15-22 [PDF, 54KB]

(2002) Sonja Nießen: Improving statistical machine translation using morpho-syntactic information. PhD thesis, Rheinisch-Westfälische Technische Hochschule Aachen, December 2002. 123pp. [PDF, 1505KB]

(2002) Hisami Suzuki: A development environment for large-scale multi-lingual parsing systems.  Coling-2002: Workshop on Grammar Engineering and Evaluation, August 2002, Taipei, Taiwan; 7pp. [PDF, 83KB]

(2002) Kenji Yamada & Kevin Knight: A decoder for syntax-based statistical MT. ACL-2002: 40th Annual meeting of the Association for Computational Linguistics, Philadelphia, July 2002; pp.303-310. [PDF, 788KB]

(2001) Mikel L.Forcada: Learning machine translation strategies using commercial systems: discovering word reordering rules. In: Machine Translation Review, issue 12: December 2001; pp.13-18.

(2001) Enya Kong Tang & Mosleh H. Al-Adhaileh: Converting a bilingual dictionary into a bilingual knowledge bank based on the synchronous SSTC.  MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.351-356. [PDF, 76KB]

(2001) Kenji Yamada & Kevin Knight: A syntax-based statistical translation model. ACL-EACL-2001: 39th Annual meeting [of the Association for Computational Linguistics] and 10th Conference of the European Chapter [of ACL], July 9th - 11th 2001, Toulouse, France; pp.523-530. [PDF, 147KB]

(2000) Hans Ulrich Block & Tobias Ruland: Integrated shallow linguistic processing. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.143-146. [abstract]

(2000) Margo Budzikowska: Information structure transfer: bridging the information gap in structurally different languages. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.80-88. [go to publisher details]

(2000) Marc Dymetman & Frédéric Tendeau: Context-free grammar rewriting and the transfer of packed linguistic representations. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1016-1020 [PDF, 515KB]

(2000) Mikel L. Forcada: Learning machine translation strategies using commercial systems: discovering word-reordering rules. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1379KB]

(2000) Chung-hye Han, Benoit Lavoie, Martha Palmer, Owen Rambow, Richard Kittredge, Tanya Korelsky, Nari Kim, & Myunghee Kim: Handling structural divergences and recovering dropped arguments in a Korean/English machine translation system. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.40-53. [go to publisher details]

(2000) Erhard W. Hinrichs, Sandra Kübler, Valia Kordoni, & Frank H. Müller: Robust chunk parsing for spontaneous speech. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.163-182. [abstract]

(2000) Jin-Xia Huang, Hyi-Jeong Song, Yeong-Mi Song, Ji-Hyoun Kim, Won-Sek Kang, Chung-Won Seo, Young-Souk Chae, & Key-Sun Choi: MATES/CK: a Chinese-Korean machine translation. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong. Demonstration notes, 3-6 October 2000; pp.32-33 [PDF, 154KB]

(2000) Kyonghi Moon & Jong-Hyeok Lee: Representation and recognition method for multi-word translation units in Korean-to-Japanese MT system. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 544-550 [PDF, 598KB]

(2000) Sonja Niessen & Hermann Ney: Improving SMT quality with morpho-syntactic analysis. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1081-1085 [PDF, 430KB]

(2000) Tobias Ruland: Probabilistic LR-parsing with symbolic postprocessing. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp.147-162. [abstract]

(2000) Hristo Tanev & Ruslan Mitkov: LINGUA – a robust architecture for text processing and anaphora resolution in Bulgarian. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1780KB]

(2000) Christoph Tillmann & Hermann Ney: Word reordering and DP-based search in statistical machine translation. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 850-856 [PDF, 603KB]

(2000) Hans Uszkoreit, Dan Flickinger, Walter Kasper, & Ivan A. Sag: Deep linguistic analysis with HPSG. In: Wolfgang Wahlster (ed.) Verbmobil: foundations of speech-to-speech translation. (Berlin: Springer, 2000); pp. 216-237. [abstract]

Syntactic transfer (see also Reordering)

(2004) Eneko Agirre, Aitziber Atutxa, Koldo Gojenola, & Kepa Sarasola: Exploring portability of syntactic information from English to Basque. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.47-50. [PDF, 270KB]

(2004) Eiji Aramaki & Sadao Kurohashi: Example-based machine translation using structural translation examples. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 91-94 [PDF, 203KB]

(2004) Yuan Ding & Martha Palmer: Synchronous dependency insertion grammars: a grammar formalism for syntax based statistical MT. Coling’04 workshop: Proceedings Recent Advances in Dependency Grammar, August 28, 2004, Geneva, Switzerland; 8pp. [PDF, 135KB]

(2004) Jonathan Graehl & Kevin Knight: Training tree transducers.  HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp. 105-112. [PDF, 157KB]

(2004) Liang Gu & Yuqing Gao: On feature selection in maximum entropy approach to statistical concept-based speech-to-speech translation. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 115-121 [PDF, 213KB]

(2004) Kenji Imamura, Hideo Okuma, Taro Watanabe, & Eiichiro Sumita: Example-based machine translation based on syntactic transfer with statistical models. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 56KB]

(2004) Dekang Lin: A path-based transfer model for machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 6pp. [PDF, 201KB]

(2004) Fai Wong, Dong Cheng Hu, Yu Hang Mao, & Ming Chui Dong: A flexible example annotation schema: translation corresponding tree representation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 178KB]

(2004) Hao Zhang & Daniel Gildea: Syntax-based alignment: supervised or unsupervised? Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 87KB]

(2003) Arul Menezes & Stephen D. Richardson: A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp.421-442.

(2003) Hideo Watanabe, Sadao Kurohashi & Eiji Aramaki: Finding translation patterns from paired source and target dependency structures. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 397-420.

(2002) Etienne Blanc, Gilles Sérasset, & WangJu Tsai: Structural and lexical transfer from an UNL graph to an equivalent natural language dependency tree. LREC-2002: Third International Conference on Language Resources and Evaluation. Workshop: First international workshop on UNL, other interlinguas and their applications, Las Palmas Canary Islands, 27 May 2002; pp.14-18. [PDF, 53KB]

(2001) Andy Way: Solving headswitching translation cases in LFG-DOT. Proceedings of the LFG conference, University of Hong Kong, 25-27 June 2001 [Stanford University, CSLI Publications, 2001]; pp.305-323. [PDF, 142KB]

(2000) Margo Budzikowska: Information structure transfer: bridging the information gap in structurally different languages. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca, Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.80-88. [go to publisher details]

(2000) Marc Dymetman & Frédéric Tendeau: Context-free grammar rewriting and the transfer of packed linguistic representations. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July - 4 August 2000; pp. 1016-1020 [PDF, 515KB]

(2000) Benoit Lavoie, Richard Kittredge, Tanya Korelsky, & Owen Rambow: A framework for MT and multilingual NLG systems based on uniform lexico-structural processing. ANLP-NAACL-2000: proceedings of the Sixth conference on Applied Natural Language Processing and 1st Meeting of the North American Chapter of the Association for Computational Linguistics, April 29 – May 4, 2000, Seattle, Washington; pp. 60-67. [PDF, 527KB]

Tagging (see also Segmentation)

(2004) Jiri Hana, Anna Feldman & Chris Brew: A resource-light approach to Russian morphology: tagging Russian using Czech resources. [EMNLP 2004] Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 91KB]

(2004) Hermann Ney, Maja Popović, & David Sündermann: Error measures and Bayes decision rules revisited with applications to POS tagging.  EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 7pp. [PDF, 72KB]

(2004) Felipe Sánchez-Martínez, Juan Antonio Pérez-Ortiz & Mikel L. Forcada: Cooperative unsupervised training of the part-of-speech taggers in a bidirectional machine translation system; TMI-2004: proceedings of the Tenth Conference on Theoretical and Methodological Issues in Machine Translation, October 4-6, 2004, Baltimore, Maryland, USA; pp.135-144. [PDF, 92KB]

(2003) Tantely Andriamanankasina, Kenji Araki & Koji Tochinai: EBMT of POS-tagged sentences by recursive division via inductive learning. In: Michael Carl & Andy Way (eds.) Recent advances in example-based machine translation (Dordrecht: Kluwer Academic Publishers, 2003), pp. 225-252.

(2003) Katharina Probst: Using ‘smart’ bilingual projection to feature-tag a monolingual dictionary. HLT-NAACL 2003: proceedings of Seventh Conference on Natural Language Learning, May 27 – June 1, 2003, Edmonton, Canada; 8pp. [PDF, 108KB]

(2003) Nicola Ueffing & Hermann Ney: Using POS information for statistical machine translation into morphologically rich languages. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.347-354 [PDF, 384KB]

(2003) Dinh Dien & Hoang Kiem: POS-tagger for English Vietnamese bilingual corpus. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 202KB]

(2002) Reinhard Rapp: A part-of-speech-based search algorithm for translation memories. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.466-472. [PDF, 102KB]

(2001) David Yarowsky, Grace Ngai, & Richard Wicentowski: Inducing multilingual text analysis tools via robust projection across aligned corpora.  HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 4pp. [PDF, 161KB]

(2000) Mona Diab: An unsupervised method for multilingual word sense tagging using parallel corpora: a preliminary investigation. ACL 2000 Workshop on Word Senses and Multi-linguality, Hong Kong, October 2000; pp.1-9 [PDF, 785KB]

(2000) Jan Hajič: Morphological tagging: data vs. dictionaries. [NAACL 2000] 1st Meeting of the North American Chapter of the Association for Computational Linguistics: proceedings of the conference, April 29-May 4, 2000, Seattle, Washington, USA; pp.94-101. [PDF,

(2000) Carlo Minnaja & Laura G. Paccagnella: A part-of-speech tagger for Esperanto oriented to MT. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 5pp. [PDF, 1045KB]

(2000) Hristo Tanev & Ruslan Mitkov: LINGUA – a robust architecture for text processing and anaphora resolution in Bulgarian. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1780KB]

Word alignment

(2004) Necip Fazil Ayan, Bonnie J. Dorr, & Nizar Habash: Multi-Align: combining linguistic and statistical techniques to improve alignments for adaptable MT. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 17-26. [go to publisher details]

(2004) Ana-Maria Barbu: Simple linguistic methods for improving a word alignment algorithm. JADT 2004: 7es Journées internationales d’Analyse statistique des Données Textuelles, 10-12 mars 2004, Louvain La Neuve, Belgique; pp.88-98. [PDF, 89KB]

(2004) Ana-Maria Barbu: A word alignment system based on a translation equivalence extractor. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.2195-2198. [PDF, 388KB]

(2004) Luisa Bentivogli, Pamela Forner, & Emanuele Pianta: Evaluating cross-language annotation transfer in the MultiSemCor corpus. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 66KB]

(2004) Michael Carl, Ecaterina Rascu, & Johann Haller: Using weighted abduction to align term variant translations in bilingual texts. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1973-1976. [PDF, 294KB]

(2004) Simon Corston-Oliver & Michael Gamon: Normalizing German and English inflectional morphology to improve statistical word alignment. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 48-57. [go to publisher details]

(2004) Hal Daumé III & Daniel Marcu: A phrase-based HMM approach to document/abstract alignment.  EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 111KB]

(2004) Elliott Franco Drábek & David Yarowsky: Improving bitext word alignments via syntax-based reordering of English.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 199KB]

(2004) Donghui Feng, Yajuan Lv, & Ming Zhou: A new approach for English-Chinese named entity alignment. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 207KB]

(2004) Michel Galley, Mark Hopkins, Kevin Knight & Daniel Marcu: What’s in a translation rule? HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp.273-280. [PDF, 568KB]

(2004) Cyril Goutte, Kenji Yamada, & Eric Gaussier: Aligning words using matrix factorisation. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.502-509. [PDF, 97KB]

(2004) Hou Hongxu, Deng Dan, Zou Gang, Yu Hongkui, Liu Yang, Xiong Deyi & Liu Qun: An EBMT system based on word alignment. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 47-49 [PDF, 132KB]

(2004) Jonas Kuhn: Experiments in parallel-text based grammar induction. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.470-477. [PDF, 99KB]

(2004) Chun-Jen Lee, Jason S. Chang, & Thomas C. Chuang: Alignment of bilingual named entities in parallel corpora using statistical model. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 144-153. [go to publisher details]

(2004) Dekang Lin: A path-based transfer model for machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 6pp. [PDF, 201KB]

(2004) Evgeny Matusov, Richard Zens, & Hermann Ney: Symmetric word alignments for statistical machine translation. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 162KB]

(2004) Robert C. Moore: Improving IBM word-alignment model 1. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.518-525. [PDF, 70KB]

(2004) Franz Josef Och & Hermann Ney: The alignment template approach to statistical machine translation. Computational Linguistics 30 (4), pp.417-449 [PDF, 486KB]

(2004) Sylwia Ozdowska: Identifying correspondences between words: an approach based on a bilingual syntactic analysis of French/English parallel corpora. Coling 2004: Proceedings of the Workshop on Multilingual Linguistic Resources (MLR2004), August 28th 2004, University of Geneva, Switzerland; pp.55-62. [PDF, 213KB]

(2004) Emanuele Pianta & Luisa Bentivogli: Knowledge intensive word alignment with KNOWA. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 70KB]

(2004) Maja Popović & Hermann Ney: Improving word alignment quality using morpho-syntactic information. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 5pp. [PDF, 141KB]

(2004) Suhaimi Ab. Rahman & Normaziah Abdul Aziz: Improving word alignment in an English-Malay parallel corpus for machine translation.  LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 22-25. [PDF, 414KB]

(2004) Bettina Schrader: Improving word alignment quality using linguistic knowledge. LREC-2004. Workshop, 25th May 2004: The amazing utility of parallel and comparable corpora; pp. 46-49. [PDF, 233KB]

(2004) Jörg Tiedemann: Word to word alignment strategies. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 161KB]

(2004) Dan Tufiş, Radu Ion, & Nancy Ide: Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 192KB]

(2004) Dan Tufiş: Term translations in multilingual corpora: discovery and consistency check. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1981-1984. [PDF, 749KB]

(2004) Wei Wang & Ming Zhou: Improving word alignment models using structured monolingual corpora. EMNLP-2004: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 25-26 July 2004, Barcelona, Spain; 8pp. [PDF, 209KB]

(2004) Hua Wu & Haifeng Wang: Improving domain-specific word alignment with a general bilingual corpus. Machine translation: from real users to research: 6th conference of the Association for Machine Translation in the Americas, AMTA 2004, Washington, DC, September 28 – October 2, 2004; ed. Robert E.Frederking and Kathryn B.Taylor (Berlin: Springer Verlag, 2004); pp. 262-271. [go to publisher details]

(2004) Wu Hua & Wang Haifeng: Improving statistical word alignment with a rule-based machine translation system. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 364KB]

(2004) Wu Hua & Wang Haifeng: Improving domain-specific word alignment for computer assisted translation. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 221KB]

(2004) Yang Muyun, Zhao Tiejun, Liu Haijie, Shi Xiasheng & Jiang Hongfei: Auto word alignment based Chinese-English EBMT. International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation [IWSLT 2004], September 30 – October 1, 2004, Kyoto, Japan; pp. 27-29 [PDF, 232KB]

(2004) Richard Zens, Evgeny Matusov, & Hermann Ney: Improved word alignment using a symmetric lexicon model. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 153KB]

(2003) Lars Ahrenberg, Magnus Merkel & Michael Petterstedt: Interactive word alignment for language engineering.  EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.49-52 [PDF, 415KB]

(2003) Magnus Merkel, Michael Petterstedt & Lars Ahrenberg: Interactive word alignment for corpus linguistics. Proceedings Corpus Linguistics 2003, 28-31 March, 2003, Lancaster. UK. 10pp. [PDF, 428KB]

(2003) Michael Carl & Sisay Fissaha: Phrase-based evaluation of word-to-word alignments. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 253KB]

(2003) Colin Cherry & Dekang Lin: A probability model to improve word alignment. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 90KB]

(2003) Katri A. Clodfelder: An LSA implementation against parallel texts in French and English. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 119KB]

(2003) Herve Dejean, Eric Gaussier, Cyril Goutte & Kenji Yamada: Reducing parameter space for word alignment. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 42KB]

(2003) Yuan Ding, Daniel Gildea, & Martha Palmer: An algorithm for word-level alignment of parallel dependency trees. MT Summit IX, New Orleans, USA, 23-27 September 2003. [PDF, 242KB]

(2003) John C. Henderson: Word alignment baselines. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 54KB]

(2003) Grzegorz Kondrak, Daniel Marcu, & Kevin Knight: Cognates can improve statistical translation models. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 28KB]

(2003) Shankar Kumar & William Byrne: A weighted finite state transducer implementation of the alignment template model for statistical machine translation. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; pp.63-70 [PDF, 141KB]

(2003) Dekang Lin & Colin Cherry: Word alignment with cohesion constraint. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; 3pp. [PDF, 77KB]

(2003) Dekang Lin & Colin Cherry: ProAlign: shared task system description. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 85KB]

(2003) Qing Ma, Yujie Zhang, Masaki Murata, & Hitoshi Isahara: Semantic maps for word alignment in bilingual parallel corpora. ACL-2003: Second SIGHAN Workshop on Chinese Language Processing, July 11-12, 2003, Sapporo, Japan; 6pp. [PDF, 199KB]

(2003) Bridget Thomson McInnes & Ted Pedersen: The Duluth word alignment system. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 52KB]

(2003) Rada Mihalcea & Ted Pedersen: An evaluation exercise for word alignment. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 131KB]

(2003) Franz Josef Och & Hermann Ney: A systematic comparison of various statistical alignment models. Computational Linguistics 29 (1), pp.19-51 [PDF, 296KB]

(2003) Jörg Tiedemann: Combining clues for word alignment. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.339-346 [PDF, 433KB]

(2003) Dan Tufis, Ana-Maria Barbu, & Radu Ion: TREQ-AL: a word alignment system with limited language resources. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 164KB]. [A new version describing the TREQ-AL system after bug fixes is also available: PDF, 53KB]

(2003) Stephan Vogel: Using noisy bilingual data for statistical machine translation. EACL 2003: 10th Conference of the European Chapter of the Association for Computational Linguistics, April 12-17, 2003, Budapest, Hungary. Proceedings; pp.175-178 [PDF, 194KB]

(2003) Bing Zhao & Stephan Vogel: Word alignment based on bilingual bracketing. HLT-NAACL 2003 Workshop, "Building and using parallel texts: data driven machine translation and beyond", 31 May 2003, Edmonton, Canada. [PDF, 56KB]

(2002) Lars Ahrenberg, Mikael Andersson, & Magnus Merkel: A system for incremental and interactive word linking. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.485-490. [PDF, 963KB]

(2002) Lawrence Cheung, Tom Lai, Robert Luk, Oi Yee Kwong, King Kui Sin, & Benjamin K.Tsou: Some considerations on guidelines for bilingual alignment and terminology extraction; Coling-2002: First SIGHAN Workshop on Chinese Language Processing, 1 September 2002, Taipei,Taiwan; 5pp. [PDF, 177KB]

(2002) Paul C. Davis: Stone soup translation: the linked automata model. Ph D thesis, Ohio State University, 2002. 323pp. [PDF, 1078KB]

(2002) Dien Dinh: Building a training corpus for word sense disambiguation in English-to-Vietnamese machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei,Taiwan; 7pp. [PDF, 281KB]

(2002) Bonnie J.Dorr, Lisa Pearl, Rebecca Hwa, & Nizar Habash: DUSTer: a method for unraveling cross-language divergences for statistical word-level alignment. Machine translation: from research to real users: 5th conference of the Association for Machine Translation in the Americas, AMTA 2002, Tiburon, CA, October 2002; ed. Stephen D. Richardson (Berlin: Springer Verlag, 2002); pp. 31-43. [go to publisher details]

(2002) Shankar Kumar & William Byrne: Minimum Bayes-risk word alignments of bilingual texts. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.140-147 [PDF, 263KB]

(2002) Oi Yee Kwong, Benjamin K.Tsou, Tom B.Y.Lai, Robert W.P.Luk, Lawrence Y.L.Cheung, & Francis C.Y.Chik: Alignment and extraction of bilingual legal terminology from context profiles; Coling-2002: Second international workshop on computational terminology (COMPUTERM 2002), 31 August 2002, Taipei,Taiwan; 7pp. [PDF, 224KB]

(2002) Reinhard  Rapp: A part-of-speech-based search algorithm for translation memories. LREC-2002: Third International Conference on Language Resources and Evaluation. Proceedings, Las Palmas de Gran Canaria, Spain, 27 May – 2 June 2002; pp.466-472. [PDF, 102KB]

(2002) Kristina Toutanova, H.Tolga Ilhan, & Christopher D.Manning: Extensions to HMM-based statistical word alignment models. EMNLP-2002: Proceedings of the 2002 conference on Empirical Methods in Natural Language Processing, July 2002, Philadelphia, USA; pp.87-94 [PDF, 130KB]

(2001) Seonho Kim, Juntae Yoon, & Mansuk Song: Improving lexical mapping model of English-Korean bitext using structural features. EMNLP-2001: Empirical Methods in Natural Language Processing conference, 2-7 June 2001, Cornell University; 8pp. [PDF, 261KB]

(2001) Philipp Koehn & Kevin Knight: Knowledge sources for word-level translation models. EMNLP-2001: Empirical Methods in Natural Language Processing conference, 2-7 June 2001, Cornell University; 9pp. [PDF, 254KB]

(2001) Jessie Pinkham & Monica Corston-Oliver: Adding domain specificity to an MT system. ACL-EACL 2001 workshop "Data-driven machine translation", July 7, 2001, Toulouse, France; pp.103-110. [PDF, 73KB]

(2000) Lars Ahrenberg, Magnus Merkel, Anna Sågvall Hein, & Jörg Tiedemann: Evaluation of word alignment systems. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 1255-1261. [PDF, 124KB]

(2000) Lars Ahrenberg, Mikael Andersson, & Magnus Merkel: A knowledge-lite approach to word alignment [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 97-116.

(2000) Luisa Bentivogli, Emanuele Pianta, & Fabio Pianesi: Coping with lexical gaps when building aligned multilingual wordnets. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 993-997. [PDF, 32KB]

(2000) Lars Borin: You'll take the high road and I'll take the low road: using a third language to improve bilingual word alignment. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 97-103 [PDF, 534KB]

(2000) Yaacov Choueka, Ehud S.Conley & Ido Dagan: A comprehensive bilingual word alignment system: application to disparate languages: Hebrew and English [abstract]. In: Jean Véronis (ed.) Parallel text processing: alignment and use of translation corpora. Dordrecht/Boston/London: Kluwer Academic Publishers, 2000; pp. 69-96.

(2000) Pernilla Danielsson & Katarina Mühlenbock: Small but efficient: the misconception of high-frequency words in Scandinavian translation. Envisioning machine translation in the information future: 4th conference of the Association for Machine Translation in the Americas, AMTA 2000, Cuernavaca,Mexico, October 2000; ed. John S. White (Berlin: Springer Verlag, 2000); pp.158-168. [go to publisher details]

(2000) Jin-Xia Huang & Key-Sun Choi: Chinese-Korean word alignment based on linguistic comparison. ACL-2000: 38th Annual meeting of the Association for Computational Linguistics, Hong Kong, October 2000. [PDF, 208KB]

(2000) Olivier Kraif: Evaluation of statistical tools for automatic extraction of lexical correspondences between parallel texts. MT2000: machine translation and multilingual applications in the new millennium: international conference at the University of Exeter, 20-22 November 2000, organised by the British Computer Society. [London: BCS]; 8pp. [PDF, 1579KB]

(2000) I. Dan Melamed: Models of translation equivalence among words. Computational Linguistics 26 (2), pp.221-249 [PDF, 1844KB]

(2000) Sonja Niessen & Hermann Ney: Improving SMT quality with morpho-syntactic analysis. Coling 2000 in Europe: the 18th International Conference on Computational Linguistics. Proceedings of the conference, Universität des Saarlandes, Saarbrücken, Germany, 31 July – 4 August 2000; pp. 1081-1085 [PDF, 430KB]

(2000) Sun Le, Jin Youbing, Du Lin, & Sun Yufang: Word alignment of English-Chinese bilingual corpus based on chunks. EMNLP-2000: Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, 7-8 October 2000, Hong Kong; pp. 110-116 [PDF, 509KB]

(2000) Jörg Tiedemann: Extracting phrasal terms using bitext. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 7pp. [PDF, 160KB]

(2000) Tamás Váradi: Lexical and translation equivalence in parallel corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 539-543. [PDF, 756KB]

(2000) Lluís de Yzaguirre, Marta Ribas, Jordi Vivaldi, & M.Teresa Cabré: Some technical aspects about aligning near languages. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 545-548. [PDF, 2276KB]

Word sense disambiguation

(2004) Indrajit Bhattacharya, Lise Getoor, & Yoshua Bengio: Unsupervised sense disambiguation using bilingual probabilistic models.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp. 287-294. [PDF, 166KB]

(2004) Chris Callison-Burch, David Talbot, & Miles Osborne: Statistical machine translation with word- and sentence-aligned parallel corpora.  ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the conference, 21-26 July 2004, Barcelona, Spain; pp.175-182. [PDF, 133KB]

(2004) In-Su Kang, Seung-Hoon Na & Jong-Hyeok Lee: Influence of WSD on cross-language information retrieval. First International Joint Conference on Natural Language Processing, Hainan Island, China, March 22-24, 2004; pp.358-366. [abstract]

(2004) Hang Li & Cong Li: Word translation disambiguation using bilingual bootstrapping. Computational Linguistics 30 (1), pp. 1-22. [PDF, 2311KB]

(2004) Tayebeh Mosavi Miangah & Ali Delavar Khalafi: Statistical analysis of target language corpus for word sense disambiguation in a machine translation system. 9th EAMT Workshop, "Broadening horizons of machine translation and its applications", 26-27 April 2004, Malta; pp. 129-137. [PDF, 151KB]

(2004) Charles Schafer & David Yarowsky: Exploiting aggregate properties of bilingual dictionaries for distinguishing senses of English words and inducing English sense clusters. ACL 2004: 42nd annual meeting of the Association for Computational Linguistics: Proceedings of the interactive poster and demonstration sessions, 21-26 July 2004, Barcelona, Spain; 4pp. [PDF, 199KB]

(2004) Dan Tufiş, Radu Ion, & Nancy Ide: Fine-grained word sense disambiguation based on parallel corpora, word alignment, word clustering and aligned wordnets. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp. [PDF, 192KB]

(2004) Dan Tufis, Radu Ion, & Nancy Ide: Word sense disambiguation as a WordNets’ validation method in Balkanet. LREC-2004: Fourth International Conference on Language Resources and Evaluation, Proceedings, Lisbon, Portugal, 26-28 May 2004; pp.1071-1074. [PDF, 422KB]

(2003) Nuno M.F.Dionisio: Nominal taxonomies and word sense disambiguation. Ph D thesis, University of East Anglia, Norwich, November 15, 2003. 312pp. [PDF, 3062KB]

(2003) Hiroyuki Kaji: Word sense acquisition from bilingual comparable corpora. HLT-NAACL 2003: conference combining Human Language Technology conference series and the North American Chapter of the Association for Computational Linguistics conference series, May 27 – June 1, 2003, Edmonton, Canada; pp.32-39 [PDF, 339KB]

(2003) Hwee Tou Ng, Bin Wang, & Yee Seng Chan: Exploiting parallel texts for word sense disambiguation: an empirical study. ACL-2003: 41st Annual meeting of the Association for Computational Linguistics, July 7-12, 2003, Sapporo, Japan. [PDF, 364KB]

(2003) Reinhard Rapp: Word sense discovery based on sense descriptor dissimilarity. MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.315-322. [PDF, 156KB]

(2003) Dominic Widdows, Stanley Peters, Scott Cederberg, Chiu-Ki Chan, Diana Steffen, & Paul Buitelaar: Unsupervised monolingual and bilingual word-sense disambiguation of medical documents using UMLS. ACL 2003 Workshop on Natural Language Processing in Biomedicine, July 2003, Sapporo, Japan; 8pp. [PDF, 124KB]

(2002) Eneko Agirre, Mikel Lersundi, & David Martinez: A multi-lingual approach to disambiguate prepositions and case suffixes; ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", July 2002, Philadelphia, USA; pp. 1-8 [PDF, 166KB]

(2002) You-Jin Chung, Sin-Jae Kang, Kyong-Hi Moon, & Jong-Hyeok Lee: Word sense disambiguation in a Korean-to-Japanese MT system using neural networks; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei,Taiwan; 7pp. [PDF, 497KB]

(2002) Dien Dinh: Building a training corpus for word sense disambiguation in English-to-Vietnamese machine translation; Coling-2002 workshop "Machine translation in Asia", 1 September 2002, Taipei,Taiwan; 7pp. [PDF, 281KB]

(2002) Nancy Ide, Tomaz Erjavec, & Dan Tufis: Sense discrimination with parallel corpora. ACL-2002 SIGLEX/SENSEVAL workshop on Word Sense Disambiguation "Recent successes and future directions", 11 July 2002, Philadelphia, USA; pp. 54-60 [PDF, 387KB]

(2002) Hyun Ah Lee & Gil Chang Kim: Translation selection through source word sense disambiguation and target word selection. Coling 2002, Taipei, Taiwan, 26-30 August 2002 [PDF, 303KB]

(2002) Deepak Sharma, K.Vikram, Manav Ratan Mital, Amitabha Mukerjee & Achla M.Raina: An integrated discourse semantic model for bilingual corpora. ICUKL-2002: International Conference on Universal Knowledge and Language, 25th-29th November 2002, Goa, India, organised by UNDL Foundation and Indian Institute of Technology Bombay; 16pp. [PDF, 645KB]

(2001) Guo-Wei Bian & Chi-Ching Lin: Trans-EZ at NTCIR-2: synset co-occurrence method for English-Chinese cross-lingual information retrieval. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 6pp. [PDF, 41KB]

(2001) Sin-Jae Kang & Jong-Hyeok Lee: Ontology-based word sense disambiguation using semi-automatically constructed ontology. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.181-186. [PDF, 287KB]

(2001) Adam Kilgarriff & David Tugwell: WASP-Bench: an MT lexicographers' workstation supporting state-of-the-art lexical disambiguation. MT Summit VIII: Machine Translation in the Information Age, Proceedings, Santiago de Compostela, Spain, 18-22 September 2001; pp.187-190. [PDF, 175KB]

(2001) Hodong Lee & Jong C.Park: Automatic augmentation of translation dictionary with database terminologies in multilingual query interpretation. ACL 2001 Workshop on Human Language Technology and Knowledge Management, July 2001, Toulouse, France; 10pp. [PDF, 108KB]

(2001) Young-Suk Lee, Wu Sok Yi, Stephanie Seneff, & Clifford J. Weinstein: Interlingua-based broad-coverage Korean-English translation at CCLINC. HLT-2001: Proceedings of the First International Conference on Human Language Technology Research, San Diego, CA, March 18-21, 2001; 6pp. [PDF, 80KB]

(2001) Tatsunori Mori, Tomoharu Kokubu, & Takashi Tanaka: Cross-lingual information retrieval based on LSI with multiple word spaces. NTCIR Workshop 2: Proceedings of the Second NTCIR Workshop on Research in Chinese & Japanese Text retrieval and Text Summarization, March 7-9, 2001, Tokyo, Japan; 8pp. [PDF, 105KB]

(2001) Mark Stevenson & Yorick Wilks: The interaction of knowledge sources in word sense disambiguation. Computational Linguistics 27 (3), pp. 321-349 [PDF, 2076KB]

(2001) Srinivasan Vaidyaraman: Word sense disambiguation through visual thought analysis. International Journal of Translation 13 (1-2), Jan-Dec 2001; pp.147-158. [PDF, 52KB]

(2000) Mona Diab: An unsupervised method for multilingual word sense tagging using parallel corpora: a preliminary investigation. ACL 2000 Workshop on Word Senses and Multi-linguality, Hong Kong, October 2000; pp.1-9 [PDF, 785KB]

(2000) Zheng Jie & Mao Yuhang: A word sense disambiguation model using bilingual corpus. LREC-2000: Second International Conference on Language Resources and Evaluation. Proceedings, Athens, Greece, 31 May – 2 June 2000; pp. 517-521. [PDF, 85KB]

(2000) Mary Xiaoyong Liu, Ted Diamond, & Anne R.Diekema: Word sense disambiguation for cross-language information retrieval. [NAACL 2000] 1st Meeting of the North American Chapter of the Association for Computational Linguistics: proceedings of the conference, April 29-May 4, 2000, Seattle, Washington, USA; pp.35-40. [PDF]

(2000) Bernardo Magnini & Carlo Strapparava: Experiments in word domain disambiguation for parallel texts. ACL 2000 Workshop on Word Senses and Multi-linguality, Hong Kong, October 2000; pp. 27-33 [PDF, 605KB]

(2000) Hiroshi Nakagawa: Disambiguation of lexical translations based on bilingual comparable corpora. LREC-2000: Second International Conference on Language Resources and Evaluation. Workshop on Terminology Resources and Computation, Athens, Greece, 29 May 2000; 6pp. [PDF, 100KB]