Proceedings of the 11th

International Workshop on

Spoken Language Translation

(IWSLT 2014)

Lake Tahoe, California, US

December 4th and 5th, 2014

Edited by

Marcello Federico, Sebastian Stüker, François Yvon

 

Table of contents

 

Satoshi Nakamura - Foreword

 

Participants

 

Program

 

Keynotes

Arul Menezes: Speech translation for everyone – breaking down the barriers [abstract]

 

Evaluation campaign

 

Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico: Report on the 11th IWSLT evaluation campaign; pp.2-17 [PDF, 450KB]

 

B.BabaAli, R.Serizel, S.Jalalvand, R.Gretter, D.Giuliani: FBK @ IWSLT 2014 – ASR track; pp.18-25 [PDF, 199KB]

 

Peter Bell, Pawel Swietojanski, Joris Driesen, Mark Sinclair, Fergus McInnes, Steve Renals: The UEDIN ASR systems for the IWSLT 2014 evaluation; pp.26-33 [PDF, 332KB]

 

Meriem Beloucif, Chi-kiu Lo, Dekai Wu: Improving MEANT based semantically tuned SMT; pp.34-41 [PDF, 497KB]

 

Nicola Bertoldi, Prashanu Mathur, Nicolas Ruiz, Marcello Federico: FBK’s machine translation and speech translation systems for the IWSLT 2014 evaluation campaign; pp.42-48 [PDF, 271KB]

 

Alexandra Birch, Matthias Huck, Nadir Durrani, Nikolay Bogoychev, Philipp Koehn: Edinburgh SLT and MT system description for the IWSLT 2014 evaluation; pp.49-56 [PDF, 199KB]

 

Markus Freitag, Joern Wuebker, Stephan Peitz, Hermann Ney, Matthias Huck, Alexandra Birch, Nadir Durrani, Philipp Koehn, Mohammed Mediani, Isabel Slawik, Jan Niehues, Eunach Cho, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, Marcello Federico: Combined spoken language translation; pp.57-64 [PDF, 243KB]

 

Michael Kazi, Elizabeth Salesky, Brian Thompson, Jessica Ray, Michael Coury, Tim Anderson, Grant Erdmann, Jeremy Gwinnup, Katherine Young, Brian Ore, Michael Hutt: The MITLL-AFRL IWSLT 2014 MT system; pp.65-72 [PDF, 243KB]

 

Kevin Kilgour, Michael Heck, Markus Müller, Matthias Sperber, Sebastian Stüker, Alex Waibel: The 2014 KIT IWSLT speech-to-text systems for English, German and Italian; pp.73-79 [PDF, 217KB]

 

Mohamed Morchid, Stéphane Huet, Richard Dufour: A topic-based approach for post-processing correction of automatic translations; pp.80-85 [PDF, 385KB]

 

Raymond W.M.Ng, Mortaza Doulaty, Rama Doddipatla, Wilker Aziz, Kashif Shah, Oscar Saz, Madina Hasan, Ghada AlHaribi, Lucia Specia, Thomas Hain: The USFD SLT system for IWSLT 2014; pp.86-91 [PDF, 287KB]

 

Quoc Bao Nguyen, Tat Thang Vu, Chi Mai Luong: The speech recognition systems of IOIT for IWSLT 2014; pp.92-95. [PDF, 233KB]

 

Achraf Ben Romdhane, Salma Jamoussi, Abdelmajid Ben Hamadou, Kamel Smaïli: Phrase-based language modelling for statistical machine translation; pp.96-99. [PDF, 209KB]

 

Anthony Rousseau, Loïc Barrault, Paul Deléglise, Yannick Estève, Holger Schwenk, Samir Bennacef, Armando Muscariello, Stephan Vanni: The LIUM English-to-French spoken language translation system and the Vecsys/LIUM automatic speech recognition system for Italian language for IWSLT 2014; pp.100-105. [PDF, 220KB]

 

Natalia Segal, Hélène Bonneau-Maynard, Quoc Khanh Do, Alexandre Allauzen, Jean-Luc Gauvain, Lori Lamel, François Yvon: LIMSI English-French speech translation system; pp.106-112. [PDF, 289KB]

 

Peng Shen, Yugang Lu, Xinhui Hu, Naoyuki Kanda, Masahiro Saiko, Chiori Hori: The NCT ASR system for IWSLT 2014; pp.113-118. [PDF, 200KB]

 

Isabel Slawik, Mohammed Mediani, Jan Niehues, Yuqi Zhang, Eunah Cho, Teresa Herrmann, Thanh-Le Ha, Alex Waibel: The KIT translation systems for IWSLT 2014; pp.119-126. [PDF, 298KB]

 

Katsuhito Sudoh, Graham Neubig, Kevin Duh, Katsuhiko Hayashi: NTT-NAIST syntax-based SMT systems for IWSLT 2014; pp.127- 133. [PDF, 282KB]

 

Shijin Wang, Yuguang Wang, Jianfeng Li, Yiming Cui, Lirong Dai: The USTC machine translation system for IWSLT 2014; pp.134-138. [PDF, 417KB]

 

Xiaolin Wang, Andrew Finch, Masao Utiyama, Taro Watanabe, Eiichiro Sumita: The NICT translation system for IWSLT 2014; pp.139-142 [PDF, 176KB]

 

Krzysztof Wolk, & Krzysztof Marasek: Polish-English speech statistical machine translation systems for the IWSLT 2014; pp.143-149. [PDF, 302KB]

 

Joern Wuebker, Stephan Peitz, Andreas Guta, Hermann Ney: The RWTH Aachen machine translation systems for IWSLT 2014; pp.150-154. [PDF, 233KB]

 

Technical papers

 

Ahmed Ali, Hamdy Mubarak, Stephan Vogel: Advances in dialectal Arabic speech recognition: a study using Twitter to improve Egyptian ASR; pp.156-162. [PDF, 337KB]

 

Timo Baumann, Srinivas Bangalore, Julia Hirschberg: Towards simultaneous interpreting: the timing of incremental machine translation and speech synthesis; pp.163-168. [PDF, 271KB]

 

L.Besacier, B.Lecouteux, N.Q.Luong, K.Hour, M.Hadjsalah: Word confidence estimation for speech translation; pp.169-175. [PDF, 317KB]

 

Eunah Cho, Jan Niehues, Alex Waibel: Machine translation of multi-party meetings: segmentation and disfluency removal strategies; pp.176-183. [PDF, 233KB]

 

Chenchen Ding, Ye Kyaw Thu, Masao Utiyama, Andrew Finch, Eiichiro Sumita: Empircal dependency-based head finalization for statistical Chinese-, English-, and French-to-Myanmar (Burmese) machine  translation; pp.184-191. [PDF, 401KB]

 

Quoc-Khanh Do, Alexandre Allauzen, François Yvon: Discriminative adaptation of continuous space translation models; pp.192-199. [PDF, 383KB]

 

Matthias Eck, Yuri Zemlyanskiy, Joy Zhang, Alex Waibel: Extracting translation pairs from social network content; pp.200-205. [PDF, 288KB]

 

Andrew Finch, Xiaolin Wang, Eiichiro Sumita: An exploration of segmentation strategies in stream decoding; pp.206-213. [PDF, 298KB]

 

Li Gong, Aurélien Max, François Yvon: Incremental development of statistical machine translation systems; pp.214-222. [PDF, 948KB]

 

Thanh-Le Ha, Jan Niehues, Alex Waibel: Lexical translation model using a deep neural network architecture; pp.223-229. [PDF, 301KB]

 

Sanjika Hewavitharana, Dennis Mehay, Sankaranarayanan Ananthakrishnan, Rohit Kumar, John Makhoul: Anticipatory translation model adaptation for bilingual conversations; pp.230-235. [PDF, 256KB]

 

Sariya Karimova, Patrick Simianer, Stefan Riezler: Offline extraction of overlapping phrases for hierarchical phrase-based translation; pp.236-243. [PDF, 251KB]

 

Gaurav Kumar, Yuan Cao, Ryan Cotterell, Chris Callison-Burch, Daniel Povey, Sanjeev Khudanpur: Translations of the Callhome Egyptian Arabic  corpus for conversational speech translation; pp.244-248. [PDF, 175KB]

 

Mohammed Mediani, Joshua Winebarger, Alexander Waibel: Improving in-domain data selection for small in-domain sets; pp.249-256. [PDF, 321KB]

 

Markus Müller, Sebastian Stüker, Zaid Sheikh, Florian Metze, Alex Waibel: Multilingual deep bottle neck features: a study on language selection and training techniques; pp.257-264. [PDF, 162KB]

 

Graham Neubig, Katsuhiro Sudoh, Yusuke Oda, Kevin Duh, Hajime Tsukuda, Masaaki Nagata: The NAIST-NTT TED talk treebank; pp.265-270. [PDF, 453KB]

 

Stephan Peitz, Markus Freitag, Hermann Ney: Better punctuation prediction with hierarchical phrase-based translation; pp.271-278. [PDF, 221KB]

 

Ge Wu, Yuqi Zhang, Alexander Waibel: Rule-based preordering on multiple syntactic levels in statistical machine translation; pp.279-286. [PDF, 446KB]