• DocumentCode
    672343
  • Title

    Language style and domain adaptation for cross-language SLU porting

  • Author

    Stepanov, Evgeny A. ; Kashkarev, Ilya ; Bayer, Ali Orkan ; Riccardi, Giuseppe ; Ghosh, A.

  • Author_Institution
    Dept. of Inf. Eng. & Comput. Sci., Univ. of Trento, Trento, Italy
  • fYear
    2013
  • fDate
    8-12 Dec. 2013
  • Firstpage
    144
  • Lastpage
    149
  • Abstract
    Automatic cross-language Spoken Language Understanding porting is plagued by two limitations. First, SLU are usually trained on limited domain corpora. Second, language pair resources (e.g. aligned corpora) are scarce or unmatched in style (e.g. news vs. conversation). We present experiments on automatic style adaptation of the input for the translation systems and their output for SLU. We approach the problem of scarce aligned data by adapting the available parallel data to the target domain using limited in-domain and larger web crawled close-to-domain corpora. SLU performance is optimized by reranking its output with Recurrent Neural Network-based joint language model. We evaluate end-to-end SLU porting on close and distant language pairs: Spanish - Italian and Turkish - Italian; and achieve significant improvements both in translation quality and SLU performance.
  • Keywords
    language translation; natural language processing; recurrent neural nets; Italian language; Spanish language; Turkish language; Web crawled close-to-domain corpora; automatic cross-language spoken language understanding porting; automatic style adaptation; cross-language SLU porting; end-to-end SLU porting; language pair resources; parallel data; recurrent neural network-based joint language model; statistical machine translation; Adaptation models; Data models; Google; Joints; Numerical models; Speech; Training; Domain Adaptation; Spoken Language Understanding; Statistical Machine Translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
  • Conference_Location
    Olomouc
  • Type

    conf

  • DOI
    10.1109/ASRU.2013.6707720
  • Filename
    6707720