• DocumentCode
    179344
  • Title

    Role play dialogue topic model for language model adaptation in multi-party conversation speech recognition

  • Author

    Masumura, Ryo ; Oba, Tomohiro ; Masataki, Hirokazu ; Yoshioka, Osamu ; Takahashi, Satoshi

  • Author_Institution
    NTT Media Intell. Labs., NTT Corp., Tokyo, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    4873
  • Lastpage
    4877
  • Abstract
    This paper introduces an unsupervised language model adaptation technique for multi-party conversation speech recognition. The use of topic models provides one of the most accurate frameworks for unsupervised language model adaptation since they can inject long-range topic information into language models. However, conventional topic models are not suitable for multi-party conversation because they assume that each speech set has each different topic. In a multi-party conversation, each speaker will share the same conversation topic and each speaker utterance will depend on both topic and speaker role. Accordingly, this paper proposes new concept of the “role play dialogue topic model” to utilize multiparty conversation attributes. The proposed topic model can share the topic distribution among each speaker and can also consider both topic and speaker role. The proposed topic model based adaptation realizes a new framework that sets multiple recognition hypotheses for each speaker and simultaneously adapts a language model for each speaker role. We use a call center dialogue data set in speech recognition experiments to show the effectiveness of the proposed method.
  • Keywords
    natural language processing; speech recognition; unsupervised learning; call center dialogue data set; long-range topic information; multiparty conversation speech recognition; multiple recognition hypotheses; role play dialogue topic model; speaker role; speaker utterance; speech set; topic distribution; unsupervised language model adaptation technique; Adaptation models; Hidden Markov models; Indexes; Probability; Semantics; Speech; Speech recognition; Unsupervised language model adaptation; multi-party conversation speech recognition; topic model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854528
  • Filename
    6854528