• DocumentCode
    3167619
  • Title

    Detection of unseen words in conversational Mandarin

  • Author

    Bulyko, I. ; Kimball, Owen ; Siu, Man-Hung ; Herrero, José ; Blum, Dan

  • Author_Institution
    Raytheon BBN Technol., Cambridge, MA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    5181
  • Lastpage
    5184
  • Abstract
    We present a Mandarin keyword search system that uses a large vocabulary recognizer to generate consensus networks at various resolutions: word, character, syllable and phone. In order to achieve fast and accurate search, we propose the use of an efficient approximate-match dynamic programming algorithm that finds the best alignment between the target query and the consensus network. Experiments with Mandarin conversational telephone speech show that the approximate-match search improves detection accuracy by more than 10% for rare words that are not present in the recognizer's dictionary (OOV terms). We also found OOV terms to benefit most from system combination, where we observe a roughly 10% improvement relative to the best single system.
  • Keywords
    dynamic programming; natural language processing; speech processing; Mandarin conversational telephone speech; OOV terms; approximate-match dynamic programming algorithm; consensus network; spoken term detection; target query; unseen words detection; Decision support systems; Mandarin; OOV; Spoken term detection;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6289087
  • Filename
    6289087