  • DocumentCode
    507311
  • Title
    Sample-Based Automatic Dictionary Generation for Keyword Spotting System
  • Author
    Lu, Li ; Ge, Fengpei ; Li, Ta ; Zhao, Qingwei ; Yan, Yonghong
  • Author_Institution
    Inst. of Acoust., Chinese Acad. of Sci., Beijing, China
  • Volume
    5
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    505
  • Lastpage
    508
  • Abstract
    In this paper we develop an approach to automatic, data-driven generation of pronunciation dictionaries for keyword spotting (KWS) systems. In practical applications, KWS tasks often have to deal with keywords whose pronunciations cannot be found in the dictionary. To solve this problem, we study how to derive pronunciations automatically from speech samples of keywords. Recognized sequences from these samples are used as candidates and merged to form a phoneme confusion network (PCN), from which the pronunciations are extracted using a confidence-based metric. Experimental results show that the sample-based dictionary achieves performance similar to that of the canonical dictionary, and that the proposed approach is independent of the sample set.
  • Keywords
    dictionaries; speech recognition; canonical dictionary; confidence-based metric; data-driven generation; keyword spotting system; phoneme confusion network; pronunciation dictionary; pronunciations; recognized sequences; sample-based automatic dictionary generation; speech samples; Acoustic measurements; Automatic speech recognition; Dictionaries; Fuzzy systems; Hidden Markov models; Interpolation; Lattices; Merging; Personal communication networks; Speech recognition; Confidence Metric; Data-driven; Phoneme Confusion Network; Pronunciation Extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type
    conf
  • DOI
    10.1109/FSKD.2009.506
  • Filename
    5360571