• DocumentCode
    590869
  • Title
    Introduction of false detection control parameters in spoken term detection
  • Author
    Furuya, Yasubumi ; Natori, Satoshi ; Nishizaki, Hiromitsu ; Sekiguchi, Yuta
  • Author_Institution
    Dept. of Educ., Univ. of Yamanashi, Kofu, Japan
  • fYear
    2012
  • fDate
    3-6 Dec. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper describes spoken term detection (STD) with false detection control. Our STD method uses a phoneme transition network (PTN) derived from multiple automatic speech recognizers (ASRs) as an index. A PTN is almost the same as a sub-word-based confusion network (CN), which is derived from the output of an ASR. The PTN-based index we proposed is built from the outputs of multiple ASRs, and is known to be robust to certain recognition errors and to the out-of-vocabulary problem. Our PTN was very effective at detecting query terms. However, the PTN generates many false detections, especially for short query terms. Therefore, we applied two false detection control parameters to the Dynamic Time Warping-based term detection engine. In addition, we changed the search parameters depending on the length of the query term. As a result, the STD performance improved to an F-measure of 0.785, compared with 0.717 without any parameters.
  • Keywords
    query processing; speaker recognition; ASR; CN; PTN-based index; STD; automatic speech recognition; confusion network; dynamic time warping-based term detection; false detection control parameter; phoneme transition network; query term detection; search parameter; spoken term detection; Educational institutions; Engines; Hidden Markov models; Indexing; Speech; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC), 2012 Asia-Pacific
  • Conference_Location
    Hollywood, CA
  • Print_ISBN
    978-1-4673-4863-8
  • Type
    conf
  • Filename
    6412016
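
The abstract's idea of thresholded, length-dependent term detection can be illustrated with a minimal sketch. This is not the authors' engine. It assumes plain phoneme sequences instead of a PTN index, uses a DTW-style edit-distance recurrence with unit costs, and invents the names `best_match_cost`, `detect`, `short_len`, `short_thr`, and `long_thr` along with their default values. The `f_measure` helper uses the standard harmonic mean of precision and recall.

```python
def best_match_cost(query, utterance):
    """Minimum alignment cost of `query` against any span of `utterance`.

    DTW-style dynamic programming with a free start and free end in the
    utterance; unit costs are an assumption made for illustration.
    """
    m = len(utterance)
    prev = [0.0] * (m + 1)  # zero query phonemes consumed: free start anywhere
    for i in range(1, len(query) + 1):
        cur = [float(i)] + [0.0] * m  # cur[0]: first i query phonemes unmatched
        for j in range(1, m + 1):
            local = 0.0 if query[i - 1] == utterance[j - 1] else 1.0
            cur[j] = min(
                prev[j - 1] + local,  # match or substitution
                prev[j] + 1.0,        # query phoneme with no counterpart
                cur[j - 1] + 1.0,     # extra phoneme in the utterance
            )
        prev = cur
    return min(prev)  # free end: best cost at any utterance position


def detect(query, utterance, short_len=4, short_thr=0.0, long_thr=0.25):
    """Accept a detection if the length-normalized cost is under a threshold.

    Short queries get the stricter (hypothetical) threshold, since they
    are the ones that tend to produce false detections.
    """
    cost = best_match_cost(query, utterance) / len(query)
    threshold = short_thr if len(query) <= short_len else long_thr
    return cost <= threshold


def f_measure(tp, fp, fn):
    """Harmonic mean of precision and recall from detection counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

In this sketch a short query must match exactly, while a longer query tolerates a small fraction of mismatched phonemes. Tightening `short_thr` and `long_thr` trades recall for fewer false detections, which is the trade-off the F-measure summarizes.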