• DocumentCode
    591902
  • Title

    Incorporating syllable duration into line-detection-based spoken term detection

  • Author

    Ohno, Tetsufumi ; Akiba, Tatsuro

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Toyohashi Univ. of Technol., Toyohashi, Japan
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    204
  • Lastpage
    209
  • Abstract
    A conventional method for spoken term detection (STD) is to apply approximate string matching to subword sequences in a spoken document obtained by speech recognition. An STD method that considers string matching as line detection in a syllable distance plane has been proposed. While this has demonstrated fast ordered-by-distance detections, it has still suffered from the insertion and deletion errors introduced by the speech recognition. In this work, we aim to improve detection performance by employing syllable-duration information. The proposed method enables robust detection by introducing a distance plane that uses frames as units instead of using syllables as units. Our experimental evaluation showed that the incorporation of syllable-duration achieved higher detection performance in high-recall regions.
  • Keywords
    speech recognition; string matching; word processing; STD method; approximate string matching; deletion errors; high-recall regions; insertion errors; line detection-based spoken term detection; ordered-by-distance detection; speech recognition; spoken document; subword sequences; syllable distance plane; syllable duration information; Equations; Estimation error; Hidden Markov models; Indexing; Mathematical model; Speech recognition; Approximate String Matching; Spoken Term Detection; Syllable Duration;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424223
  • Filename
    6424223