• DocumentCode
    323529
  • Title

    A fast vocabulary independent algorithm for spotting words in speech

  • Author

    Dharanipragada, S. ; Roukos, S.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    233
  • Abstract
    In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. We present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in word spotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time
  • Keywords
    acoustic signal detection; indexing; information retrieval; signal representation; speech processing; speech recognition; accuracy; acoustic match; audio-indexing; coarse match; coarse search; coarse-to-detailed search; fast vocabulary independent algorithm; keywords; out of vocabulary words detection; phone sequence spotting; phone-level representation; phone-ngram matching; preprocessing stage; putative word hits; speech recognizer; spoken message retrieval; video-browsing; word spotting; Computer networks; Decoding; Information retrieval; Lattices; Real time systems; Speech recognition; Text recognition; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674410
  • Filename
    674410