• DocumentCode
    2804904
  • Title

    Spoken term detection based on the most probable phoneme sequence

  • Author

    Gosztolya, Gábor ; Tóth, László

  • Author_Institution
    Dept. of Inf., Univ. of Szeged, Szeged, Hungary
  • fYear
    2011
  • fDate
    27-29 Jan. 2011
  • Firstpage
    101
  • Lastpage
    106
  • Abstract
    The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. In this area, besides the accuracy of hits returned, the speed of search is also very important, for which an intermediate representation of recordings is normally used. In this paper we evaluate a spoken term detection method which represents the speech signals by their most probable phoneme sequence, on which a dynamic search is then performed. As the accuracy of the phoneme recognizer used is vital, we shall test this method by using several approaches of phoneme identification. We found that our method already achieves satisfactory accuracy, although its run time is still rather high. We also found that this approach is heavily dependent on the performance of the phoneme recognizer.
  • Keywords
    speech recognition; audio recordings; most probable phoneme sequence; phoneme identification; phoneme recognizer; spoken term detection method; Accuracy; Acoustics; Artificial neural networks; Hidden Markov models; Measurement; Speech; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applied Machine Intelligence and Informatics (SAMI), 2011 IEEE 9th International Symposium on
  • Conference_Location
    Smolenice
  • Print_ISBN
    978-1-4244-7429-5
  • Type

    conf

  • DOI
    10.1109/SAMI.2011.5738856
  • Filename
    5738856