DocumentCode
2804904
Title
Spoken term detection based on the most probable phoneme sequence
Author
Gosztolya, Gábor ; Tóth, László
Author_Institution
Dept. of Inf., Univ. of Szeged, Szeged, Hungary
fYear
2011
fDate
27-29 Jan. 2011
Firstpage
101
Lastpage
106
Abstract
The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. In this area, besides the accuracy of hits returned, the speed of search is also very important, for which an intermediate representation of recordings is normally used. In this paper we evaluate a spoken term detection method which represents the speech signals by their most probable phoneme sequence, on which a dynamic search is then performed. As the accuracy of the phoneme recognizer used is vital, we shall test this method by using several approaches of phoneme identification. We found that our method already achieves satisfactory accuracy, although its run time is still rather high. We also found that this approach is heavily dependent on the performance of the phoneme recognizer.
Keywords
speech recognition; audio recordings; most probable phoneme sequence; phoneme identification; phoneme recognizer; spoken term detection method; Accuracy; Acoustics; Artificial neural networks; Hidden Markov models; Measurement; Speech; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Applied Machine Intelligence and Informatics (SAMI), 2011 IEEE 9th International Symposium on
Conference_Location
Smolenice
Print_ISBN
978-1-4244-7429-5
Type
conf
DOI
10.1109/SAMI.2011.5738856
Filename
5738856
Link To Document