DocumentCode :
2804904
Title :
Spoken term detection based on the most probable phoneme sequence
Author :
Gosztolya, Gábor ; Tóth, László
Author_Institution :
Dept. of Inf., Univ. of Szeged, Szeged, Hungary
fYear :
2011
fDate :
27-29 Jan. 2011
Firstpage :
101
Lastpage :
106
Abstract :
The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. In this area, besides the accuracy of hits returned, the speed of search is also very important, for which an intermediate representation of recordings is normally used. In this paper we evaluate a spoken term detection method which represents the speech signals by their most probable phoneme sequence, on which a dynamic search is then performed. As the accuracy of the phoneme recognizer used is vital, we shall test this method by using several approaches of phoneme identification. We found that our method already achieves satisfactory accuracy, although its run time is still rather high. We also found that this approach is heavily dependent on the performance of the phoneme recognizer.
Keywords :
speech recognition; audio recordings; most probable phoneme sequence; phoneme identification; phoneme recognizer; spoken term detection method; Accuracy; Acoustics; Artificial neural networks; Hidden Markov models; Measurement; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applied Machine Intelligence and Informatics (SAMI), 2011 IEEE 9th International Symposium on
Conference_Location :
Smolenice
Print_ISBN :
978-1-4244-7429-5
Type :
conf
DOI :
10.1109/SAMI.2011.5738856
Filename :
5738856
Link To Document :
بازگشت