Title :
Spoken term detection based on the most probable phoneme sequence
Author :
Gosztolya, Gábor ; Tóth, László
Author_Institution :
Dept. of Inf., Univ. of Szeged, Szeged, Hungary
Abstract :
The aim of the spoken term detection task is to find the occurrence of user-entered keywords in an archive of audio recordings. In this area, besides the accuracy of hits returned, the speed of search is also very important, for which an intermediate representation of recordings is normally used. In this paper we evaluate a spoken term detection method which represents the speech signals by their most probable phoneme sequence, on which a dynamic search is then performed. As the accuracy of the phoneme recognizer used is vital, we shall test this method by using several approaches of phoneme identification. We found that our method already achieves satisfactory accuracy, although its run time is still rather high. We also found that this approach is heavily dependent on the performance of the phoneme recognizer.
Keywords :
speech recognition; audio recordings; most probable phoneme sequence; phoneme identification; phoneme recognizer; spoken term detection method; Accuracy; Acoustics; Artificial neural networks; Hidden Markov models; Measurement; Speech; Speech recognition;
Conference_Titel :
Applied Machine Intelligence and Informatics (SAMI), 2011 IEEE 9th International Symposium on
Conference_Location :
Smolenice
Print_ISBN :
978-1-4244-7429-5
DOI :
10.1109/SAMI.2011.5738856