Title :
Enhanced Out of Vocabulary Word Detection Using Local Acoustic Information
Author :
Xuyang Wang ; Ta Li ; Pengyuan Zhang ; Jielin Pan ; Yonghong Yan
Author_Institution :
Key Lab. of Speech Acoust. & Content Understanding, Beijing, China
Abstract :
The detection of Out-of-vocabulary (OOV) words is a crucial problem for spoken term detection (STD). In this paper, the use of integration with local acoustic information is investigated to retrieve more OOV words. Tokens with high local acoustic probabilities propagated in the search space at the decoding stage will be forced to propagate to the next frame. In this way, acoustic similar words can be reserved in recognition results without considering of language model probabilities. Experimental results show that this new approach results in a significant increase in the performance of OOV words detection. At least a relative improvement of 8.5% in equal error rate is achieved over the baseline system. Meanwhile, it will do no harm to in-vocabulary (IV) words detection. With some refinement of beam pruning, the decoding time only rises 3% relative to the baseline system.
Keywords :
acoustic signal processing; probability; speech recognition; vocabulary; word processing; OOV word detection; STD; baseline system; beam pruning; decoding time; enhanced out-of-vocabulary word detection; equal error rate; in-vocabulary word detection; language model probabilities; local acoustic information; local acoustic probabilities; spoken term detection; Acoustic beams; Acoustics; Decoding; Hidden Markov models; Signal processing; Speech; Speech recognition; OOV; spoken term detection; token passing;
Conference_Titel :
Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014 Tenth International Conference on
Conference_Location :
Kitakyushu
Print_ISBN :
978-1-4799-5389-9
DOI :
10.1109/IIH-MSP.2014.154