Title :
Spoken term detectionfor OOV terms based on phone fragment
Author :
Xu, Yong ; Guo, Wu ; Su, Shan ; Dai, LiRong
Author_Institution :
Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
In this paper, we focus on the problem of search for Out Of Vocabulary (OOV) in spoken term detection (STD). The phone level fragment is adopted as the speech recognition decoding unit. Furthermore, weop-timize the phone level fragment in speech recognition system by adding word-position marker. Then inverted triphone index is built to implement fuzzy search for OOV terms. In the term detection confidence measure procedure, we present a method based on multi-layer perceptron (MLP) to complement for lattice-based confidence measure. Experimental result indicates that the optimizationof fragment can give a 3% relative increase in Actual Term Weighted Value (ATWV) for OOV terms. The confidence measure based on MLP could provide another relativeimprovement of 5.5% in ATWV.
Keywords :
multilayer perceptrons; optimisation; speech recognition; ATWV; MLP; OOV terms; STD; actual term weighted value; fuzzy search; inverted triphone index; lattice-based confidence measure; multi-layer perceptron; optimization; out of vocabulary terms; phone fragment; phone level fragment; speech recognition decoding unit; speech recognition system; spoken term detection; term detection confidence measure procedure; word-position marker; Acoustics; Hidden Markov models; Indexes; Lattices; NIST; Speech recognition; Training;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376767