Title :
Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback
Author :
Tu, Tsung-wei ; Lee, Hung-yi ; Lee, Lin-shan
Author_Institution :
Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
This paper reports a new approach to improving spoken term detection that uses support vector machine (SVM) with acoustic and linguistic features. As SVM is a good technique for discriminating different features in vector space, we recently proposed to use pseudo-relevance feedback to automatically generate training data for SVM training and use SVM to re-rank the first-pass results considering the context consistency in the lattices. In this paper, we further extend this concept by considering acoustic features at word, phone and HMM state levels and linguistic features of different order. Extensive experiments under various recognition environments demonstrate significant improvements in all cases. In particular, the acoustic features at the HMM state level offered the most significant improvements, and the improvements achieved by acoustic and linguistic features are shown to be additive.
Keywords :
acoustic signal processing; linguistics; relevance feedback; speech processing; support vector machines; HMM state level; SVM training; acoustic feature; context consistency; linguistic feature; phone level; pseudorelevance feedback; recognition environment; rerank; spoken term detection; support vector machine; vector space; word level; Context; Lattices; Mel frequency cepstral coefficient; Pragmatics; Support vector machines; Vectors;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
DOI :
10.1109/ASRU.2011.6163962