DocumentCode :
180190
Title :
Use of articulatory bottle-neck features for query-by-example spoken term detection in low resource scenarios
Author :
Mantena, Gautam ; Prahallad, K.
Author_Institution :
Int. Inst. of Inf. Technol. - Hyderabad, Hyderabad, India
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
7128
Lastpage :
7132
Abstract :
For query-by-example spoken term detection (QbE-STD), generation of phone posteriorgrams requires labelled data which would be difficult for languages with low resources. One solution is to build models from rich resource languages and use them in the low resource scenario. However, phone classes are not language universal and alternate representation such as articulatory classes is explored. In this paper, we use articulatory information and their derivatives such as bottle-neck (BN) features (also referred to as articulatory BN features) for QbE-STD. We obtain Gaussian posteriorgrams of articulatory BN features in tandem with the acoustic parameters such as frequency domain linear prediction cepstral coefficients to perform the search. We compare the search performance of articulatory and phone BN features and show that articulatory BN features are a better representation. We also provide experimental results to show that low amounts (30 mins) of training data could be used to derive articulatory BN features.
Keywords :
Gaussian processes; feature extraction; query processing; speech processing; Gaussian posteriorgram; QbE-STD; acoustic parameters; articulatory bottle-neck features; articulatory classes representation; frequency domain linear prediction cepstral coefficients; low resource scenario; phone posteriorgram generation; query-by-example spoken term detection; Feature extraction; Frequency-domain analysis; Mel frequency cepstral coefficient; Speech; Speech recognition; Training data; Query-by-example spoken term detection; articulatory features; bottle-neck features; low resource; multi-layer perceptron;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854983
Filename :
6854983
Link To Document :
بازگشت