DocumentCode :
187712
Title :
Query word retrieval from continuous speech using GMM posteriorgrams
Author :
Reddy, Pappagari Raghavendra ; Rout, Kallola ; Rama Murty, K. Sri
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Hyderabad, Hyderabad, India
fYear :
2014
fDate :
22-25 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
The objective of this work is to study the issues involved in building an automatic query word retrieval system for broadcast news in an unsupervised framework, i.e., without using any labelled speech data. In the absence of labelled data, sequence of feature-vectors extracted from the query word have to be matched with those extracted from the test utterance. This is a non-trivial task, as typical feature-vectors like Mel-frequency cepstral coefficients (MFCC) carry both speech-specific and speaker-specific information. In this work, we have employed Gaussian mixture models (GMM) to extract speaker-independent features from the speech signal. Gaussian mixture model, trained on a large amount of speech data, is used to derive posterior features for each frame of speech signal. The sequence of posterior features are matched using dynamic time-warping algorithm to detect the presence of query word in the test utterance. The performance of the proposed method is evaluated on Telugu broadcast news database. It is observed that the posterior features extracted from GMM are better suited for query word retrieval compared to the MFCC features.
Keywords :
Gaussian processes; mixture models; query processing; speaker recognition; GMM posteriorgrams; Gaussian mixture model; MFCC; automatic query word retrieval system; broadcast news; continuous speech; dynamic time-warping algorithm; feature-vector; mel-frequency cepstral coefficient; speaker-independent feature extraction; speaker-specific information; speech-specific information; unsupervised framework; Euclidean distance; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications (SPCOM), 2014 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-4666-2
Type :
conf
DOI :
10.1109/SPCOM.2014.6984011
Filename :
6984011
Link To Document :
بازگشت