Title :
Multiresolution feature extraction (MRFE) based speech recognition system
Author :
Priyanka, M. Anbu Swarna ; Solomi, V. Sherlin ; Vijayalakshmi, P. ; Nagarajan, T.
Abstract :
A speech recognition system converts spoken utterances into text. The accuracy of the recognition system depends on the acoustic models it uses, which are trained on features extracted from the available training data and are then used to recognise the spoken text. In the conventional feature extraction method, features are extracted using a single window size (say, 20 ms). Instead of this fixed window size, we propose to extract features from the same speech signal using multiple window sizes. When multiple window sizes are used, multiple sets of feature vectors are derived for the same word, thereby increasing the number of training examples. Experiments show that when features are extracted with multiple window sizes, the variation among the feature vectors increases considerably, which leads to better acoustic models. This multiresolution feature extraction technique is used to build a speech recogniser. To analyse the performance of multiresolution feature extraction, an isolated-word speech recognition system is developed for the TIMIT speech corpus. Results reveal an improvement of around 8% in recognition accuracy over the conventional single-resolution feature extraction method.
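The idea of framing the same signal at several window sizes can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract names only the 20 ms example, so the window sizes (15, 20 and 25 ms), the 10 ms hop, the Hamming window, and the log band-energy features (a stand-in for whatever features the paper uses) are all assumptions, as are the function names.

```python
import numpy as np


def frame_signal(signal, sample_rate, window_ms, hop_ms=10.0):
    """Split a 1-D signal into overlapping, Hamming-windowed frames."""
    win = int(round(sample_rate * window_ms / 1000.0))
    hop = int(round(sample_rate * hop_ms / 1000.0))
    if len(signal) < win:
        return np.empty((0, win))
    n_frames = 1 + (len(signal) - win) // hop
    # Row i holds samples [i * hop, i * hop + win).
    idx = np.arange(win)[None, :] + hop * np.arange(n_frames)[:, None]
    return signal[idx] * np.hamming(win)


def log_band_energies(frames, n_fft=512, n_bands=13):
    """Placeholder features (assumption): log energy in equal-width FFT bands.

    Frames longer than n_fft would be truncated by the FFT, so n_fft
    should be at least the largest window length in samples.
    """
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2
    bands = np.array_split(power, n_bands, axis=1)
    energies = np.stack([b.sum(axis=1) for b in bands], axis=1)
    return np.log(energies + 1e-10)


def multiresolution_features(signal, sample_rate,
                             window_sizes_ms=(15.0, 20.0, 25.0)):
    """Return one feature matrix per window size for the same signal."""
    return {
        w: log_band_energies(frame_signal(signal, sample_rate, w))
        for w in window_sizes_ms
    }


if __name__ == "__main__":
    rate = 16000  # TIMIT recordings are sampled at 16 kHz
    word = np.random.default_rng(0).standard_normal(rate)  # 1 s of noise
    for w, feats in multiresolution_features(word, rate).items():
        print(f"{w:.0f} ms window -> feature matrix of shape {feats.shape}")
```

Each window size yields its own sequence of feature vectors for the same utterance, so one recording of a word contributes several training examples instead of one.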
Keywords :
feature extraction; speech recognition; MRFE; TIMIT speech corpus; acoustic models; feature vectors; isolated word speech recognition system; multiple window sizes; multiresolution feature extraction; Information technology; Market research; Speech; Testing; Training
Conference_Title :
Recent Trends in Information Technology (ICRTIT), 2013 International Conference on
Conference_Location :
Chennai
DOI :
10.1109/ICRTIT.2013.6844197