Title :
Dynamic Time Warping based speech recognition for isolated sinhala words
Author :
Priyadarshani, P.G.N. ; Dias, N.G.J. ; Punchihewa, Amal
Author_Institution :
Dept. of Stat. & Comput. Sci., Univ. of Kelaniya, Kelaniya, Sri Lanka
Abstract :
Currently there is a considerable tendency in developing Automatic Speech Recognition (ASR) systems which are capable of tracking the human speech done in local specific languages and identifying them because the people prefer to work with computers using their native language. In Sri Lanka, a sizable portion of the population is discouraged to use computers simply because of the language barrier and difficulty of using the conventional interfaces. Consequently there is a great demand for a computer interface which enables the communication in Sinhala. This paper presents an approach to identify Sinhala speech based on Dynamic Time Warping (DTW) and the Mel Frequency Cepstral Coefficients (MFCC). The correct recognition was achieved in several phases and each phase is described in detail.
Keywords :
speech recognition; ASR systems; DTW; MFCC; Mel frequency cepstral coefficients; Sinhala speech; Sri Lanka; automatic speech recognition; computer interface; dynamic time warping; human speech tracking; isolated Sinhala words; Computers; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speech; Speech recognition; Vocabulary; DCT; DTW; FFT; MFCC; hamming window;
Conference_Titel :
Circuits and Systems (MWSCAS), 2012 IEEE 55th International Midwest Symposium on
Conference_Location :
Boise, ID
Print_ISBN :
978-1-4673-2526-4
Electronic_ISBN :
1548-3746
DOI :
10.1109/MWSCAS.2012.6292164