DocumentCode
3207276
Title
Dynamic Time Warping based speech recognition for isolated sinhala words
Author
Priyadarshani, P.G.N. ; Dias, N.G.J. ; Punchihewa, Amal
Author_Institution
Dept. of Stat. & Comput. Sci., Univ. of Kelaniya, Kelaniya, Sri Lanka
fYear
2012
fDate
5-8 Aug. 2012
Firstpage
892
Lastpage
895
Abstract
Currently there is a considerable tendency in developing Automatic Speech Recognition (ASR) systems which are capable of tracking the human speech done in local specific languages and identifying them because the people prefer to work with computers using their native language. In Sri Lanka, a sizable portion of the population is discouraged to use computers simply because of the language barrier and difficulty of using the conventional interfaces. Consequently there is a great demand for a computer interface which enables the communication in Sinhala. This paper presents an approach to identify Sinhala speech based on Dynamic Time Warping (DTW) and the Mel Frequency Cepstral Coefficients (MFCC). The correct recognition was achieved in several phases and each phase is described in detail.
Keywords
speech recognition; ASR systems; DTW; MFCC; Mel frequency cepstral coefficients; Sinhala speech; Sri Lanka; automatic speech recognition; computer interface; dynamic time warping; human speech tracking; isolated Sinhala words; Computers; Feature extraction; Filter banks; Mel frequency cepstral coefficient; Speech; Speech recognition; Vocabulary; DCT; DTW; FFT; MFCC; hamming window;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems (MWSCAS), 2012 IEEE 55th International Midwest Symposium on
Conference_Location
Boise, ID
ISSN
1548-3746
Print_ISBN
978-1-4673-2526-4
Electronic_ISBN
1548-3746
Type
conf
DOI
10.1109/MWSCAS.2012.6292164
Filename
6292164
Link To Document