DocumentCode :
137075
Title :
Experiments on front-end techniques and segmentation model for robust Indian Language speech recognizer
Author :
Sriranjani, R. ; Karthick, B. Murali ; Umesh, S.
Author_Institution :
Dept. of Appl. Mech., Indian Inst. of Technol., Chennai, Chennai, India
fYear :
2014
fDate :
Feb. 28 2014-March 2 2014
Firstpage :
1
Lastpage :
6
Abstract :
Recent contributions in the area of Automatic Speech Recognition (ASR) for Indian Languages has been increased. This paper serves as a comprehensive study of different feature extraction methods namely MFCC, PLP, RASTA-PLP and PNCC. An attempt to find out which of these front end techniques performs better for real world Indian Language data is analyzed experimentally. Then, an isolated word recognizer is built for three Indian languages (i.e., Tamil, Assamese and Bengali) under real world conditions and investigates the importance of handling long silence using segmentation method. The experimental analysis shows that PNCC provides better performance for clean data whereas MFCC shows improved performance in case of multi-condition speech data.
Keywords :
feature extraction; natural language processing; speech recognition; automatic speech recognition; feature extraction methods; front-end techniques; isolated word recognizer; multicondition speech data; robust language speech recognizer; segmentation model; Data models; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Speech; Feature extraction; Noise robustness; Segmentation; Silence handling; Speech recognition; comparison of front-end techniques; real world speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
Type :
conf
DOI :
10.1109/NCC.2014.6811284
Filename :
6811284
Link To Document :
بازگشت