DocumentCode :
2044270
Title :
Segmentation of Sindhi Speech using Formants
Author :
Khawaja, M. Asif ; Haider, Najmi G.
Author_Institution :
Sch. of Electr. Eng., Univ. of NSW, Sydney, NSW, Australia
fYear :
2007
fDate :
24-27 Nov. 2007
Firstpage :
796
Lastpage :
799
Abstract :
A speech segmentation method using formant frequencies is presented. The method uses speech samples of a major language of Indian sub-continent, Sindhi. It performs VCP (vowel-consonant-pause) segmentation and generates VCP strings for speech signals. The VCP strings and their formation may enable a recognizer to identify the speech on-the-fly, hence minimizing the system training and making the recognizer very efficient. The method applies velocity and acceleration parameters of rate-of-change dynamics on formants of speech to segment it into vowel, consonant, and pause parts. A test-bed software, to implement the proposed method and conduct all experiments, is also presented. Results show that the method is speaker as well as gender independent. Its segmentation performance is almost over 90% in most conditions and over 60% under some worst conditions. Long-term goal is to develop an efficient speaker-independent speech recognizer based on proposed method. A model of such a recognizer is also presented.
Keywords :
speech processing; Sindhi speech; formant frequencies; rate-of-change dynamics; speech segmentation; vowel consonant pause segmentation; Australia; Automatic speech recognition; Frequency; Natural languages; Signal processing; Speech analysis; Speech processing; Speech recognition; Standardization; Strontium; Segmentation; Sindhi; Speech processing; formant frequencies; rate of change dynamics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1235-8
Electronic_ISBN :
978-1-4244-1236-5
Type :
conf
DOI :
10.1109/ICSPC.2007.4728439
Filename :
4728439
Link To Document :
بازگشت