Title :
Segmentation of Sindhi Speech using Formants
Author :
Khawaja, M. Asif ; Haider, Najmi G.
Author_Institution :
Sch. of Electr. Eng., Univ. of NSW, Sydney, NSW, Australia
Abstract :
A speech segmentation method using formant frequencies is presented. The method uses speech samples of a major language of Indian sub-continent, Sindhi. It performs VCP (vowel-consonant-pause) segmentation and generates VCP strings for speech signals. The VCP strings and their formation may enable a recognizer to identify the speech on-the-fly, hence minimizing the system training and making the recognizer very efficient. The method applies velocity and acceleration parameters of rate-of-change dynamics on formants of speech to segment it into vowel, consonant, and pause parts. A test-bed software, to implement the proposed method and conduct all experiments, is also presented. Results show that the method is speaker as well as gender independent. Its segmentation performance is almost over 90% in most conditions and over 60% under some worst conditions. Long-term goal is to develop an efficient speaker-independent speech recognizer based on proposed method. A model of such a recognizer is also presented.
Keywords :
speech processing; Sindhi speech; formant frequencies; rate-of-change dynamics; speech segmentation; vowel consonant pause segmentation; Australia; Automatic speech recognition; Frequency; Natural languages; Signal processing; Speech analysis; Speech processing; Speech recognition; Standardization; Strontium; Segmentation; Sindhi; Speech processing; formant frequencies; rate of change dynamics;
Conference_Titel :
Signal Processing and Communications, 2007. ICSPC 2007. IEEE International Conference on
Conference_Location :
Dubai
Print_ISBN :
978-1-4244-1235-8
Electronic_ISBN :
978-1-4244-1236-5
DOI :
10.1109/ICSPC.2007.4728439