Title :
Segmentation of speech signal into phonemes using two-level GMM tokenization
Author :
Monica, T. ; Nagarajan, T.
Author_Institution :
Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
Abstract :
This paper proposes an algorithm for identifying the phoneme boundaries in a given speech signal without the need for its orthographic transcription. The algorithm is a two level process whereby in the first level the phoneme boundaries are determined by silence/voiced/unvoiced classification and in the second level the voiced parts are alone tokenized further. TIMIT database was used to carry out the experiments and to check the correctness of the automatically detected phoneme boundaries. The experimental results showed that the performance of the algorithm in identifying the correct boundaries was ~75%.
Keywords :
Gaussian processes; signal classification; speech processing; Gaussian mixture model; TIMIT database; phoneme boundaries; silence classification; speech signal segmentation; two-level GMM tokenization; unvoiced classification; voiced classification; Correlation; Feature extraction; Indexes; Smoothing methods; Speech; Speech recognition; Training; GMM; phoneme; segmentation; speech;
Conference_Titel :
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location :
Chennai, Tamil Nadu
Print_ISBN :
978-1-4577-0588-5
DOI :
10.1109/ICRTIT.2011.5972311