DocumentCode :
2576751
Title :
Segmentation of speech signal into phonemes using two-level GMM tokenization
Author :
Monica, T. ; Nagarajan, T.
Author_Institution :
Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
fYear :
2011
fDate :
3-5 June 2011
Firstpage :
843
Lastpage :
847
Abstract :
This paper proposes an algorithm for identifying the phoneme boundaries in a given speech signal without the need for its orthographic transcription. The algorithm is a two level process whereby in the first level the phoneme boundaries are determined by silence/voiced/unvoiced classification and in the second level the voiced parts are alone tokenized further. TIMIT database was used to carry out the experiments and to check the correctness of the automatically detected phoneme boundaries. The experimental results showed that the performance of the algorithm in identifying the correct boundaries was ~75%.
Keywords :
Gaussian processes; signal classification; speech processing; Gaussian mixture model; TIMIT database; phoneme boundaries; silence classification; speech signal segmentation; two-level GMM tokenization; unvoiced classification; voiced classification; Correlation; Feature extraction; Indexes; Smoothing methods; Speech; Speech recognition; Training; GMM; phoneme; segmentation; speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location :
Chennai, Tamil Nadu
Print_ISBN :
978-1-4577-0588-5
Type :
conf
DOI :
10.1109/ICRTIT.2011.5972311
Filename :
5972311
Link To Document :
بازگشت