Segmentation of speech signal into phonemes using two-level GMM tokenization

Author

Monica, T. ; Nagarajan, T.

Author_Institution

Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India

fYear

2011

fDate

3-5 June 2011

Firstpage

843

Lastpage

847

Abstract

This paper proposes an algorithm for identifying the phoneme boundaries in a given speech signal without the need for its orthographic transcription. The algorithm is a two level process whereby in the first level the phoneme boundaries are determined by silence/voiced/unvoiced classification and in the second level the voiced parts are alone tokenized further. TIMIT database was used to carry out the experiments and to check the correctness of the automatically detected phoneme boundaries. The experimental results showed that the performance of the algorithm in identifying the correct boundaries was ~75%.

Keywords

Gaussian processes; signal classification; speech processing; Gaussian mixture model; TIMIT database; phoneme boundaries; silence classification; speech signal segmentation; two-level GMM tokenization; unvoiced classification; voiced classification; Correlation; Feature extraction; Indexes; Smoothing methods; Speech; Speech recognition; Training; GMM; phoneme; segmentation; speech;

fLanguage

English

Publisher

ieee

Conference_Titel

Recent Trends in Information Technology (ICRTIT), 2011 International Conference on

Conference_Location

Chennai, Tamil Nadu

Print_ISBN

978-1-4577-0588-5

Type

conf

DOI

10.1109/ICRTIT.2011.5972311

Filename

5972311

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2576751