Title :
Analysis of fricatives, stop consonants and nasals in the automatic segmentation of speech using the group delay algorithm
Author :
Musfir, Mohammed ; Krishnan, K. Raghava ; Murthy, Hema A.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Madras, Chennai, India
fDate :
Feb. 28 2014-March 2 2014
Abstract :
Unit Selection based speech synthesis systems (USS) require accurate labeling of units. Accurate segmentation of speech waveforms manually is a laborious task. Syllable-based systems for Indian languages use a group delay based approach for semi-automatic segmentation of speech waveforms into syllables. This performance of the group delay based algorithm is poor when the syllables contain fricatives, nasals and unvoiced stop consonants. This paper proposes a modification to the algorithm that exploits the properties of these types of units to reduce errors. In particular, the ratio of energy in the high frequency bands to low frequency bands is used as a cue to segment the speech signal.
Keywords :
speech synthesis; Indian languages; automatic speech segmentation; fricatives; group delay algorithm; group delay based algorithm; group delay based approach; nasals; semiautomatic segmentation; speech signal; speech waveforms segmentation; stop consonants; syllable-based systems; unit selection based speech synthesis systems; Acoustics; Delays; Energy resolution; Silicon; Speech; Speech processing; Speech recognition;
Conference_Titel :
Communications (NCC), 2014 Twentieth National Conference on
Conference_Location :
Kanpur
DOI :
10.1109/NCC.2014.6811364