DocumentCode :
2855794
Title :
Subspace based vowel-consonant segmentation
Author :
Muralishankar, R. ; Krishna, A. Kjaya ; Ramakrishnan, A.G.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
fYear :
2003
fDate :
28 Sept.-1 Oct. 2003
Firstpage :
589
Lastpage :
592
Abstract :
In our (knowledge-based) synthesis system [G. L. Jayavardhana Rama et al., 2002], we use single instances of basic-units, which are polyphones such as CV, VC, VCV, VCCV and VCCCV, where C stands for consonant and V for vowel. These basic-units are recorded in an isolated manner from a speaker and not from continuous speech or carrier-words. Modification of the pitch, amplitude and duration of basic-units is required in our speech synthesis system [G. L. Jayavardhana Rama et al., 2002] to ensure that the overall characteristics of the concatenated units matches with the true characteristic of the target word or sentence. Duration modification is carried out on the vowel parts of the basic-unit leaving the consonant portion in the basic-unit intact. Thus, we need to segment these polyphones into consonant and vowel parts. When the consonant present in any basic-unit is a plosive or fricative, the energy based method is good enough to segment the vowel and consonant parts. However, this method fails when there is a co-articulation between the vowel and the consonant. We propose the use of oriented principal component analysis (OPCA) to segment the co-articulated units. The test feature vectors (LPC-cepstrum & mel-cepstrum) are projected on the consonant and vowel subspaces. Each of these subspaces are represented by generalized eigenvectors obtained by applying OPCA on the training feature vectors. Our approach successfully segments co-articulated basic-units.
Keywords :
cepstral analysis; eigenvalues and eigenfunctions; principal component analysis; speech processing; speech synthesis; amplitude; carrier-words; coarticulated units; concatenated units; continuous speech; duration modification; eigenvectors; fricative consonant; oriented principal component analysis; pitch; plosive consonant; polyphone; speech synthesis system; Concatenated codes; Covariance matrix; Humans; Knowledge engineering; Linear predictive coding; Loudspeakers; Speech synthesis; Testing; Virtual colonoscopy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Statistical Signal Processing, 2003 IEEE Workshop on
Print_ISBN :
0-7803-7997-7
Type :
conf
DOI :
10.1109/SSP.2003.1289544
Filename :
1289544
Link To Document :
بازگشت