• DocumentCode
    2855794
  • Title

    Subspace based vowel-consonant segmentation

  • Author

    Muralishankar, R. ; Krishna, A. Kjaya ; Ramakrishnan, A.G.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • fYear
    2003
  • fDate
    28 Sept.-1 Oct. 2003
  • Firstpage
    589
  • Lastpage
    592
  • Abstract
    In our (knowledge-based) synthesis system [G. L. Jayavardhana Rama et al., 2002], we use single instances of basic-units, which are polyphones such as CV, VC, VCV, VCCV and VCCCV, where C stands for consonant and V for vowel. These basic-units are recorded in an isolated manner from a speaker and not from continuous speech or carrier-words. Modification of the pitch, amplitude and duration of basic-units is required in our speech synthesis system [G. L. Jayavardhana Rama et al., 2002] to ensure that the overall characteristics of the concatenated units matches with the true characteristic of the target word or sentence. Duration modification is carried out on the vowel parts of the basic-unit leaving the consonant portion in the basic-unit intact. Thus, we need to segment these polyphones into consonant and vowel parts. When the consonant present in any basic-unit is a plosive or fricative, the energy based method is good enough to segment the vowel and consonant parts. However, this method fails when there is a co-articulation between the vowel and the consonant. We propose the use of oriented principal component analysis (OPCA) to segment the co-articulated units. The test feature vectors (LPC-cepstrum & mel-cepstrum) are projected on the consonant and vowel subspaces. Each of these subspaces are represented by generalized eigenvectors obtained by applying OPCA on the training feature vectors. Our approach successfully segments co-articulated basic-units.
  • Keywords
    cepstral analysis; eigenvalues and eigenfunctions; principal component analysis; speech processing; speech synthesis; amplitude; carrier-words; coarticulated units; concatenated units; continuous speech; duration modification; eigenvectors; fricative consonant; oriented principal component analysis; pitch; plosive consonant; polyphone; speech synthesis system; Concatenated codes; Covariance matrix; Humans; Knowledge engineering; Linear predictive coding; Loudspeakers; Speech synthesis; Testing; Virtual colonoscopy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing, 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7997-7
  • Type

    conf

  • DOI
    10.1109/SSP.2003.1289544
  • Filename
    1289544