• DocumentCode
    404824
  • Title

    Subspace and hypothesis based effective segmentation of co-articulated basic-units for concatenative speech synthesis

  • Author

    Muralishankar, R. ; Srikanth, R. ; Ramakrishnan, A.G.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    1
  • fYear
    2003
  • fDate
    15-17 Oct. 2003
  • Firstpage
    388
  • Abstract
    In this paper, we present two new methods for vowel-consonant segmentation of a co-articulated basic-units employed in our Thirukkural Tamil text-to-speech synthesis system (G. L. Jayavardhana Rama et al, IEEE workshop on Speech Synthesis, 2002). The basic-units considered in this are CV, VC, VCV, VCCV and VCCC, where C stands for a consonant and V for any vowel. In the first method, we use a subspace-based approach for vowel-consonant segmentation. It uses oriented principal component analysis (OPCA) where the test feature vectors are projected on to the V and C subspaces. The crossover of the norm-contours obtained by projecting the test basic-unit onto the V and C subspaces give the segmentation points which in turn helps in identifying the V and C durations of a test basic-unit. In the second method, we use probabilistic principal component analysis (PPCA) to get probability models for V and C. We then use the Neymen-Pearson (NP) test to segment the basic-unit into V and C. Finally, we show that the hypothesis testing turns out to be an energy detector for V-C segmentation which is similar to the first method.
  • Keywords
    principal component analysis; speech synthesis; Neymen-Pearson test; OPCA; PPCA; Thirukkural Tamil text-to-speech synthesis system; concatenative speech synthesis; hypothesis based co-articulated basic-unit segmentation; hypothesis testing; norm-contours crossover; oriented principal component analysis; probabilistic principal component analysis; subspace based co-articulated basic-unit segmentation; test feature vectors; vowel-consonant segmentation energy detector; Acoustic signal detection; Covariance matrix; Detectors; Matrix decomposition; Signal to noise ratio; Speech recognition; Speech synthesis; Testing; Vectors; Virtual colonoscopy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
  • Print_ISBN
    0-7803-8162-9
  • Type

    conf

  • DOI
    10.1109/TENCON.2003.1273351
  • Filename
    1273351