• DocumentCode
    137210
  • Title

    Analysis of fricatives, stop consonants and nasals in the automatic segmentation of speech using the group delay algorithm

  • Author

    Musfir, Mohammed ; Krishnan, K. Raghava ; Murthy, Hema A.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Madras, Chennai, India
  • fYear
    2014
  • fDate
    Feb. 28 2014-March 2 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Unit Selection based speech synthesis systems (USS) require accurate labeling of units. Accurate manual segmentation of speech waveforms is a laborious task. Syllable-based systems for Indian languages use a group delay based approach for semi-automatic segmentation of speech waveforms into syllables. The performance of the group delay based algorithm is poor when the syllables contain fricatives, nasals and unvoiced stop consonants. This paper proposes a modification to the algorithm that exploits the properties of these types of units to reduce errors. In particular, the ratio of energy in the high frequency bands to that in the low frequency bands is used as a cue to segment the speech signal.
  • Keywords
    speech synthesis; Indian languages; automatic speech segmentation; fricatives; group delay algorithm; group delay based algorithm; group delay based approach; nasals; semiautomatic segmentation; speech signal; speech waveforms segmentation; stop consonants; syllable-based systems; unit selection based speech synthesis systems; Acoustics; Delays; Energy resolution; Silicon; Speech; Speech processing; Speech recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications (NCC), 2014 Twentieth National Conference on
  • Conference_Location
    Kanpur
  • Type

    conf

  • DOI
    10.1109/NCC.2014.6811364
  • Filename
    6811364
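
The energy-ratio cue mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `band_energy_ratio`, the 2000 Hz cutoff, the 8000 Hz sampling rate, the 64-sample frame and the pure-tone test signals are all assumptions chosen for illustration. The sketch only shows the general idea that a frame dominated by high-frequency energy (as fricatives typically are) yields a large high-to-low band energy ratio, while a frame dominated by low-frequency energy (as vowels and nasals typically are) yields a small one.

```python
import cmath
import math


def band_energy_ratio(frame, sample_rate, cutoff_hz=2000.0):
    """Return high-band / low-band spectral energy for one frame.

    cutoff_hz is an assumed band boundary, not a value from the paper.
    """
    n = len(frame)
    low = 0.0
    high = 0.0
    # Naive DFT over the positive-frequency bins (DC at k = 0 is skipped).
    for k in range(1, n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(frame)
        )
        energy = abs(coeff) ** 2
        if k * sample_rate / n >= cutoff_hz:
            high += energy
        else:
            low += energy
    # Small constant avoids division by zero on frames with no low-band energy.
    return high / (low + 1e-12)


def tone(freq_hz, sample_rate=8000, n=64):
    """Hypothetical test signal: a pure sine tone standing in for real speech."""
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]


# Low-frequency tone stands in for a vowel-like or nasal-like frame,
# high-frequency tone for a fricative-like frame.
vowel_like = band_energy_ratio(tone(250), 8000)
fricative_like = band_energy_ratio(tone(3000), 8000)
```

In a segmentation setting, such a ratio would be computed frame by frame, and changes in its value could serve as candidate boundary cues alongside the group delay function. How the paper combines the two is not stated in the abstract.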