• DocumentCode
    2179568
  • Title

    Decomposition of speech signals for analysis of aperiodic components of excitation

  • Author

    Yegnanarayana, B. ; Joseph, M. Anand ; Suryakanth, V.G. ; Dhananjaya, N.

  • Author_Institution
    Language Technol. Res. Center, Int. Inst. of Inf. Technol., Hyderabad, India
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5396
  • Lastpage
    5399
  • Abstract
    The motivation for this study is the need for careful analysis of aperiodicity of the excitation component in expressive voices. The paper proposes analysis methods which can preserve the excitation information corresponding to sequence of impulse-like excitation with variable strengths. To analyze the details of the excitation source characteristics, the epochs and the strength of the excitation at the epochs are obtained using the output of an ideal zero-frequency digital resonator. The vocal tract system characteristics are derived from the signal between two successive epochs using the numerator of the group delay function. The spectrogram of the zero-frequency filtered signal and the group delay spectrum correspond to characteristics of the excitation and the vocal tract system, respectively. Decomposition of the speech signal into these two components bring out the features of excitation and vocal tract system, which can be used to explain the perception of expressive voices in terms of features of aperiodicity, pitch, harmonics and sub-harmonics. The decomposition method is illustrated using examples from linguistically significant glottalized sounds (glottal stops and ejectives), singing voices and Noh voice.
  • Keywords
    speech processing; aperiodic component analysis; excitation component; group delay spectrum; impulse-like excitation; speech signal decomposition; vocal tract system; zero-frequency digital resonator; zero-frequency filtered signal; Correlation; Data mining; Delay; Harmonic analysis; Production; Spectrogram; Speech; Epochs; Noh voice; aperiodicity; glottalized sounds; group delay spectra; singing voice; subharmonics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947578
  • Filename
    5947578