• DocumentCode
    393957
  • Title

    Joint pitch and voicing estimation for multiband excitation and sinusoidal speech coders

  • Author

    Jia, Wenhui ; Chan, Wui-Yip

  • Author_Institution
    Brooktrout Technol., Los Gatos, CA, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    3-6 Nov. 2002
  • Firstpage
    210
  • Abstract
    In conventional multi-band excitation (MBE) speech encoding, pitch is estimated first from the speech signal. Using the estimated pitch, voicing decisions are made for pitch-spaced spectral bands. As the method invariably includes unvoiced components in the speech signal to estimate the pitch, the accuracy of the estimated pitch and voicing decisions are degraded. A novel pitch and voicing estimation scheme is presented, wherein the spectrum of the speech signal is segmented into voiced and unvoiced regions without knowledge of the pitch. Pitch is then estimated only from the voice regions. Experimental results show that the new scheme improves the accuracy of the estimated pitch and voicing decisions, and offers better speech quality.
  • Keywords
    frequency estimation; spectral analysis; speech coding; vocoders; MBE; estimated pitch; joint pitch estimation; multiband excitation; pitch-spaced spectral bands; sinusoidal speech coders; speech signal spectrum; unvoiced components; unvoiced regions; voiced regions; voicing decisions; voicing estimation; Computer errors; Degradation; Discrete Fourier transforms; Frequency conversion; Frequency estimation; Frequency synthesizers; Harmonic analysis; Speech analysis; Speech coding; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2002. Conference Record of the Thirty-Sixth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA, USA
  • ISSN
    1058-6393
  • Print_ISBN
    0-7803-7576-9
  • Type

    conf

  • DOI
    10.1109/ACSSC.2002.1197178
  • Filename
    1197178