• DocumentCode
    310621
  • Title

    Fast and robust joint estimation of vocal tract and voice source parameters

  • Author

    Ding, Wen ; Campbell, Nick ; Higuchi, Norio ; Kasuya, Hideki

  • Author_Institution
    ATR Interpreting Telephony Res. Labs., Kyoto, Japan
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1291
  • Abstract
    A new pitch-synchronous method of joint estimation is described to estimate vocal tract and voice source parameters from speech signals based on an autoregressive model with an exogenous input (ARX) model. The method uses Kalman filtering to estimate the time-varying coefficients and simulated annealing to deal with the non-linear optimization of Rosenberg-Klatt parameters. A compact method is suggested in the algorithm in order to reduce the computation cost. Further, an automatic model order selection method is proposed to determine the proper analysis pole-order of the ARX model, based on the estimated formant bandwidths. The new method has been shown to be much faster than our previous method and the order selection technique has been shown to be effective. Finally, an ATR two-channel speech database including varying sentence-level prominence patterns is used to verify the proposed method
  • Keywords
    Kalman filters; autoregressive processes; filtering theory; parameter estimation; simulated annealing; speech processing; ARX model; ATR two-channel speech database; Kalman filtering; analysis pole-order; automatic model order selection method; autoregressive model; computation cost reduction; estimated formant bandwidths; exogenous input; fast joint estimation; nonlinear optimization; pitch synchronous method; robust joint estimation; sentence level prominence patterns; simulated annealing; speech signals; time-varying coefficients; vocal tract parameters; voice source parameters; Bandwidth; Computational efficiency; Computational modeling; Databases; Filtering; Kalman filters; Optimization methods; Robustness; Simulated annealing; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596182
  • Filename
    596182