• DocumentCode
    337528
  • Title

    Diphone multi-trajectory subspace models

  • Author

    Reinhard, Klaus ; Niranjan, Mahesan

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    1001
  • Abstract
    We report on the extension of capturing speech transitions embedded in diphones using trajectory models. The slowly varying dynamics of spectral trajectories carry much discriminant information that is very crudely modelled by traditional approaches such as HMMs. We improved our methodology of explicitly capturing the trajectory of short time spectral parameter vectors introducing multi-trajectory concepts in a probabilistic framework. Optimal subspace selection is presented which finds the most discriminant plane for classification. Using the E-set from the TIMIT database results suggest that discriminant information is preserved in the subspace
  • Keywords
    parameter space methods; probability; signal classification; spectral analysis; speech processing; E-set; HMM; TIMIT database results; classification; diphone multi-trajectory subspace models; discriminant information; optimal subspace selection; probabilistic framework; short time spectral parameter vectors; slowly varying dynamics; spectral trajectories; speech signals; speech transitions; Context modeling; Databases; Explosions; Hidden Markov models; Information resources; Parameter estimation; Principal component analysis; Recurrent neural networks; Robustness; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759870
  • Filename
    759870