• DocumentCode
    323834
  • Title

    Parametric subspace modelling of speech transitions

  • Author

    Reinhard, Klaus ; Niranjan, Mahesan

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    1105
  • Abstract
    We report on attempting to capture segmental transition information for speech recognition tasks. The slowly varying dynamics of spectral trajectories carries much discriminant information that is very crudely modelled by traditional approaches such as HMMs. In attempts such as recurrent neural networks there is the hope, but not convincing demonstration, that such transitional information could be captured. We start from the very different position of explicitly capturing the trajectory of short time spectral parameter vectors on a subspace in which the temporal sequence information is preserved (time constrained principal component analysis). On this subspace, we attempt a parametric modelling of the trajectory, and compute a distance metric to perform classification of diphones. Much of the discriminant information is still retained in this subspace. This is illustrated on the isolated transitions /bee/,/dee/ and /gee/
  • Keywords
    parameter estimation; pattern classification; recurrent neural nets; spectral analysis; speech processing; HMM; diphones classification; discriminant information; distance metric; isolated speech transitions; parametric subspace modelling; recurrent neural networks; segmental transition information; short time spectral parameter vectors; slowly varying dynamics; spectral trajectories; speech recognition; temporal sequence information; time constrained principal component analysis; Context modeling; Hidden Markov models; Information resources; Parameter estimation; Parametric statistics; Principal component analysis; Recurrent neural networks; Speech recognition; Subspace constraints; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675462
  • Filename
    675462