• DocumentCode
    2713555
  • Title

    Hidden Markov models based on multi-space probability distribution for pitch pattern modeling

  • Author

    Tokuda, Keiichi ; Masuko, Takushi ; Miyazaki, Noboru ; Kobayashi, Takuo

  • Author_Institution
    Dept. of Comput. Eng., Nagoya Inst. of Technol., Japan
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    229
  • Abstract
    This paper discusses a hidden Markov model (HMM) based on multi-space probability distribution (MSD). The HMMs are widely-used statistical models to characterize the sequence of speech spectra and have successfully been applied to speech recognition systems. From these facts, it is considered that the HMM is useful for modeling pitch patterns of speech. However, we cannot apply the conventional discrete or continuous HMMs to pitch pattern modeling since the observation sequence of the pitch pattern is composed of one-dimensional continuous values and a discrete symbol which represents “unvoiced”. MSD-HMM includes discrete HMMs and continuous mixture HMMs as special cases, and further can model the sequence of observation vectors with variable dimension including zero-dimensional observations, i.e., discrete symbols. As a result, MSD-HMMs can model pitch patterns without heuristic assumption. We derive a reestimation algorithm for the extended HMM and show that it can find a critical point of the likelihood function
  • Keywords
    hidden Markov models; maximum likelihood estimation; probability; spectral analysis; speech processing; HMM; MSD-HMM; continuous mixture HMMs; discrete HMMs; discrete symbol; discrete symbols; hidden Markov models; likelihood function; multi-space probability distribution; observation sequence; one-dimensional continuous values; pitch pattern modeling; reestimation algorithm; speech recognition; speech spectra; statistical models; unvoiced; zero-dimensional observations; Computer science; Context modeling; Continuing education; Hidden Markov models; Laboratories; Probability density function; Probability distribution; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758104
  • Filename
    758104