• DocumentCode
    294616
  • Title

    Enhancement of discriminative capabilities of HMM based recognizer through modification of Viterbi algorithm

  • Author

    Song, Jianming

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Wollongong Univ., NSW, Australia
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    469
  • Abstract
    The algorithm proposed in this paper integrates the concepts of variable frame rate and discriminative analysis based on Tanimoto ratio to modify the conventional Viterbi algorithm, in such a way that the steady or stationary signal is compressed, while transitional or non-stationary signal is emphasized through the frame-by-frame searching process. The usefulness of each frame is decided entirely within the Viterbi process and needs not to be the same for different models. To evaluate this algorithm, we tested a speech database of 9 highly confusable E-set English letters. With 5 state and 6 mixture components, the conventional HMM baseline system only delivered a recognition accuracy of 73.9%. In the preliminary experiment using the algorithm proposed, the recognition accuracy was increased to 82.5%
  • Keywords
    Viterbi decoding; hidden Markov models; maximum likelihood estimation; speech enhancement; speech processing; speech recognition; HMM based recognizer; HMM baseline system; Tanimoto ratio; Viterbi decoding; discriminative analysis; discriminative capabilities; experiment; frame-by-frame searching process; highly confusable E-set English letters; mixture components; modified Viterbi algorithm; nonstationary signal; recognition accuracy; speech database; speech enhancement; stationary signal compression; transitional signal; variable frame rate; Australia; Cepstral analysis; Databases; Hidden Markov models; Maximum likelihood estimation; Power system modeling; Probability density function; Signal processing; Speech; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479630
  • Filename
    479630