• DocumentCode
    2997458
  • Title

    Spectral movement function and its application to speech recognition

  • Author

    Aikawa, Kiyoaki ; Furui, Sadaoki

  • Author_Institution
    NTT Human Interface Lab., Tokyo, Japan
  • fYear
    1988
  • fDate
    11-14 Apr 1988
  • Firstpage
    223
  • Abstract
    The authors propose a new mathematical method for extracting spectral movement in a time-sequence of speech spectrum. The spectral movement is characterized by the time and frequency derivative of a time-sequence of log spectrum envelopes. Spectral movement direction, the movement toward a higher or lower frequency region, can be identified by the sign of the proposed function. A parameter which can be used for speech segmentation is derived form this function A distance measure for speech recognition is also derived as the Euclidean distance between two spectral movement patterns extracted by the proposed function. This distance is easily calculated using cepstrum coefficients. Speech recognition results using dynamic time warping template matching with this new distance measure indicate that recognition error rate can be reduced to less than half compared with the conventional Euclidean cepstrum distance measure
  • Keywords
    speech recognition; Euclidean distance; cepstrum coefficients; distance measure; dynamic time warping template matching; error rate; log spectrum envelopes; phoneme segmentation; spectral movement function; speech recognition; speech segmentation; time-sequence; Cepstrum; Data mining; Euclidean distance; Frequency; Humans; Laboratories; Nonlinear filters; Speech recognition; Time measurement; Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.1988.196554
  • Filename
    196554