• DocumentCode
    3010457
  • Title

    Nonlinear frequency warp for speech recognition

  • Author

    Blomberg, Mats ; Elenius, Kjell

  • Author_Institution
    Department of Speech Communication and Music Acoustics, KTH, Stockholm, Sweden
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    2631
  • Lastpage
    2634
  • Abstract
    A technique of nonlinear frequency warping has been investigated for recognition of Swedish vowels. A frequency warp between two spectra is computed using a standard dynamic programming algorithm. The frequency distance, defined as the area between the obtained warping function and the diagonal, is contributing to the spectral distance. The distance between two spectra is a weighted sum of the warped amplitude distance and the frequency distance. By changing two weights, we get a gradual shift between non-warped amplitude distance, warped amplitude distance, and frequency distance. In recognition experiments on natural and synthetic vowel spectra, a metric combining the frequency and amplitude distances gave better results than using only amplitude or frequency deviation. Analysis of the results of the synthetic vowels show a reduced sensitivity to voice source and pitch variation. For the natural vowels, the recognition improvement is larger for the male and female speakers separately than for the combined groups.
  • Keywords
    Distance measurement; Dynamic programming; Equations; Frequency; Shape; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1169305
  • Filename
    1169305