• DocumentCode
    3040417
  • Title

    Frequency-axis warping to improve automatic word recognition

  • Author

    Neuburg, Evwarv P.

  • Author_Institution
    Department of Defense, Meade, MD
  • Volume
    5
  • fYear
    1980
  • fDate
    29312
  • Firstpage
    166
  • Lastpage
    168
  • Abstract
    Frequency normalization of talkers remains a problem in word recognition, especially where new talkers cannot be asked to provide samples (of their vowels, for example) in advance. Several methods were investigated; for each, parameters were derived by calculating their effect on formant histograms derived from casual speech. Methods tried were a) uniform multiplication of frequencies ("stretching" the vocal tract); b) "stretching" each formant region by a different amount; c) combined shift and stretch (affine mapping); d) different affine mappings for different formants (this includes warping each formant as a function of its range); e) warping each formant non-linearly as a function of its distribution. Experiments show that parameters derived from casual speech improve vowel recognition markedly, and that method e) appears strongest.
  • Keywords
    Automatic speech recognition; Bandwidth; Frequency; Government; Histograms; Loudspeakers; Pattern matching; Pattern recognition; Protection; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '80.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1980.1170907
  • Filename
    1170907