DocumentCode
3010457
Title
Nonlinear frequency warp for speech recognition
Author
Blomberg, Mats ; Elenius, Kjell
Author_Institution
Department of Speech Communication and Music Acoustics, KTH, Stockholm, Sweden
Volume
11
fYear
1986
fDate
31503
Firstpage
2631
Lastpage
2634
Abstract
A technique of nonlinear frequency warping has been investigated for recognition of Swedish vowels. A frequency warp between two spectra is computed using a standard dynamic programming algorithm. The frequency distance, defined as the area between the obtained warping function and the diagonal, contributes to the spectral distance. The distance between two spectra is a weighted sum of the warped amplitude distance and the frequency distance. By changing the two weights, we get a gradual shift between non-warped amplitude distance, warped amplitude distance, and frequency distance. In recognition experiments on natural and synthetic vowel spectra, a metric combining the frequency and amplitude distances gave better results than using only amplitude or frequency deviation. Analysis of the results for the synthetic vowels shows a reduced sensitivity to voice source and pitch variation. For the natural vowels, the recognition improvement is larger for the male and female speakers separately than for the combined groups.
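The distance described in the abstract can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything below is an assumption made for illustration: the function name `warp_distance`, the absolute-difference amplitude cost, the three-step dynamic programming recursion, the approximation of the area between the warping function and the diagonal as the summed bin offset |i - j| along the path, and the choice to apply both weights inside the dynamic programming cost. It is not the authors' implementation.

```python
def warp_distance(a, b, w_amp=1.0, w_freq=0.0):
    """Sketch of a frequency-warped spectral distance (illustrative only).

    a, b   : equal-length lists of spectral amplitudes, one value per bin.
    w_amp  : weight on the amplitude difference (hypothetical parameter).
    w_freq : weight on the deviation from the diagonal (hypothetical parameter).
    Returns (total, amp_dist, freq_dist, path).
    """
    n = len(a)
    if n == 0 or len(b) != n:
        raise ValueError("spectra must be non-empty and of equal length")
    inf = float("inf")

    def local(i, j):
        # Assumed local cost: weighted amplitude difference plus a penalty
        # for mapping bin i of `a` onto a different bin j of `b`.
        return w_amp * abs(a[i] - b[j]) + w_freq * abs(i - j)

    cost = [[inf] * n for _ in range(n)]
    back = [[None] * n for _ in range(n)]
    cost[0][0] = local(0, 0)
    for i in range(n):
        for j in range(n):
            if i == 0 and j == 0:
                continue
            best, prev = inf, None
            # The diagonal step is tried first, so it wins ties.
            for pi, pj in ((i - 1, j - 1), (i - 1, j), (i, j - 1)):
                if pi >= 0 and pj >= 0 and cost[pi][pj] < best:
                    best, prev = cost[pi][pj], (pi, pj)
            cost[i][j] = best + local(i, j)
            back[i][j] = prev

    # Backtrack from the last bin pair to the first to recover the warp.
    path = [(n - 1, n - 1)]
    while path[-1] != (0, 0):
        i, j = path[-1]
        path.append(back[i][j])
    path.reverse()

    amp_dist = sum(abs(a[i] - b[j]) for i, j in path)   # warped amplitude distance
    freq_dist = sum(abs(i - j) for i, j in path)        # area proxy vs. the diagonal
    total = w_amp * amp_dist + w_freq * freq_dist
    return total, amp_dist, freq_dist, path


# Two toy spectra whose single peak is shifted by one bin.
a = [0, 0, 10, 0, 0, 0]
b = [0, 0, 0, 10, 0, 0]
free = warp_distance(a, b, w_amp=1.0, w_freq=0.0)      # warping is free
rigid = warp_distance(a, b, w_amp=1.0, w_freq=1000.0)  # warping is prohibitive
```

Under these assumptions, a zero frequency weight lets the warp absorb the peak shift entirely (zero amplitude distance, nonzero frequency distance), while a very large frequency weight forces the path onto the diagonal and yields the non-warped amplitude distance. Intermediate weights trade one off against the other, which mirrors the gradual shift the abstract describes.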
Keywords
Distance measurement; Dynamic programming; Equations; Frequency; Shape; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type
conf
DOI
10.1109/ICASSP.1986.1169305
Filename
1169305