DocumentCode
1088413
Title
Distance measures for speech processing
Author
Gray, Augustine H., Jr. ; Markel, John D.
Author_Institution
University of California, Santa Barbara, CA
Volume
24
Issue
5
fYear
1976
fDate
10/1/1976
Firstpage
380
Lastpage
391
Abstract
The properties and interrelationships among four measures of distance in speech processing are theoretically and experimentally discussed. The root mean square (rms) log spectral distance, cepstral distance, likelihood ratio (minimum residual principle or delta coding (DELCO) algorithm), and a cosh measure (based upon two nonsymmetrical likelihood ratios) are considered. It is shown that the cepstral measure bounds the rms log spectral measure from below, while the cosh measure bounds it from above. A simple nonlinear transformation of the likelihood ratio is shown to be highly correlated with the rms log spectral measure over expected ranges. Relationships between distance measure values and perception are also considered. The likelihood ratio, cepstral measure, and cosh measure are easily evaluated recursively from linear prediction filter coefficients, and each has a meaningful and interrelated frequency domain interpretation. Fortran programs are presented for computing the recursively evaluated distance measures.
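The ordering stated in the abstract (cepstral measure below the rms log spectral measure, cosh measure above it) can be checked numerically. The sketch below is an illustration and not the paper's Fortran programs: it evaluates the measures by brute force on a frequency grid rather than recursively from the linear prediction coefficients. The function names, the grid size, the number of cepstral terms, and the two example filters are all assumptions chosen for the demonstration. Distances are in natural-log units, not dB.

```python
import cmath
import math


def log_spectrum(a, gain, n_points=512):
    """Return ln(gain^2 / |A(e^{j*theta})|^2) on a uniform grid over [0, 2*pi).

    `a` holds the inverse-filter coefficients [1, a_1, ..., a_M] of an
    all-pole model gain / A(z). (Hypothetical helper, not from the paper.)
    """
    values = []
    for n in range(n_points):
        theta = 2.0 * math.pi * n / n_points
        A = sum(ak * cmath.exp(-1j * theta * k) for k, ak in enumerate(a))
        values.append(math.log(gain * gain / abs(A) ** 2))
    return values


def distances(a1, g1, a2, g2, n_terms=16, n_points=512):
    """Return (cepstral, rms_log_spectral, cosh) distances between two models."""
    s1 = log_spectrum(a1, g1, n_points)
    s2 = log_spectrum(a2, g2, n_points)
    v = [x - y for x, y in zip(s1, s2)]  # log spectral difference V(theta)

    # rms log spectral distance: square root of the mean of V^2.
    rms = math.sqrt(sum(x * x for x in v) / n_points)

    # Cepstral difference coefficients of V. V is even in theta for real
    # filter coefficients, so a cosine sum is sufficient.
    def cep_diff(k):
        return sum(
            x * math.cos(2.0 * math.pi * k * n / n_points)
            for n, x in enumerate(v)
        ) / n_points

    # Truncated cepstral measure: keeps only the first n_terms coefficients,
    # so by Parseval's relation it cannot exceed the rms measure.
    cep_sq = cep_diff(0) ** 2 + 2.0 * sum(
        cep_diff(k) ** 2 for k in range(1, n_terms + 1)
    )
    cep = math.sqrt(cep_sq)

    # cosh measure: omega such that cosh(omega) - 1 = mean(cosh(V) - 1).
    mean_cosh_minus_1 = sum(math.cosh(x) - 1.0 for x in v) / n_points
    cosh_dist = math.acosh(1.0 + mean_cosh_minus_1)

    return cep, rms, cosh_dist


if __name__ == "__main__":
    # Two arbitrary first-order, minimum-phase example filters (assumed values).
    cep, rms, cosh_dist = distances([1.0, -0.9], 1.0, [1.0, -0.5], 1.0)
    print("cepstral:", cep)
    print("rms log spectral:", rms)
    print("cosh:", cosh_dist)
```

Running the script prints three values in non-decreasing order for these example filters. Raising `n_terms` moves the cepstral value up toward the rms value, which matches the lower-bound relationship described above.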
Keywords
Autocorrelation; Cepstral analysis; Euclidean distance; Nonlinear filters; Oral communication; Root mean square; Speech analysis; Speech processing; Speech recognition; Testing;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1976.1162849
Filename
1162849