  • DocumentCode
    1088413
  • Title
    Distance measures for speech processing
  • Author
    Gray, Augustine H., Jr. ; Markel, John D.
  • Author_Institution
    University of California, Santa Barbara, CA
  • Volume
    24
  • Issue
    5
  • fYear
    1976
  • fDate
    10/1/1976
  • Firstpage
    380
  • Lastpage
    391
  • Abstract
    The properties and interrelationships among four measures of distance in speech processing are theoretically and experimentally discussed. The root mean square (rms) log spectral distance, cepstral distance, likelihood ratio (minimum residual principle or delta coding (DELCO) algorithm), and a cosh measure (based upon two nonsymmetrical likelihood ratios) are considered. It is shown that the cepstral measure bounds the rms log spectral measure from below, while the cosh measure bounds it from above. A simple nonlinear transformation of the likelihood ratio is shown to be highly correlated with the rms log spectral measure over expected ranges. Relationships between distance measure values and perception are also considered. The likelihood ratio, cepstral measure, and cosh measure are easily evaluated recursively from linear prediction filter coefficients, and each has a meaningful and interrelated frequency domain interpretation. Fortran programs are presented for computing the recursively evaluated distance measures.
  • Keywords
    Autocorrelation; Cepstral analysis; Euclidean distance; Nonlinear filters; Oral communication; Root mean square; Speech analysis; Speech processing; Speech recognition; Testing
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0096-3518
  • Type
    jour
  • DOI
    10.1109/TASSP.1976.1162849
  • Filename
    1162849