• DocumentCode
    1094161
  • Title

    Distortion measures for speech processing

  • Author

    Gray, Robert M. ; Buzo, Andrés ; Gray, Augustine H., Jr. ; Matsuyama, Yasuo

  • Author_Institution
    Stanford University, Stanford, CA, USA
  • Volume
    28
  • Issue
    4
  • fYear
    1980
  • fDate
    8/1/1980 12:00:00 AM
  • Firstpage
    367
  • Lastpage
    376
  • Abstract
    Several properties, interrelations, and interpretations are developed for various speech spectral distortion measures. The principal results are 1) the development of notions of relative strength and equivalence of the various distortion measures both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; 2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) the demonstration that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in both classical cluster analysis techniques and in algorithms for optimal quantizer design. We also argue that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.
  • Keywords
    Acoustic distortion; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Mathematical model; Nearest neighbor searches; Quantization; Speech analysis; Speech coding; Speech processing
  • fLanguage
    English
  • Journal_Title
    Acoustics, Speech and Signal Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0096-3518
  • Type

    jour

  • DOI
    10.1109/TASSP.1980.1163421
  • Filename
    1163421