DocumentCode
1094161
Title
Distortion measures for speech processing
Author
Gray, Robert M. ; Buzo, Andrés ; Gray, Augustine H., Jr. ; Matsuyama, Yasuo
Author_Institution
Stanford University, Stanford, CA, USA
Volume
28
Issue
4
fYear
1980
fDate
8/1/1980 12:00:00 AM
Firstpage
367
Lastpage
376
Abstract
Several properties, interrelations, and interpretations are developed for various speech spectral distortion measures. The principal results are 1) the development of notions of relative strength and equivalence of the various distortion measures both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; 2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in both classical cluster analysis techniques and in algorithms for optimal quantizer design. We also argue that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.
Keywords
Acoustic distortion; Algorithm design and analysis; Clustering algorithms; Distortion measurement; Mathematical model; Nearest neighbor searches; Quantization; Speech analysis; Speech coding; Speech processing
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
IEEE
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1980.1163421
Filename
1163421