Title :
Neural network-based voice quality measurement technique
Author :
Tarraf, Ahmed ; Meyers, Martin
Author_Institution :
AT&T Bell Labs., Whippany, NJ, USA
Abstract :
In this paper, we present a novel neural network-based predictor for subjective quality of speech signals. The output from the predictor is the estimated subjective quality or mean opinion score (MOS). The internal representation of signals is calculated using a model for the human auditory system. The perceptual distance between the reference speech and the speech sample under test is used as input to the neural network, which is then trained to model the underlying relationship between this perceptual distance and its subjective quality (MOS). Accurate MOS predictions have been demonstrated for speech coders used in common wireless applications including AMPS, TDMA, GSM and CDMA. MOS values predicted by the neural network MOS machine (NN-MM) were validated for clean and corrupted channels as well as for background noise conditions. Prediction accuracy is an order of magnitude better than anything previously reported, with worst case errors on the order of 0.05 MOS point
Keywords :
feature extraction; hearing; multilayer perceptrons; prediction theory; speech processing; AMPS; CDMA; GSM; TDMA; background noise; common wireless applications; feature extraction; human auditory system; mean opinion score; multilayer perceptron; neural network-based predictor; perceptual distance; prediction accuracy; reference speech; speech sample; speech signals; subjective quality; voice quality measurement technique; Auditory system; Background noise; GSM; Humans; Measurement techniques; Multiaccess communication; Neural networks; Speech; Testing; Time division multiple access;
Conference_Titel :
Computers and Communications, 1999. Proceedings. IEEE International Symposium on
Conference_Location :
Red Sea
Print_ISBN :
0-7695-0250-4
DOI :
10.1109/ISCC.1999.780923