DocumentCode :
1856444
Title :
Text-directed speech enhancement using phoneme classification and feature map constrained vector quantization
Author :
Pellom, Bryan L. ; Hansen, John H L
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume :
2
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
645
Abstract :
This paper presents and evaluates a novel text-directed speech enhancement algorithm for usage in non-real-time applications. In our approach, the text of the intended dialogue is used to partition noisy speech into regions of broad phoneme classifications. Classes considered include stops, fricatives, affricates, nasals, vowels, semivowels, diphthongs and silence. These partitions are then used to direct a new vector quantizer based enhancement scheme in which class directed constraints are applied to improve speech quality. Objective enhancement evaluations conducted across 100 sentences of the TIMIT database indicate consistent improvement in speech quality for actual helicopter fly-by noise, aircraft cockpit noise, and automobile highway noise at signal-to-noise ratios ranging from -5 to 10 dB. Subjective quality assessment was conducted in the form of an A-B comparison test. Results of these evaluations demonstrate that, for wideband noise distortion, the proposed algorithm is preferred over unprocessed noisy speech more than 2 to 1, while the proposed algorithm is preferred over spectral subtraction processed speech by more than 3 to 1
Keywords :
acoustic noise; acoustic signal processing; speech coding; speech enhancement; speech intelligibility; speech processing; vector quantisation; HMM; TIMIT database; affricates; aircraft cockpit noise; automobile highway noise; class directed constraints; diphthongs; feature map constrained vector quantization; fricatives; helicopter fly-by noise; nasals; noisy speech; objective enhancement; phoneme classification; phoneme classifications; semivowels; signal-to-noise ratios; silence; speech quality; stops; subjective quality assessment; text-directed speech enhancement; vowels; Aircraft; Automobiles; Databases; Helicopters; Partitioning algorithms; Road transportation; Signal to noise ratio; Speech analysis; Speech enhancement; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.543203
Filename :
543203
Link To Document :
بازگشت