Title :
Maximum intelligibility-based close-loop speech synthesis framework for noisy environments
Author :
Yuan-Fu Liao ; Ming-Long Wu ; Jia-Chi Lin
Author_Institution :
Dept. of Electron. Eng., Nat. Taipei Univ. of Technol., Taipei, Taiwan
Abstract :
This paper proposes a maximum intelligibility (MI)-based close-loop speech synthesis framework to actively compensate for the distortion of background noises. In this framework, an extra environmental noise-sensing microphone and an automatic speech recognition (ASR) module are utilized to approximate a subjective intelligibility measure. The hidden Markov model-based speech synthesis system (HTS) is then online adjusted by using the MI-based model adaptation algorithm. Experimental results of two subjective listening tests in noisy environments show that the proposed approach obtains 64% of the votes in an A/B preference test and helps the participants reduce word dictation errors by relative 26% when compared to an HTS baseline.
Keywords :
hidden Markov models; speech synthesis; automatic speech recognition module; background noise distortion; hidden Markov model; maximum intelligibility based close-loop speech synthesis; noise-sensing microphone; noisy environment; speech synthesis system; Hidden Markov models; High-temperature superconductors; Noise; Noise measurement; Speech; Speech recognition; Speech synthesis; Speech synthesis; automatic speech recognition; minimum classification error; speech intelligibility;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639222