Title :
Analysis-by-synthesis method for whisper-speech reconstruction
Author :
Ahmadi, Farzaneh ; McLoughlin, Ian Vince ; Sharifzadeh, Hamid Reza
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
fDate :
Nov. 30 2008-Dec. 3 2008
Abstract :
In the following paper, a method for the real-time conversion of whispers to normal phonated speech through a code excited linear prediction analysis-by-synthesis codec is discussed. This approach uses a template of a speakerpsilas normal phonated speech for extraction of excitation parameters such as pitch and gain, and then injects these estimated excitations into whispered signal to synthesize normal-sounding speech through the CELP codec. Furthermore, since restoring pitch to whispered speech requires some considerations of quality and accuracy, spectral enhancements are required in terms of formant shifting (LSPs modification) and pitch injection based on voiced/unvoiced decision. Spectral shifting is accomplished through line-spectral pair adjustment. Implementing such methods by using the popular CELP codec allows integration of the technique with any modern speech applications and devices. Subjective testing results are presented to determine the effectiveness of the technique.
Keywords :
linear predictive coding; signal reconstruction; spectral analysis; speech codecs; speech enhancement; speech synthesis; CELP codec; analysis-by-synthesis codec; code-excited linear prediction codec; formant shifting; line-spectral pair adjustment; normal phonated speech; normal-sounding speech synthesis; pitch injection; real-time whisper conversion; spectral enhancement; spectral shifting; speech quality; subjective testing; whisper-speech reconstruction; Books; Filters; Linear predictive coding; Paper technology; Speech analysis; Speech codecs; Speech coding; Speech enhancement; Speech synthesis; Testing;
Conference_Titel :
Circuits and Systems, 2008. APCCAS 2008. IEEE Asia Pacific Conference on
Conference_Location :
Macao
Print_ISBN :
978-1-4244-2341-5
Electronic_ISBN :
978-1-4244-2342-2
DOI :
10.1109/APCCAS.2008.4746261