DocumentCode
430954
Title
Enhancement of esophagus speech using harmonic plus noise model
Author
Lehana, Parveen K. ; Gupta, Rakesh K. ; Kumari, Santoresh
Author_Institution
Dept. of Phys. & Electron., Jammu Univ., India
Volume
A
fYear
2004
fDate
21-24 Nov. 2004
Firstpage
669
Abstract
Patients whose voice boxes have been removed use either an artificial larynx or the esophagus as the source of excitation to the vocal tract. This excitation is shaped by the vocal tract to produce the speech output. An artificial larynx is a costly device, and continuous use of batteries adds to the recurring expenses. Further, the presence of background noise degrades the speech quality. Production of esophagus speech is more convenient and advantageous: it is cheap, as no extra device is needed, and the hands remain free for other work while speaking. The main problems of esophagus speech are the very low amplitude of the output speech and its poor intelligibility and naturalness. The objective of this paper is to enhance the intelligibility of esophagus speech by using the harmonic plus noise model (HNM). Esophagus speech is analyzed and synthesized using HNM, and informal listening tests are conducted to assess the improvement in the synthesized speech. Investigations show that the output is more natural and intelligible than the input speech signal, but there is an increase in random noise. This noise is uncorrelated with the input and can be suppressed by conventional techniques such as the spectral subtraction method. Experiments are also carried out on enhancing the output of HNM by using the Klatt synthesizer. Results show that the output of the Klatt synthesizer is almost clear compared with the HNM output.
Keywords
harmonic analysis; medical signal processing; prosthetics; random noise; signal denoising; speech enhancement; speech intelligibility; speech synthesis; Klatt synthesizer; artificial larynx; esophagus speech enhancement; esophagus speech production; harmonic plus noise model; informal listening tests; noise suppression; random noise; spectral subtraction method; speech intelligibility; speech naturalness; speech quality; speech synthesis; vocal tract; Background noise; Batteries; Degradation; Esophagus; Larynx; Noise shaping; Signal synthesis; Speech analysis; Speech enhancement; Speech synthesis;
fLanguage
English
Publisher
IEEE
Conference_Titel
TENCON 2004. 2004 IEEE Region 10 Conference
Print_ISBN
0-7803-8560-8
Type
conf
DOI
10.1109/TENCON.2004.1414509
Filename
1414509