Enhancement of esophagus speech using harmonic plus noise model

Author

Lehana, Parveen K. ; Gupta, Rakesh K. ; Kumari, Santoresh

Author_Institution

Dept. of Phys. & Electron., Jammu Univ., India

Volume

A

fYear

2004

fDate

21-24 Nov. 2004

Firstpage

669

Abstract

Patients whose voice boxes have been removed use either artificial larynx or esophagus as a source of excitation to the vocal tract. This excitation is shaped by the vocal tract to produce the speech output. Artificial larynx is a costly device and continuous use of batteries increases the recurring expenses. Further, the presence of background noise degrades the speech qualify. Production of esophagus speech is more convenient and advantageous. The production of speech using esophagus is cheap, as no extra device is needed. Also the hands are free while speaking for doing some other work. But main problem of the esophagus speech is the very low amplitude of the output speech, intelligibility, and naturalness. The objective of this paper is to enhance the intelligibility of esophagus speech by using harmonic plus noise model (HNM). Esophagus speech is analyzed and synthesized using HNM and informal listening tests are conducted for accessing the improvement in the synthesized speech. Investigations show that the output is more natural and intelligible as compared to input speech signal but there is an increase in the random noise. This noise is uncorrelated to the input and can be suppressed by conventional techniques such as spectral subtraction method. Experiments are also carried out for enhancing the output of HNM by using Klatt synthesizer. Results show that the output of Klatt is almost clear as compared to the HNM output.

Keywords

harmonic analysis; medical signal processing; prosthetics; random noise; signal denoising; speech enhancement; speech intelligibility; speech synthesis; Klatt synthesizer; artificial larynx; esophagus speech enhancement; esophagus speech production; harmonic plus noise model; informal listening tests; noise suppression; random noise; spectral subtraction method; speech intelligibility; speech naturalness; speech qualify; speech synthesis; vocal tract; Background noise; Batteries; Degradation; Esophagus; Larynx; Noise shaping; Signal synthesis; Speech analysis; Speech enhancement; Speech synthesis;

fLanguage

English

Publisher

ieee

Conference_Titel

TENCON 2004. 2004 IEEE Region 10 Conference

Print_ISBN

0-7803-8560-8

Type

conf

DOI

10.1109/TENCON.2004.1414509

Filename

1414509