Title :
Pitch determination considering laryngealization effects in spoken dialogs
Author :
Niemann, H. ; Denzler, J. ; Kahles, B. ; Kompe, R. ; Kiessling, A. ; Noth, E. ; Strom, V.
Author_Institution :
Lehrstuhl fur Mustererkennung, Erlangen-Nurnberg Univ., Germany
fDate :
27 Jun-2 Jul 1994
Abstract :
A frequent phenomenon in spoken dialogs of the information seeking type are short elliptic utterances whose mood (declarative or interrogative) can only be distinguished by intonation. The main acoustic evidence is conveyed by the fundamental frequency or F0 -contour. Many algorithms for F0 determination have been reported in the literature. A common problem are irregularities of speech known as `laryngealizations´. This article describes an approach based on neural network techniques for the improved determination of fundamental frequency. First, an improved version of the authors´ neural network algorithm for reconstruction of the voice source signal (glottis signal) is presented. Second, the reconstructed voice source signal is used as input to another neural network distinguishing the three classes `voiceless´, `voiced non-laryngealized´, and `voiced laryngealized´. Third, the results are used to improve an existing F0 algorithm. Results of this approach are presented and discussed in the context of the application in a spoken dialog system
Keywords :
neural nets; speech recognition; F0-contour; declarative; fundamental frequency; glottis signal; information seeking dialog; interrogative; intonation; laryngealization effects; neural network techniques; pitch determination; short elliptic utterances; spoken dialogs; Control systems; Dynamic programming; Filtering; Frequency; Mood; Neural networks; Speech recognition; Variable structure systems;
Conference_Titel :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
DOI :
10.1109/ICNN.1994.374988