Title :
Speech synthesis in the time domain from text
Author_Institution :
Heinrich-Hertz-Institut für Nachrichtentechnik, Berlin, W. Germany
Abstract :
Combining digitally stored monophones and transients in the time domain is a very efficient method of synthesizing speech: it offers good intelligibility and is easy to implement, which allows a very inexpensive realisation. A disadvantage inherent in time-domain procedures has so far been that they required a very large memory, whose size multiplied if prosodic parameters were also taken into account. We developed a system that synthesizes an unlimited vocabulary with a memory of only 22 kBytes (8-bit bytes). Further investigations showed that, by means of the linear prediction method, the fundamental frequency of the speech signal can be controlled over a wide range without storing additional speech segments. In addition to this work, we developed a system that automatically transforms orthographic text into a phoneme string. We optimized this algorithm for the 8000 most frequent words of the German language. The whole system, which is implemented on a microprocessor, fits on a single board with a total storage of 32 kBytes (8-bit bytes).
Keywords :
Control system synthesis; Explosives; Frequency; Natural languages; Prediction methods; Signal analysis; Signal synthesis; Speech synthesis; Vocabulary; Vocoders;
Conference_Title :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.
DOI :
10.1109/ICASSP.1982.1171863