مرکز منطقه ای اطلاع رساني علوم و فناوري - Speech synthesis in the time domain from text

DocumentCode :

3059438

Title :

Speech synthesis in the time domain from text

Author :

Grossmann, E.

Author_Institution :

Heinrich-Hertz-Institut Für Nachrichtentechnik, Berlin, W.Germany

Volume :

fYear :

1982

fDate :

30072

Firstpage :

936

Lastpage :

939

Abstract :

A very efficient method to synthesize speech is the combination of digitally stored monophones and transients in the time domain, because of its good intellegibility and the easy implementation which allows a very inexpensive realisation. A disadvantage inherent in procedures in the time domain has so far been the fact, that they required a very large sized memory, the size of which increased in multiples if prosodic parameters were also taken into account. We developed a system which synthesizes an unlimited vocabulary with a memory of only 22 kBytes (8-Bit Bytes). Further investigations showed, that by means of the linear prediction method it is possible to control the fundamental frequency of the speech signal in a wide range without storing additional speech segments. In addition to this work, we developed a system to transform orthografic text into a phoneme string automatically. We optimized this algorithm for the 8000 most frequent words of the German language. The whole system which is implemented on a microprozessor, is placed on a single board, with a storage of total 32 kBytes (8-Bit Bytes).

Keywords :

Control system synthesis; Explosives; Frequency; Natural languages; Prediction methods; Signal analysis; Signal synthesis; Speech synthesis; Vocabulary; Vocoders;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '82.

Type :

conf

DOI :

10.1109/ICASSP.1982.1171863

Filename :

1171863

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3059438