DocumentCode
3034229
Title
Diphone synthesis for phonetic vocoding
Author
Schwartz, B. ; Klovstad, J. ; Makhoul, J. ; Klatt, D. ; Zue, V.
Author_Institution
Bolt Beranek and Newman, Inc., Cambridge, Mass.
Volume
4
fYear
1979
fDate
28946
Firstpage
891
Lastpage
894
Abstract
We report on the synthesis of speech in the context of a phonetic vocoder operating at 100 b/s. With each phoneme, the vocoder transmits the duration and a single pitch value. The synthesizer uses a large inventory of diphone "models" to synthesize a desired phoneme string. The diphone inventory has been selected to differentiate between prevocalic and postvocalic allophones of sonorants, to account for changes in vowel color conditioned by postvocalic liquids, to allow exact specification of voice onset time, and to permit synthesis of glottal stops, alveolar flaps, and syllabic consonants. The diphones are extracted from carefully constructed short utterances and are stored as sequences of LPC parameters. During synthesis, the requisite diphone models are time-warped, abutted, and smoothed to produce a complete sequence of LPC parameters that is used in the synthesis. The algorithms used are described and compared with more conventional methods. Examples of the synthesized speech will be played.
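The time-warp, abut, and smooth steps named in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual algorithms, so everything here is an assumption made for illustration: the function names `time_warp` and `synthesize_track`, the use of linear interpolation for warping, the three-point moving average for smoothing, and the representation of a frame as a plain list of floats. Interpolating raw LPC predictor coefficients can yield unstable filters, so a real system would usually work in a domain such as reflection coefficients or log-area ratios; the sketch leaves that choice open.

```python
from typing import List

Frame = List[float]  # one frame of LPC-derived parameters (hypothetical layout)


def time_warp(frames: List[Frame], target_len: int) -> List[Frame]:
    """Resample a frame sequence to target_len frames by linear interpolation."""
    if not frames or target_len <= 0:
        raise ValueError("need at least one frame and a positive target length")
    n = len(frames)
    if n == 1 or target_len == 1:
        # Nothing to interpolate between: repeat the first frame.
        return [list(frames[0]) for _ in range(target_len)]
    warped = []
    for i in range(target_len):
        pos = i * (n - 1) / (target_len - 1)  # fractional index into the source
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        warped.append([a + frac * (b - a) for a, b in zip(frames[lo], frames[hi])])
    return warped


def synthesize_track(diphones: List[List[Frame]],
                     lengths: List[int],
                     span: int = 1) -> List[Frame]:
    """Warp each diphone to its target length, abut them, smooth the joins."""
    track: List[Frame] = []
    boundaries = []  # index of the first frame of each diphone after the first
    for frames, length in zip(diphones, lengths):
        if track:
            boundaries.append(len(track))
        track.extend(time_warp(frames, length))

    smoothed = [list(f) for f in track]
    for b in boundaries:
        # Smooth only the frames within `span` of the join (assumed scheme).
        for k in range(max(0, b - span), min(len(track), b + span)):
            lo, hi = max(0, k - 1), min(len(track) - 1, k + 1)
            window = track[lo:hi + 1]
            smoothed[k] = [sum(col) / len(window) for col in zip(*window)]
    return smoothed


if __name__ == "__main__":
    left = [[0.0], [0.0]]   # toy one-parameter "diphones"
    right = [[4.0], [4.0]]
    for frame in synthesize_track([left, right], [2, 2]):
        print(frame)
```

In this simplification a target length is given per diphone, whereas the vocoder described above transmits one duration per phoneme; mapping phoneme durations onto diphone halves is left out of the sketch.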
Keywords
Assembly systems; Fasteners; Interpolation; Joining processes; Linear predictive coding; Liquids; Speech synthesis; Steady-state; Synthesizers; Vocoders;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79.
Type
conf
DOI
10.1109/ICASSP.1979.1170600
Filename
1170600