DocumentCode
3062838
Title
On the generation and use of a segment dictionary for speech coding, synthesis and recognition
Author
Chollet, G. ; Galliano, J.F. ; Lefevre, J.P. ; Viara, E.
Author_Institution
ENST, Paris, Cedex
Volume
8
fYear
1983
fDate
30407
Firstpage
1328
Lastpage
1331
Abstract
A methodology is described to obtain a set of segments and rules that represents adequately the speech performance of a given speaker. This methodology proceeds from an initial set of diphones extracted from a neutral context and modify this set with larger and/or smaller segments depending on the match with natural utterances. Each segment is stored as a sequence of frames coded using LPC coefficients. An estimate of the likelihood of timescale distortion is associated with each frame. It represents knowledge on temporal variability that can be used by synthesis rules and/or pattern matching algorithms. It is then shown how such a segment data base can be used for 1) speech coding at very low bit rate ( ∼ 400 bit/sec), 2) synthesis from unrestricted text, 3) continuous speech recognition.
Keywords
Acoustic distortion; Bit rate; Costs; Dictionaries; Linear predictive coding; Pattern matching; Speech coding; Speech processing; Speech recognition; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
Type
conf
DOI
10.1109/ICASSP.1983.1172018
Filename
1172018
Link To Document