DocumentCode :
388075
Title :
Modeling spectral speech transitions using temporal decomposition techniques
Author :
Ahlbom, Gunnar ; Bimbot, Frédéric ; Chollet, Gérard
Author_Institution :
ENST, Paris Cedex, France
Volume :
12
fYear :
1987
fDate :
31868
Firstpage :
13
Lastpage :
16
Abstract :
ATAL [1] introduced a technique for decomposing speech into phone-length temporal events in terms of overlapping and interacting articulatory gestures. This paper reports on simplifications of this technique with applications to acoustic-phonetic synthesis. Spectral evolution is represented by time-indexed trajectories in the p-dimensional space of Log-Area Ratios {y_{i}= Ln ((1+k_{i})/(1-k_{i}))} where kiare the reflection coefficients obtained from short-time stationary LPC analysis. The vocal tract configuration (spectral vector) associated with each interpolation function belongs to a finite set of articulatory targets (vector quantization code book). A set of speech segments ("polysons") has been encoded using this technique. It includes diphones, demi-syllables, and other units that are difficult to segment. Temporal decomposition using target spectra can break the complex encoding of these segments. In particular, coarticulation effects are analyticaiy explained and modeled. It is demonstrated that these new tools provide an adequate environment in our search for better rules in acoustic speech synthesis.
Keywords :
Eigenvalues and eigenfunctions; Encoding; Interpolation; Linear predictive coding; Matrices; Matrix decomposition; Robustness; Singular value decomposition; Space stations; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type :
conf
DOI :
10.1109/ICASSP.1987.1169742
Filename :
1169742
Link To Document :
بازگشت