DocumentCode
3783704
Title
Diphone-like units without phonemes - option for very low bit rate speech coding
Author
P. Motlicek;G. Baudoin;J. Cernocky
Author_Institution
Inst. of Radioelectron., Tech. Univ. Brno, Czech Republic
Volume
2
fYear
2001
fDate
6/23/1905 12:00:00 AM
Firstpage
463
Abstract
The aim of our effort is to reach higher quality of the resulting speech coded by a very low bit rate (VLBR) segmental coder. The basic units are found automatically in a training database using temporal decomposition and vector quantization. They are modeled by HMM. Then two methods of re-segmentation are used in order to find new longer units. In the first approach borders are set to the centers of previous units. In the second, borders are fixed to the centers of middle HMM states of previous units. The number of frames in new units is conditioned to be bigger than a fixed constant. Hence, new units can consist of several previous segments. Decreasing transition noise of the resultant speech was obtained using these techniques.
Keywords
"Bit rate","Speech coding","Databases","Speech synthesis","Decoding","Speech recognition","Automatic speech recognition","Hidden Markov models","Speech enhancement","Vocoders"
Publisher
ieee
Conference_Titel
EUROCON´2001, Trends in Communications, International Conference on.
Print_ISBN
0-7803-6490-2
Type
conf
DOI
10.1109/EURCON.2001.938162
Filename
938162
Link To Document