Diphone-like units without phonemes - option for very low bit rate speech coding

Author

P. Motlicek;G. Baudoin;J. Cernocky

Author_Institution

Inst. of Radioelectron., Tech. Univ. Brno, Czech Republic

Volume

fYear

2001

fDate

2001

Firstpage

463

Abstract

This work aims to improve the quality of speech coded by a very low bit rate (VLBR) segmental coder. The basic units are found automatically in a training database using temporal decomposition and vector quantization, and are modeled by hidden Markov models (HMMs). Two re-segmentation methods are then used to find new, longer units. In the first, unit borders are set to the centers of the previous units. In the second, borders are set to the centers of the middle HMM states of the previous units. The number of frames in each new unit is constrained to be greater than a fixed constant, so a new unit can span several previous segments. These techniques reduced the transition noise in the resulting speech.
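
The center-based re-segmentation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `unit_centers` and `merge_centers`, the frame-index representation of the borders, and the handling of the leading and trailing half-units (simply left out here) are all assumptions made for illustration. For the second method, the same merging step would presumably be fed the centers of the middle HMM states, obtained from an alignment that is not shown.

```python
from typing import List


def unit_centers(boundaries: List[int]) -> List[int]:
    """Return the center frame of each original unit.

    `boundaries` is a sorted list of frame indices; consecutive pairs
    delimit the original units (hypothetical representation).
    """
    return [(start + end) // 2 for start, end in zip(boundaries, boundaries[1:])]


def merge_centers(centers: List[int], min_frames: int) -> List[int]:
    """Pick new borders among candidate centers.

    A candidate becomes a border only if the unit it closes would be
    longer than `min_frames` frames; otherwise it is skipped, so the
    new unit grows to cover several previous segments.
    """
    if not centers:
        return []
    borders = [centers[0]]
    for center in centers[1:]:
        if center - borders[-1] > min_frames:
            borders.append(center)
    return borders


# Five original units delimited by six borders (made-up frame indices).
old_boundaries = [0, 4, 10, 13, 15, 24]
centers = unit_centers(old_boundaries)
new_borders = merge_centers(centers, min_frames=4)
```

With these made-up borders, the candidate center at frame 11 is skipped because the unit it would close is only 4 frames long, so the new unit from frame 7 to frame 14 overlaps three of the original segments.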

Keywords

"Bit rate","Speech coding","Databases","Speech synthesis","Decoding","Speech recognition","Automatic speech recognition","Hidden Markov models","Speech enhancement","Vocoders"

Publisher

IEEE

Conference_Titel

EUROCON'2001, Trends in Communications, International Conference on.

Print_ISBN

0-7803-6490-2

Type

conf

DOI

10.1109/EURCON.2001.938162

Filename

938162

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3783704