Hidden Markov models applied to very low bit rate speech coding

Author

Farges, Eric P. ; Clements, Mark A.

Author_Institution

Georgia Institute of Technology, Atlanta, Georgia, U.S.A.

Volume

fYear

1986

fDate

31503

Firstpage

433

Lastpage

436

Abstract

A new type of very low bit rate speech coder based on a global Discrete Hidden Markov Model (DHMM) of continuous speech for a single speaker is presented here. Several important issues of the training, coding, and decoding procedures are discussed for a 64-state, 1024-observation model. Such a framework is useful in reducing the redundancy in a 10-bit classical Vector Quantizer (VQ), and could lead to a DHMM coder with a bit rate comparable to that of a Segment Vocoder (SV) or a Matrix Quantizer (MQ). This is achieved not only by modelling the long term non-stationarity and the inter-frame time dependencies of the speech, but also by efficiently representing a different kind of information such as vocal tract structure and linguistic patterns.

Keywords

Automatic speech recognition; Bit rate; Decoding; Hidden Markov models; Human voice; Linear predictive coding; Random variables; Speech coding; Stochastic processes; Vocoders;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.

Type

conf

DOI

10.1109/ICASSP.1986.1169052

Filename

1169052

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3006208