DocumentCode
394356
Title
Corpus based very low bit rate speech coding
Author
Baudoin, G. ; El Chami, F.
Author_Institution
Telecommunications systems laboratory, ESIEE, France
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
This paper presents a new very low bit rate segmental speech coding approach that applies speech recognition in the coder and corpus-based speech synthesis in the decoder. The system searches a large corpus of speech signals for a speech segment similar to the segment to be coded. The elementary acoustical units for recognition and synthesis are determined automatically by an unsupervised training method, as an alternative to phoneme-derived linguistic units. Very good results are obtained at an average bit rate of 400 bits/second with a corpus of about 1 hour of speech. We present an efficient method for finding the best synthesis unit that takes into account how well successive segments concatenate. The proposed organization of the speech segments in the corpus allows a very efficient search for the best unit.
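The abstract does not give the details of the search, so the following is only an illustrative sketch of one common way to pick synthesis units while accounting for concatenation quality: a dynamic-programming (Viterbi-style) pass over per-segment candidates. Everything here is an assumption made for illustration and is not taken from the paper: the function name `select_units`, the `feat`/`start`/`end` fields, the Euclidean distances used as target and join costs, and the `join_weight` parameter.

```python
import math


def dist(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_units(targets, candidates, join_weight=1.0):
    """Pick one corpus candidate per segment, minimizing target + join cost.

    targets:    one feature vector per segment to be coded (hypothetical).
    candidates: for each segment, a list of dicts with hypothetical keys
                'feat' (overall features), 'start' and 'end' (boundary frames).
    Returns the list of chosen candidate indices, one per segment.
    """
    n = len(targets)
    # cost[i][j]: best cumulative cost of a path ending in candidate j at segment i
    cost = [[dist(targets[0], c["feat"]) for c in candidates[0]]]
    back = [[None] * len(candidates[0])]

    for i in range(1, n):
        row_cost, row_back = [], []
        for c in candidates[i]:
            target_cost = dist(targets[i], c["feat"])
            # Best predecessor: cumulative cost plus weighted join (concatenation) cost
            best_prev, best_val = min(
                (
                    (k, cost[i - 1][k] + join_weight * dist(p["end"], c["start"]))
                    for k, p in enumerate(candidates[i - 1])
                ),
                key=lambda kv: kv[1],
            )
            row_cost.append(best_val + target_cost)
            row_back.append(best_prev)
        cost.append(row_cost)
        back.append(row_back)

    # Backtrack from the cheapest final candidate
    j = min(range(len(cost[-1])), key=lambda k: cost[-1][k])
    path = [j]
    for i in range(n - 1, 0, -1):
        j = back[i][j]
        path.append(j)
    path.reverse()
    return path
```

With `join_weight=0` the sketch reduces to choosing the nearest candidate for each segment independently; raising the weight trades some per-segment similarity for smoother transitions between successive units.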
Keywords
search problems; speech coding; speech recognition; speech synthesis; vocoders; 400 bit/s; corpus based speech synthesis; elementary acoustical units; searching; segmental speech coding; speech coder; speech recognition; speech segment; successive segment concatenation; synthesis unit; unsupervised training method; very low bit rate speech coding; Bit rate; Decoding; Hidden Markov models; Signal synthesis; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Synthesizers; Vocoders;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198900
Filename
1198900