DocumentCode :
699729
Title :
A phonetic vocoder with adaptation to selectable speaker codebooks
Author :
Halaly, Israel ; Bistritz, Yuval
Author_Institution :
Dept. of Electr. Eng., Tel Aviv Univ., Tel Aviv, Israel
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
The paper presents a very low bit rate phonetic vocoder based on speech recognition and speech synthesis, with speaker adaptation using a set of speaker phoneme codebooks (SPCBs). The vocoder incorporates a well-designed set of SPCBs that is available to both the encoder and the decoder. The encoder periodically performs "analysis by synthesis": for each SPCB, it compares the incoming speech with the speech that the decoder could synthesize from the output stream of the phoneme recognizer and the quantized pitch data, and adapts the SPCB to the incoming speech by spectral warping. The index of the best-performing SPCB and its adaptation parameter are transmitted to the decoder, together with the pitch and recognizer output bit streams, to synthesize speech that better resembles the speaker. In experiments held at a typical low bit rate of phonetic vocoders (below 300 bps), the incorporated adaptation reduced the average spectral distortion and increased speaker recognizability as judged by listeners.
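The codebook selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. Several things are assumptions made only for illustration: each SPCB is modelled as a mapping from phoneme label to a log-spectral envelope, the spectral warping is a linear scaling of the frequency axis by a single factor `alpha`, the distortion is an RMS log-spectral difference, and pitch is ignored. All function and parameter names are hypothetical.

```python
import math


def warp_spectrum(envelope, alpha):
    """Resample an envelope on a frequency axis scaled by alpha.

    Assumption: linear warping with linear interpolation; the paper's
    actual warping function is not specified in the abstract.
    """
    n = len(envelope)
    warped = []
    for k in range(n):
        pos = min(max(k * alpha, 0.0), n - 1)  # clamp to the valid bin range
        lo = int(math.floor(pos))
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        warped.append((1 - frac) * envelope[lo] + frac * envelope[hi])
    return warped


def spectral_distortion(ref_db, test_db):
    """RMS difference between two log-spectral envelopes of equal length."""
    squared = sum((r - t) ** 2 for r, t in zip(ref_db, test_db))
    return math.sqrt(squared / len(ref_db))


def select_spcb(frames, phonemes, codebooks, alphas):
    """Search all (codebook, alpha) pairs for the lowest mean distortion.

    frames:    envelopes measured from the incoming speech
    phonemes:  recognizer output, one label per frame
    codebooks: list of dicts, phoneme label -> envelope (hypothetical SPCBs)
    alphas:    candidate warping factors
    Returns (codebook index, alpha, mean distortion).
    """
    best = None
    for index, codebook in enumerate(codebooks):
        for alpha in alphas:
            total = 0.0
            for frame, phoneme in zip(frames, phonemes):
                candidate = warp_spectrum(codebook[phoneme], alpha)
                total += spectral_distortion(frame, candidate)
            mean = total / len(frames)
            if best is None or mean < best[2]:
                best = (index, alpha, mean)
    return best
```

In this sketch the encoder would transmit only the returned codebook index and `alpha`, alongside the recognizer and pitch streams, which is why the side information stays small.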
Keywords :
codecs; speech processing; speech recognition; speech synthesis; vocoders; decoder; encoder; phoneme recognizer; phonetic vocoders; speaker codebooks; speaker phoneme codebooks; speech recognition; Bit rate; Hidden Markov models; Silicon; Speech; Speech coding; Speech recognition; Vocoders;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080261