DocumentCode :
560634
Title :
An Improved Oscillator Method for Modeling Structured Speech
Author :
Yen, Anton Y. ; Gorodnitsky, Irina
fYear :
2011
fDate :
5-9 Dec. 2011
Firstpage :
1
Lastpage :
5
Abstract :
Modern speech coders utilize several models in sequence to encode a single frame. The typical sequence consists of a linear predictor for modeling short scale structure, adaptive codebook for mid to long scale structure, and algebraic codebook for the remainder. We develop an alternative model, termed the Complete Oscillator Model (COM), which encodes structures on multiple time scales at once. When compared to the linear predictor and adaptive codebook of the Adaptive Multi-Rate standard, we have found the COM to yield better quality models on average while using the same number of parameters. However, its performance is uneven across different types of phonemes, notably in the transitions from unvoiced to voiced speech. We discuss how the derived performance relates to the fundamental oscillator properties and provide initial schemes for how the proposed method may be used in speech coders. All experiments are performed using sentences from the TIMIT database.
Keywords :
sequences; speech codecs; speech coding; TIMIT database; adaptive codebook; adaptive multirate standard; algebraic codebook; complete oscillator model; fundamental oscillator property; improved oscillator method; linear predictor; short scale structure modelling; single frame encoding; speech coders; structured speech modelling; Adaptation models; Bit rate; Delay; Oscillators; Signal to noise ratio; Speech; Speech coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Global Telecommunications Conference (GLOBECOM 2011), 2011 IEEE
Conference_Location :
Houston, TX, USA
ISSN :
1930-529X
Print_ISBN :
978-1-4244-9266-4
Electronic_ISBN :
1930-529X
Type :
conf
DOI :
10.1109/GLOCOM.2011.6134458
Filename :
6134458
Link To Document :
بازگشت