DocumentCode :
3129550
Title :
Comparison of linear prediction cepstrum coefficients and mel-frequency cepstrum coefficients for language identification
Author :
Wong, Eddie ; Sridharan, Sridha
Author_Institution :
Speech Res. Lab., Queensland Univ. of Technol., Brisbane, Qld., Australia
fYear :
2001
fDate :
2001
Firstpage :
95
Lastpage :
98
Abstract :
The speech parametrization methods: linear prediction cepstrum coefficients and mel-frequency cepstrum coefficients were compared with regard to language identification accuracy in a Gaussian mixture model based language identification system. Ten different languages were used to test against a set of ten second test files. The 12th order linear prediction cepstrum coefficients with delta and accelerate coefficients resulted in the best accuracy of 60.0 percent. This has shown that information obtained from linear prediction analysis has increased the ability of discriminating different languages. It also shows that language identification performance may be increased by encompassing temporal information by including delta and acceleration features. Besides, the performance of our test system has proved the feasibility of the modeling language by a single Gaussian Mixture Model instead of using complex system such as phonetic recogniser followed by language modelling or large vocabulary continuous speech recognition system
Keywords :
Gaussian processes; cepstral analysis; natural languages; speech processing; Gaussian mixture model; accelerate coefficients; language identification; language identification performance; language modelling; large vocabulary continuous speech recognition system; linear prediction analysis; linear prediction cepstrum coefficients; mel-frequency cepstrum coefficients; modeling language; phonetic recogniser; single Gaussian Mixture Model; speech parametrization methods; temporal information; test files; test system; Acceleration; Band pass filters; Cepstrum; Linear predictive coding; Mel frequency cepstral coefficient; Natural languages; Robustness; Speech recognition; System testing; Systems engineering and theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Multimedia, Video and Speech Processing, 2001. Proceedings of 2001 International Symposium on
Conference_Location :
Hong Kong
Print_ISBN :
962-85766-2-3
Type :
conf
DOI :
10.1109/ISIMP.2001.925340
Filename :
925340
Link To Document :
بازگشت