DocumentCode :
2362781
Title :
Automatic enlargement of speech corpus for speaker recognition
Author :
Alsulaiman, Mansour M.
Author_Institution :
Coll. of Comput. & Inf. Sci., King Saud Univ., Riyadh, Saudi Arabia
fYear :
2011
fDate :
20-23 March 2011
Firstpage :
302
Lastpage :
306
Abstract :
This research deals with the problem of recognition when only a few samples are available for training of the system. To avoid the low recognition rate caused by such type of speech corpus, automatic techniques for the enlargement of speech corpus are proposed in this paper. These techniques are: lengthening of sample by automatic segmentation, automatic noise addition at different sound-to-noise ratios (SNRs), and lengthening of reversed sample. Different combinations of samples, generated by the proposed techniques, are used to obtain the high recognition rate. These techniques have shown promising result.
Keywords :
Gaussian processes; cepstral analysis; hidden Markov models; speaker recognition; Gaussian mixture model; automatic noise addition; automatic segmentation; hidden Markov model; mel-frequency cepstral coefficients; sound-to-noise ratios; speaker recognition; speech corpus automatic enlargement; Databases; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Speaker recognition; Speech; Training; Automatic segmentation; Database enlargement; HMM; MFCC; Samples generation; Speaker Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers & Informatics (ISCI), 2011 IEEE Symposium on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-61284-689-7
Type :
conf
DOI :
10.1109/ISCI.2011.5958931
Filename :
5958931
Link To Document :
بازگشت