DocumentCode
2996946
Title
Contribution of prosodic and cepstral features in improvment of a synthesized arabic speaker recognition task performance
Author
Zergat, Kawthar Yasmine ; Amrouche, Abderrahmane ; Taher, Montadar Abas ; Zainal, Nasharuddin
Author_Institution
Speech Com. & Signal Proc. Lab.-LCPTS, USTHB, Bab Ezzouar, Algeria
fYear
2013
fDate
16-17 Dec. 2013
Firstpage
70
Lastpage
73
Abstract
An emerging need for biometric Speaker Verification (SV) and Identification (SI) systems is necessary for wireless remote access security in goal to be less vulnerable against distortion due to speech coding. This paper presents results on recognition system performed on the decoded speech of the G.729 codec. To show the performance loss due to distortion in the decoding process step, we are oriented to exploit the information contained within the source and the vocal tract resources. For this, SVM-based text-independent speaker classification was designed to use the information that combines the Mel Frequency Cepstral Coefficients (MFCC) features, the Energy, and the Pitch frequency. Experiments were performed over the Arabic spoken digits, the ARADIGIT database. The obtained results show that the best performance of Speaker recognition using G.729 decoded database is obtained by the combination of the prosodic features with an EER equal to 4,22%.
Keywords
cepstral analysis; decoding; distortion; speaker recognition; speech coding; support vector machines; ARADIGIT database; Arabic spoken digits; EER criterion; G.729 decoded database; Identification systems; MFCC features; SVM-based text-independent speaker classification; biometric speaker verification; decoded speech; mel frequency cepstral coefficients features; pitch frequency; prosodic features; speech coding; synthesized Arabic speaker recognition task performance; vocal tract resources; wireless remote access security; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech coding; Speech recognition; Support vector machines; Energy; G.729; MFCC; Pitch; Speaker Recognition; Speech coding; Support Vector Machines; VoIP;
fLanguage
English
Publisher
ieee
Conference_Titel
Research and Development (SCOReD), 2013 IEEE Student Conference on
Conference_Location
Putrajaya
Type
conf
DOI
10.1109/SCOReD.2013.7002544
Filename
7002544
Link To Document