Contribution of prosodic and cepstral features in improvment of a synthesized arabic speaker recognition task performance

Author

Zergat, Kawthar Yasmine ; Amrouche, Abderrahmane ; Taher, Montadar Abas ; Zainal, Nasharuddin

Author_Institution

Speech Com. & Signal Proc. Lab.-LCPTS, USTHB, Bab Ezzouar, Algeria

fYear

2013

fDate

16-17 Dec. 2013

Firstpage

70

Lastpage

73

Abstract

An emerging need for biometric Speaker Verification (SV) and Identification (SI) systems is necessary for wireless remote access security in goal to be less vulnerable against distortion due to speech coding. This paper presents results on recognition system performed on the decoded speech of the G.729 codec. To show the performance loss due to distortion in the decoding process step, we are oriented to exploit the information contained within the source and the vocal tract resources. For this, SVM-based text-independent speaker classification was designed to use the information that combines the Mel Frequency Cepstral Coefficients (MFCC) features, the Energy, and the Pitch frequency. Experiments were performed over the Arabic spoken digits, the ARADIGIT database. The obtained results show that the best performance of Speaker recognition using G.729 decoded database is obtained by the combination of the prosodic features with an EER equal to 4,22%.

Keywords

cepstral analysis; decoding; distortion; speaker recognition; speech coding; support vector machines; ARADIGIT database; Arabic spoken digits; EER criterion; G.729 decoded database; Identification systems; MFCC features; SVM-based text-independent speaker classification; biometric speaker verification; decoded speech; mel frequency cepstral coefficients features; pitch frequency; prosodic features; speech coding; synthesized Arabic speaker recognition task performance; vocal tract resources; wireless remote access security; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech coding; Speech recognition; Support vector machines; Energy; G.729; MFCC; Pitch; Speaker Recognition; Speech coding; Support Vector Machines; VoIP;

fLanguage

English

Publisher

ieee

Conference_Titel

Research and Development (SCOReD), 2013 IEEE Student Conference on

Conference_Location

Putrajaya

Type

conf

DOI

10.1109/SCOReD.2013.7002544

Filename

7002544