DocumentCode
1856657
Title
Language independent gender identification
Author
Parris, Eluned S. ; Carey, Michael J.
Author_Institution
Ensigma Ltd., Chepstow, UK
Volume
2
fYear
1996
fDate
7-10 May 1996
Firstpage
685
Abstract
This paper describes a novel technique specifically developed for gender identification which combines acoustic analysis and pitch. Two sets of hidden Markov models, male and female, are matched to the speech using the Viterbi algorithm and the most likely sequence of models with corresponding likelihood scores are produced. Linear discriminant analysis is used to normalise the models and reduce bias towards a particular gender. An enhanced version of the pitch estimation algorithm used for IMBE speech coding is used to give an average pitch estimate for the speaker. The information provided by the acoustic analysis and pitch estimation are combined using a linear classifier to identify the gender of the speech. The system was tested on three British English databases giving less than 1% identification error rate with two seconds of speech. Further tests without optimisation on eleven languages of the OGI database gave error rates less than 5.2% and an average of 2.0%
Keywords
acoustic signal processing; hidden Markov models; maximum likelihood estimation; natural languages; speaker recognition; speech coding; speech processing; British English databases; Hidden Markov models; IMBE speech coding; OGI database; Viterbi algorithm; acoustic analysis; average pitch estimate; identification error rate; language independent gender identification; linear classifier; linear discriminant analysis; pitch estimation algorithm; Acoustic testing; Databases; Error analysis; Hidden Markov models; Information analysis; Linear discriminant analysis; Loudspeakers; Speech analysis; Speech coding; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.543213
Filename
543213
Link To Document