DocumentCode :
542187
Title :
Automatic estimation of one´s age with his/her speech based upon acoustic modeling techniques of speakers
Author :
Minematsu, Nobuaki ; Sekiguchi, Mariko ; Hirose, Keikichi
Author_Institution :
Graduate School of Information Science and Technology, University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, 113-0033, JAPAN
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper proposes a technique which automatically estimates speakers´ age only with acoustic, not linguistic, information of their utterances. This method is based upon speaker recognition techniques. In the current work, we firstly divided speakers of two databases, JNAS and S(senior)-JNAS, into two groups by listening tests. One group has only the speakers whose speech sounds so aged that one should take special care when he/she talks to them. The other group has the remaining speakers of the two databases. After that, each speaker group was modeled with GMM. Experiments of automatic identification of elderly speakers showed the correct identification rate of 91 %. To improve the performance, two prosodic features were considered, i.e, speech rate and local perturbation of power. Using these features, the identification rate was improved to 95%. Finally, using scores calculated by integrating GMMs with prosodic features, experiments were carried out to automatically estimate speakers´ age. The results showed high correlation between speakers´ age estimated subjectively by humans and automatically calculated score of ‘agedness’.
Keywords :
Artificial neural networks; Databases; Estimation; Neodymium; Presses; Speaker recognition; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743673
Filename :
5743673
Link To Document :
بازگشت