Title :
Age Approximation from Speech Using Gaussian Mixture Models
Author :
Mittal, Trisha ; Barthwal, Anurag ; Koolagudi, Shashidhar G.
Author_Institution :
Graphic Era Univ., Dehradun, India
Abstract :
In this work, spectral features are extracted from speech to perform speaker classification based on their age. Mel frequency cepstral coefficients (MFCCs) are explored as features. Gaussian mixture models (GMMs) are proposed as classifiers. The age groups considered in this study are 1-10, 11-20, 21-30, 31-40 and 41-50. The age group database used in this work is recorded in Hindi from speakers of different ages and dialects containing five Hindi text prompts. The text prompts are constructed using textually neutral Hindi words recorded in neutral emotion which are used for characterizing the age group, for both male and female. Average age recognition performance, in the case of multiple speaker database is observed to be around 92.0%.
Keywords :
Gaussian processes; cepstral analysis; feature extraction; mixture models; natural language processing; signal classification; speaker recognition; speech processing; GMM; Gaussian mixture models; Hindi text prompts; MFCC; Mel frequency cepstral coefficients; age 1 yr to 10 yr; age 11 yr to 20 yr; age 21 yr to 30 yr; age 31 yr to 40 yr; age 41 yr to 50 yr; age approximation; age group; average age recognition performance; neutral emotion; speaker classification; spectral feature extraction; textually neutral Hindi words; Approximation methods; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech; Vectors; Age approximation; GMM; Mel frequency cepstral coefficients; Spectral features; Text dependent age approximation; Text independent age approximation;
Conference_Titel :
Advanced Computing, Networking and Security (ADCONS), 2013 2nd International Conference on
Conference_Location :
Mangalore
DOI :
10.1109/ADCONS.2013.43