DocumentCode
618475
Title
Text independent classification of normal and pathological voices using MFCCs and GMM-UBM
Author
Vikram, C.M. ; Umarani, K.
Author_Institution
JT Dept., SJCE, Mysore, India
fYear
2013
fDate
11-12 April 2013
Firstpage
1215
Lastpage
1220
Abstract
This paper proposes a text independent method for the classification of normal and pathological voices. If the classifier is text dependent i.e classifier is trained for a particular phoneme, then it may difficult for the patient to pronounce the particular phoneme. To overcome this difficulty, a text independent classification method is proposed, which uses Mel-Frequency Cepstral Coefficients (MFCCs) and Gaussian Mixture Model-Universal Background Model (GMM-UBM). The GMM-UBM model is trained with phonemes /a/, /e/ ,/u/ of normal and pathological voices. Hence the classifier is efficient to detect voices of different phonemes and classifies them into normal and pathological with a maximum accuracy of 85.63% . It has been noticed that, accuracy of classification can be improved by increasing the number of MFCCs, i.e the classification accuracy is 72.45% for 12 MFCCs , where as 85.63% for 24 MFCCs.
Keywords
Gaussian processes; pattern classification; speech processing; text detection; GMM UBM model; Gaussian mixture model universal background model; MFCC; classifier; mel frequency cepstral coefficients; normal voice; pathological voice; phoneme; text independent classification; voice detection; Accuracy; Adaptation models; Communications technology; Conferences; Feature extraction; Pathology; Vectors; Gaussian mixture model (GMM); Mel-frequency cepstral coefficients(MFCCs); Universal background model(UBM); pathological voice detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location
JeJu Island
Print_ISBN
978-1-4673-5759-3
Type
conf
DOI
10.1109/CICT.2013.6558286
Filename
6558286
Link To Document