DocumentCode :
618475
Title :
Text independent classification of normal and pathological voices using MFCCs and GMM-UBM
Author :
Vikram, C.M. ; Umarani, K.
Author_Institution :
JT Dept., SJCE, Mysore, India
fYear :
2013
fDate :
11-12 April 2013
Firstpage :
1215
Lastpage :
1220
Abstract :
This paper proposes a text independent method for the classification of normal and pathological voices. If the classifier is text dependent i.e classifier is trained for a particular phoneme, then it may difficult for the patient to pronounce the particular phoneme. To overcome this difficulty, a text independent classification method is proposed, which uses Mel-Frequency Cepstral Coefficients (MFCCs) and Gaussian Mixture Model-Universal Background Model (GMM-UBM). The GMM-UBM model is trained with phonemes /a/, /e/ ,/u/ of normal and pathological voices. Hence the classifier is efficient to detect voices of different phonemes and classifies them into normal and pathological with a maximum accuracy of 85.63% . It has been noticed that, accuracy of classification can be improved by increasing the number of MFCCs, i.e the classification accuracy is 72.45% for 12 MFCCs , where as 85.63% for 24 MFCCs.
Keywords :
Gaussian processes; pattern classification; speech processing; text detection; GMM UBM model; Gaussian mixture model universal background model; MFCC; classifier; mel frequency cepstral coefficients; normal voice; pathological voice; phoneme; text independent classification; voice detection; Accuracy; Adaptation models; Communications technology; Conferences; Feature extraction; Pathology; Vectors; Gaussian mixture model (GMM); Mel-frequency cepstral coefficients(MFCCs); Universal background model(UBM); pathological voice detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location :
JeJu Island
Print_ISBN :
978-1-4673-5759-3
Type :
conf
DOI :
10.1109/CICT.2013.6558286
Filename :
6558286
Link To Document :
بازگشت