• DocumentCode
    618475
  • Title

    Text independent classification of normal and pathological voices using MFCCs and GMM-UBM

  • Author

    Vikram, C.M. ; Umarani, K.

  • Author_Institution
    JT Dept., SJCE, Mysore, India
  • fYear
    2013
  • fDate
    11-12 April 2013
  • Firstpage
    1215
  • Lastpage
    1220
  • Abstract
    This paper proposes a text independent method for the classification of normal and pathological voices. If the classifier is text dependent i.e classifier is trained for a particular phoneme, then it may difficult for the patient to pronounce the particular phoneme. To overcome this difficulty, a text independent classification method is proposed, which uses Mel-Frequency Cepstral Coefficients (MFCCs) and Gaussian Mixture Model-Universal Background Model (GMM-UBM). The GMM-UBM model is trained with phonemes /a/, /e/ ,/u/ of normal and pathological voices. Hence the classifier is efficient to detect voices of different phonemes and classifies them into normal and pathological with a maximum accuracy of 85.63% . It has been noticed that, accuracy of classification can be improved by increasing the number of MFCCs, i.e the classification accuracy is 72.45% for 12 MFCCs , where as 85.63% for 24 MFCCs.
  • Keywords
    Gaussian processes; pattern classification; speech processing; text detection; GMM UBM model; Gaussian mixture model universal background model; MFCC; classifier; mel frequency cepstral coefficients; normal voice; pathological voice; phoneme; text independent classification; voice detection; Accuracy; Adaptation models; Communications technology; Conferences; Feature extraction; Pathology; Vectors; Gaussian mixture model (GMM); Mel-frequency cepstral coefficients(MFCCs); Universal background model(UBM); pathological voice detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information & Communication Technologies (ICT), 2013 IEEE Conference on
  • Conference_Location
    JeJu Island
  • Print_ISBN
    978-1-4673-5759-3
  • Type

    conf

  • DOI
    10.1109/CICT.2013.6558286
  • Filename
    6558286