• DocumentCode
    1956460
  • Title

    A computer-aided speech disorders correction system for Arabic language

  • Author

    Seddik, Ahmed Farag ; El Adawy, Mohamed ; Shahin, Ahmed Ismail

  • Author_Institution
    Biomed. Eng. Dept., Helwan Univ., Helwan, Egypt
  • fYear
    2013
  • fDate
    11-13 Sept. 2013
  • Firstpage
    18
  • Lastpage
    21
  • Abstract
    In this paper, we propose a hybrid technique that integrates both the audio visual analysis techniques to automate speech disorders treatment for Arabic language that can be used in many developing countries. We proposed a technique that is based on audio visual analysis of the patient´s speech. For patient´s audio analysis, we used the mel frequency cepstrum coefficients (MFCC´s) and linear predictive cepstrum coefficients (LPCC´s) as the key features to classify the audio. In addition, we used visual features for the analysis of the patient´s video based on the patient´s appearance. Audio visual features techniques are combined for increasing the efficiency of our recognition system. We present a comparative evaluation for both audio and video features. We perform the features evaluation using Dynamic time warping (DTW) for speech features and histogram based approach analysis for visual feature. Finally, we used a neural network based classifier to differentiate between normal and abnormal speech. This research presents an expert system with an interactive computerized environment that has the ability to treat patients with speech disorders problems.
  • Keywords
    bioacoustics; biomedical optical imaging; classification; feature extraction; interactive systems; medical disorders; medical expert systems; medical image processing; neural nets; patient treatment; speech processing; Arabic language; DTW; LPCC feature; MFCC feature; abnormal speech differentiation; audio classification; audio visual feature technique; comparative audio feature evaluation; comparative video feature evaluation; computer-aided speech disorder correction system; dynamic time warping; expert system; histogram based method; hybrid audio-visual analysis; interactive computerized environment; linear predictive cepstrum coefficient; mel frequency cepstrum coefficient; neural network based classifier; patient appearance; patient audio analysis; patient speech audio-visual analysis; patient video analysis; recognition system efficiency; speech disorder patient treatment; speech disorder treatment automation; speech feature analysis; visual feature analysis; Feature extraction; Image color analysis; Lips; Mel frequency cepstral coefficient; Speech; Speech recognition; Visualization; Arabic speech recognition; Dynamic time wrapping; Lips segmentation; Neural Network; Speech Disorders Classification; Visual speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Biomedical Engineering (ICABME), 2013 2nd International Conference on
  • Conference_Location
    Tripoli
  • Print_ISBN
    978-1-4799-0249-1
  • Type

    conf

  • DOI
    10.1109/ICABME.2013.6648836
  • Filename
    6648836