• DocumentCode
    2761062
  • Title

    Using Phase Space based processing to extract proper features for ASR systems

  • Author

    Shekofteh, Yasser ; Almasganj, Farshad

  • Author_Institution
    Biomed. Eng. Fac., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2010
  • fDate
    4-6 Dec. 2010
  • Firstpage
    596
  • Lastpage
    599
  • Abstract
    In this paper a feature extraction technique using Reconstructed Phase Spaces (RPS) is presented, which improves the overall performances of typical speech recognition systems. Unlike conventional feature extraction methods that use FFT based algorithm as power spectrum estimation (PSE) of speech signal, the proposed method is based on the trajectory and flow matrix of signal´s RPS. In this manner, a new representation of power spectrum is obtained using two dimensional DFT algorithm by which, we can gain modify versions of common feature extraction methods such as MFCC. We conducted some speech recognition experiments using HTK, the known HMM-based toolkit, over FARSDAT, a known Persian speech corpus. Through this modified version of feature extraction method, we gained 1.35% word error rate improvement in comparison to the baseline system which exploits the typical MFCC feature extraction method.
  • Keywords
    discrete Fourier transforms; feature extraction; speech recognition; FARSDAT Persian speech corpus; HTK toolkit; MFCC feature extraction; automatic speech recognition systems; discrete Fourier transforms; phase space based processing; power spectrum representation; reconstructed phase space technique; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech processing; Speech recognition; Trajectory; feature extraction; nonlinear dynamics; reconstructed phase space; spectral analysis; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2010 5th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4244-8183-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2010.5734094
  • Filename
    5734094