• DocumentCode
    2338185
  • Title

    Digits speech recognition based on ICA and geometrical learning

  • Author

    Feng, Hao ; Cao, Wen-Ming ; Wang, Shou-Jue

  • Author_Institution
    Jiaxing Univ., China
  • Volume
    8
  • fYear
    2005
  • fDate
    18-21 Aug. 2005
  • Firstpage
    4829
  • Abstract
    We investigate the use of independent component analysis (ICA) for speech feature extraction in digits speech recognition systems. We observe that this may be true for a recognition tasks based on geometrical learning with little training data. In contrast to image processing, phase information is not essential for digits speech recognition. We therefore propose a new scheme that shows how the phase sensitivity can be removed by using an analytical description of the ICA-adapted basis functions via the Hilbert transform. Furthermore, since the basis functions are not shift invariant, we extend the method to include a frequency-based ICA stage that removes redundant time shift information. The digits speech recognition results show promising accuracy, Experiments show method based on ICA and geometrical learning outperforms HMM in different number of train samples.
  • Keywords
    Hilbert transforms; feature extraction; geometry; hidden Markov models; independent component analysis; learning (artificial intelligence); speech recognition; Hilbert transform; digits speech recognition; geometrical learning; independent component analysis; speech feature extraction; Discrete cosine transforms; Feature extraction; Hidden Markov models; Independent component analysis; Intelligent systems; Mel frequency cepstral coefficient; Principal component analysis; Speech analysis; Speech recognition; Training data; Digits Speech Recognition; HMM; ICA;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
  • Conference_Location
    Guangzhou, China
  • Print_ISBN
    0-7803-9091-1
  • Type

    conf

  • DOI
    10.1109/ICMLC.2005.1527793
  • Filename
    1527793