• DocumentCode
    460432
  • Title

    A Robust Digit Recognition Model Research with Low Signal Noise Ratio

  • Author

    He, SuNing ; Yu, Juebang

  • Author_Institution
    Southwest Electron. & Telecommun. Tech. Inst., Chengdu
  • Volume
    1
  • fYear
    2006
  • fDate
    38869
  • Firstpage
    553
  • Lastpage
    558
  • Abstract
    In this article, we absorbed the basic idea of DP in normalizing the temporal evolving characteristics of speech observation vector sequences, and designed a low SNR English digit recognition model based on whole MFCC sequences. Such model not only carries effectively the whole information but also normalizes the digit feature sequences by adjusting dynamically the frame length of each spoken digit, which can adapt the different speaking rates. By using frame time filter and multi sub-spectrum filter, noise interference to the digit can be partly reduced and the model´s adaptability to the backgrounds be improved. The experiment shows that the English digit error rate has reduced 30% at least, and 68.71% for best result after adding the new processing modules. In addition, the model is simple in structure and low in computation, and also easy to realize real time processing
  • Keywords
    filtering theory; natural languages; sequences; speaker recognition; DP; English digit recognition model; MFCC sequences; SNR; digit error rate; digit feature sequences; frame time filter; low signal noise ratio; mel-frequency cepstral coefficient; multisubspectrum filter; noise interference; speech observation vector sequences; spoken digit; Character recognition; Ear; Filters; Hidden Markov models; Mel frequency cepstral coefficient; Noise reduction; Noise robustness; Signal to noise ratio; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Circuits and Systems Proceedings, 2006 International Conference on
  • Conference_Location
    Guilin
  • Print_ISBN
    0-7803-9584-0
  • Electronic_ISBN
    0-7803-9585-9
  • Type

    conf

  • DOI
    10.1109/ICCCAS.2006.284697
  • Filename
    4063941