• DocumentCode
    3760660
  • Title

    On noise robust feature for speech recognition based on power function family

  • Author

    Hilman F. Pardede

  • Author_Institution
    Research Center for Informatics, Indonesian Institute of Sciences, Jl. Cisitu No 21/154D Bandung, Indonesia 40135
  • fYear
    2015
  • Firstpage
    386
  • Lastpage
    390
  • Abstract
    In this paper, a new feature robust against environmental noise is proposed for automatic speech recognition (ASR). This feature has similar extraction process with Power-Normalized Cepstral Coeffients (PNCC) except on two aspects. First, a generalization of the log function called the q-logarithmic function is applied to replace the power function and secondly, the mean normalization process is implemented before discrete cosine transform (DCT) instead of after it as in many traditional feature extraction algorithms. The proposed feature, called Q-Log Normalized Cepstral Coeffients (QLNCC), is shown more robust compared to two traditional features: MFCC and PLP. It is also better than PNCC without adding much complexity.
  • Keywords
    "Speech","Feature extraction","Mel frequency cepstral coefficient","Speech recognition","Discrete cosine transforms","Robustness","Noise measurement"
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communication Systems (ISPACS), 2015 International Symposium on
  • Type

    conf

  • DOI
    10.1109/ISPACS.2015.7432801
  • Filename
    7432801