• DocumentCode
    3661174
  • Title

    Restoring high frequency spectral envelopes using neural networks for speech bandwidth extension

  • Author

    Yu Gu;Zhen-Hua Ling

  • Author_Institution
    National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, China
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper studies the methods of speech bandwidth extension (BWE) using artificial neural networks. Several types of neural networks, including bidirectional neural networks such as restricted Boltzmann machines (RBM) and bidirectional associative memories (BAM), and feedforward deep neural networks (DNNs), are employed to restore high frequency spectral envelopes from low frequency ones. Compared with Gaussian mixture models (GMM) which are popularly adopted in the conventional statistical approaches to BWE, neural networks are better at modeling the complex and non-linear mapping relationship between high-dimensional feature vectors. Experimental results show that the neural network based BWE methods proposed in this paper can achieve better performance than the GMM-based one in both objective and subjective tests. Furthermore, the DNN-based BWE method outperforms the BAM and RBM-based ones which use shallow model structures.
  • Keywords
    "Irrigation","Yttrium","Training"
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2015 International Joint Conference on
  • Electronic_ISBN
    2161-4407
  • Type

    conf

  • DOI
    10.1109/IJCNN.2015.7280483
  • Filename
    7280483