• DocumentCode
    3661150
  • Title

    Improving deep neural networks using softplus units

  • Author

    Hao Zheng; Zhanlei Yang; Wenju Liu; Jizhong Liang; Yanpeng Li

  • Author_Institution
    National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Recently, DNNs have achieved great improvement for acoustic modeling in speech recognition tasks. However, it is difficult to train the models well when the depth grows. One main reason is that when training DNNs with traditional sigmoid units, the derivatives damp sharply while back-propagating between layers, which restrict the depth of model especially with insufficient training data. To deal with this problem, some unbounded activation functions have been proposed to preserve sufficient gradients, including ReLU and softplus. Compared with ReLU, the smoothing and nonzero properties of the in gradient makes softplus-based DNNs perform better in both stabilization and performance. However, softplus-based DNNs have been rarely exploited for the phoneme recognition task. In this paper, we explore the use of softplus units for DNNs in acoustic modeling for context-independent phoneme recognition tasks. The revised RBM pre-training and dropout strategy are also applied to improve the performance of softplus units. Experiments show that, the DNNs with softplus units get significantly performance improvement and uses less epochs to get convergence compared to the DNNs trained with standard sigmoid units and ReLUs.
  • Keywords
    Speech
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2015 International Joint Conference on
  • Electronic_ISBN
    2161-4407
  • Type

    conf

  • DOI
    10.1109/IJCNN.2015.7280459
  • Filename
    7280459