Title :
Phone recognition with deep sparse rectifier neural networks
Author_Institution :
MTA-SZTE Res. Group on Artificial Intell., Univ. of Szeged, Szeged, Hungary
Abstract :
Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). This modification requires only minimal changes to any existing neural net implementation, but makes it more effective. In particular, we show that a deep architecture of rectifier neurons can attain the same recognition accuracy as deep neural networks, but without the need for pre-training. With 4-5 hidden layers of rectifier neurons we report 20.8% and 19.8% phone error rates on TIMIT (with CI and CD units, respectively), which are competitive with the best results on this database.
Keywords :
cepstral analysis; neural nets; speech recognition; TIMIT; deep sparse rectifier neural networks; phone error rates; phone recognition; recognition accuracy; rectifier neurons; sigmoid activation function; Biological neural networks; Error analysis; Neurons; Speech recognition; Standards; Training; Deep neural networks; phone recognition; sparse rectifier neural networks;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639016