DocumentCode :
3702087
Title :
Deep learning in acoustic modeling for Automatic Speech Recognition and Understanding - an overview -
Author :
Inge Gavat;Diana Militaru
Author_Institution :
Department of Electronics, Telecommunications and Information Technology, University POLITEHNICA, Bucharest, Romania
fYear :
2015
Firstpage :
1
Lastpage :
8
Abstract :
This paper will discuss the progress made in Automatic Speech Recognition and Understanding (ASRU) by applying Deep Learning (DL) in the frame of acoustic modeling. After explaining the concept of DL, specific algorithms like Restricted Bolzmann Machine (RBM), Convolutional Neural Network (CNN), Autoencoder (AE), Deep Belief Network (DBN), will be presented and evaluated. Experiments in the academic research but also in the industry with DL structures concerning Phone Recognition and Large Vocabulary Continuous Speech Recognition (LVCSR) will be highlighted, confirming the usefulness of the DL framework in ASRU. Some considerations about the future of this new and effective machine learning paradigm will conclude the paper.
Keywords :
"Speech recognition","Speech","Hidden Markov models","Acoustics","Neural networks","Feature extraction","Machine learning"
Publisher :
ieee
Conference_Titel :
Speech Technology and Human-Computer Dialogue (SpeD), 2015 International Conference on
Type :
conf
DOI :
10.1109/SPED.2015.7343074
Filename :
7343074
Link To Document :
بازگشت