DocumentCode
1689387
Title
Investigation into the use of deep neural networks for LVCSR of Czech
Author
Mateju, Lukas ; Cerva, Petr ; Zdansky, Jindrich
Author_Institution
Fac. of Mechatron., Inf. & Interdiscipl. Studies, Tech. Univ. of Liberec, Liberec, Czech Republic
fYear
2015
Firstpage
1
Lastpage
4
Abstract
This paper deals with utilization of deep neural networks (DNNs) for speech recognition. The main goal is to find out the best strategy for training and utilization of these models within an acoustic modeling module of a large vocabulary continuous speech recognition (LVCSR) system of Czech language. For this purpose, various DNNs are trained a) using several training strategies, b) with different inner structure and c) using various kinds of features. Experimental evaluation is then performed on a large dataset including broadcast recordings, recordings of lectures, dictates of judgments and set of nonlinearly distorted utterances. The resulting recipe for training of DNNs for our LVCSR system employs a) ReLU activation function with hidden layer width of 1024 neurons and b) filter-bank based features.
Keywords
channel bank filters; feature extraction; neural nets; speech recognition; transfer functions; vocabulary; Czech language; DNN; LVCSR system; ReLU activation function; acoustic modeling module; broadcast recordings; deep neural networks; filter-bank based features; large vocabulary continuous speech recognition system; Biological neural networks; Feature extraction; Hidden Markov models; Neurons; Speech; Speech recognition; Training; acoustic modeling; deep neural networks; feature extraction; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM), 2015 IEEE International Workshop of
Conference_Location
Liberec
Print_ISBN
978-1-4799-6970-8
Type
conf
DOI
10.1109/ECMSM.2015.7208708
Filename
7208708
Link To Document