• DocumentCode
    1689387
  • Title

    Investigation into the use of deep neural networks for LVCSR of Czech

  • Author

    Mateju, Lukas ; Cerva, Petr ; Zdansky, Jindrich

  • Author_Institution
    Fac. of Mechatron., Inf. & Interdiscipl. Studies, Tech. Univ. of Liberec, Liberec, Czech Republic
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper deals with utilization of deep neural networks (DNNs) for speech recognition. The main goal is to find out the best strategy for training and utilization of these models within an acoustic modeling module of a large vocabulary continuous speech recognition (LVCSR) system of Czech language. For this purpose, various DNNs are trained a) using several training strategies, b) with different inner structure and c) using various kinds of features. Experimental evaluation is then performed on a large dataset including broadcast recordings, recordings of lectures, dictates of judgments and set of nonlinearly distorted utterances. The resulting recipe for training of DNNs for our LVCSR system employs a) ReLU activation function with hidden layer width of 1024 neurons and b) filter-bank based features.
  • Keywords
    channel bank filters; feature extraction; neural nets; speech recognition; transfer functions; vocabulary; Czech language; DNN; LVCSR system; ReLU activation function; acoustic modeling module; broadcast recordings; deep neural networks; filter-bank based features; large vocabulary continuous speech recognition system; Biological neural networks; Feature extraction; Hidden Markov models; Neurons; Speech; Speech recognition; Training; acoustic modeling; deep neural networks; feature extraction; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Control, Measurement, Signals and their Application to Mechatronics (ECMSM), 2015 IEEE International Workshop of
  • Conference_Location
    Liberec
  • Print_ISBN
    978-1-4799-6970-8
  • Type

    conf

  • DOI
    10.1109/ECMSM.2015.7208708
  • Filename
    7208708