• DocumentCode
    2018886
  • Title

    Ergodic hidden control neural network for modelling of the speech process

  • Author

    Falaschi, A. ; Baldassarra, A. ; Martinelli, G. ; Ricotti, L. Prina

  • Author_Institution
    Inst. of Elettron., Perugia Univ., Italy
  • Volume
    1
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    605
  • Abstract
    The authors deal with the extension of the hidden control neural network (HCNN) architecture to the ergodic case, i.e., if all the control state sequences are allowed. This scheme gives a deeper understanding of the modeling capabilities offered by the HCNN formalism. In fact, the control input binary digits status can be considered as the presence/absence of a posteriori defined binary phonetic features, forcing the network to produce a low prediction error on pairs of speech frames. Major improvements of the technique have been found after normalization of the output vector components by the prediction error standard deviations. Other improvements arise from the extension to a second order prediction, and an appropriate pruning of the allowed control states transition matrix. Rewiring of the original architecture as a recurrent network allows for the resynthesis of smooth spectral trajectories, once the recurrent network is fed by the optimal control sequence found by dynamic programming when matching real speech against the HCNN control input.<>
  • Keywords
    dynamic programming; filtering and prediction theory; modelling; optimal control; recurrent neural nets; speech analysis and processing; binary phonetic features; control state sequences; dynamic programming; ergodic; hidden control neural network; normalization; prediction error standard deviations; recurrent network; resynthesis of smooth spectral trajectories; speech process modelling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319191
  • Filename
    319191