• DocumentCode
    3654813
  • Title

    Graphical models for the recognition of Arabic continuous speech based triphones modeling

  • Author

    Elyes Zarrouk;Yassine Benayed;Faïez Gargouri

  • Author_Institution
    MIRACL, Multimedia InfoRmation system and Advanced Computing Laboratory, Higher Institute of Computer Science and Multimedia, ISIMS, University of Sfax, Tunisia
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Recent developments in inference and learning in Dynamic Bayesian networks (DBN) allow their use in real-world applications is the first successful application of DBNs to a large scale speech recognition problem. Even if their progress is huge, those models lack a discriminatory ability especially on speech recognition such as the Hidden Markov models (HMM). In this paper, we present the performance of the hybridization of Supports Vectors machine with Dynamic Bayesian networks for Arabic triphones-based continuous speech. In fact, SVM are based on a structural risk minimization (SRM) where the aim is to set up a classifier that minimizes a bound on the expected risk, rather than the empirical risk. The best results are obtained with the proposed system SVM/DBN when we achieve 78.87% as the best recognition rate of a tested speaker. The speech recognizer was evaluated with ARABIC_DB corpus and performs at 8.04% WER as compared to 10.08% with triphones mixture-Gaussian DBN system, 10.54% with hybrid model SVM/HMM and 12.03% with HMM standards.
  • Keywords
    "Hidden Markov models","Support vector machines","Speech recognition","Speech","Bayes methods","Acoustics","Computational modeling"
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
  • Type

    conf

  • DOI
    10.1109/SNPD.2015.7176269
  • Filename
    7176269