Graphical models for the recognition of Arabic continuous speech based triphones modeling

Author

Elyes Zarrouk;Yassine Benayed;Faïez Gargouri

Author_Institution

MIRACL, Multimedia InfoRmation system and Advanced Computing Laboratory, Higher Institute of Computer Science and Multimedia, ISIMS, University of Sfax, Tunisia

fYear

2015

fDate

6/1/2015 12:00:00 AM

Firstpage

1

Lastpage

6

Abstract

Recent developments in inference and learning in Dynamic Bayesian networks (DBN) allow their use in real-world applications is the first successful application of DBNs to a large scale speech recognition problem. Even if their progress is huge, those models lack a discriminatory ability especially on speech recognition such as the Hidden Markov models (HMM). In this paper, we present the performance of the hybridization of Supports Vectors machine with Dynamic Bayesian networks for Arabic triphones-based continuous speech. In fact, SVM are based on a structural risk minimization (SRM) where the aim is to set up a classifier that minimizes a bound on the expected risk, rather than the empirical risk. The best results are obtained with the proposed system SVM/DBN when we achieve 78.87% as the best recognition rate of a tested speaker. The speech recognizer was evaluated with ARABIC_DB corpus and performs at 8.04% WER as compared to 10.08% with triphones mixture-Gaussian DBN system, 10.54% with hybrid model SVM/HMM and 12.03% with HMM standards.

Keywords

"Hidden Markov models","Support vector machines","Speech recognition","Speech","Bayes methods","Acoustics","Computational modeling"

Publisher

ieee

Conference_Titel

Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on

Type

conf

DOI

10.1109/SNPD.2015.7176269

Filename

7176269