DocumentCode
63683
Title
GFM-Based Methods for Speaker Identification
Author
Bhardwaj, Shashank ; Srivastava, Sanjeev ; Hanmandlu, M. ; Gupta, J.R.P.
Author_Institution
Netaji Subhas Inst. of Technol., Univ. of Delhi, New Delhi, India
Volume
43
Issue
3
fYear
2013
fDate
Jun-13
Firstpage
1047
Lastpage
1058
Abstract
This paper presents three novel methods for speaker identification of which two methods utilize both the continuous density hidden Markov model (HMM) and the generalized fuzzy model (GFM), which has the advantages of both Mamdani and Takagi-Sugeno models. In the first method, the HMM is utilized for the extraction of shape-based batch feature vector that is fitted with the GFM to identify the speaker. On the other hand, the second method makes use of the Gaussian mixture model (GMM) and the GFM for the identification of speakers. Finally, the third method has been inspired by the way humans cash in on the mutual acquaintances while identifying a speaker. To see the validity of the proposed models [HMM-GFM, GMM-GFM, and HMM-GFM (fusion)] in a real-life scenario, they are tested on VoxForge speech corpus and on the subset of the 2003 National Institute of Standards and Technology evaluation data set. These models are also evaluated on the corrupted VoxForge speech corpus by mixing with different types of noisy signals at different values of signal-to-noise ratios, and their performance is found superior to that of the well-known models.
Keywords
Gaussian processes; feature extraction; fuzzy set theory; hidden Markov models; speaker recognition; GFM-based methods; GMM-GFM model; Gaussian mixture model; HMM-GFM fusion model; Mamdani model; National Institute of Standards and Technology evaluation data set; Takagi-Sugeno model; VoxForge speech corpus; continuous density hidden Markov model; generalized fuzzy model; mutual acquaintances; noisy signals; shape-based batch feature vector extraction; signal-to-noise ratios; speaker identification; Correlation; Feature extraction; Hidden Markov models; Shape; Speaker recognition; Speech; Vectors; Gaussian mixture model (GMM); generalized fuzzy model (GFM); hidden Markov model (HMM); shape-based batching (SBB); Algorithms; Artificial Intelligence; Biometry; Data Interpretation, Statistical; Fuzzy Logic; Humans; Information Storage and Retrieval; Markov Chains; Normal Distribution; Pattern Recognition, Automated; Speech Production Measurement;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TSMCB.2012.2223461
Filename
6341116
Link To Document