DocumentCode :
1092912
Title :
Speaker Verification via High-Level Feature Based Phonetic-Class Pronunciation Modeling
Author :
Zhang, Shi-Xiong ; Mak, Man-Wai ; Meng, Helen M.
Author_Institution :
Hong Kong Polytech. Univ., Kowloon
Volume :
56
Issue :
9
fYear :
2007
Firstpage :
1189
Lastpage :
1198
Abstract :
It has recently been shown that the pronunciation characteristics of speakers can be represented by articulatory feature- based conditional pronunciation models (AFCPMs). However, the pronunciation models are phoneme dependent, which may lead to speaker models with low discriminative power when the amount of enrollment data is limited. This paper proposes mitigating this problem by grouping similar phonemes into phonetic classes and representing background and speaker models as phonetic-class dependent density functions. Phonemes are grouped by 1) vector quantizing the discrete densities in the phoneme-dependent universal background models, 2) using the phone properties specified in the classical phoneme tree, or 3) combining vector quantization and phone properties. Evaluations based on the 2000 NIST SRE show that this phonetic-class approach effectively alleviates the data spareness problem encountered in conventional AFCPM, which results in better performance when fused with acoustic features.
Keywords :
speaker recognition; speech processing; vector quantisation; articulatory feature-based conditional pronunciation models; phoneme tree; phoneme-dependent universal background models; phonetic-class pronunciation modeling; speaker verification; vector quantization; Data mining; Density functional theory; Displacement measurement; Error analysis; Loudspeakers; NIST; Speaker recognition; Speech; Vector quantization; Velocity measurement; NIST speaker recognition evaluation; Speaker verification; articulatory features; phonetic classes; pronunciation modeling;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/TC.2007.1081
Filename :
4288086
Link To Document :
بازگشت