مرکز منطقه ای اطلاع رساني علوم و فناوري - Bayesian vocal tract model estimates of nasal stops for speaker verification

DocumentCode :

178045

Title :

Bayesian vocal tract model estimates of nasal stops for speaker verification

Author :

Enzinger, Ewald ; Kasess, Christian H.

Author_Institution :

Acoust. Res. Inst., Vienna, Austria

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

1685

Lastpage :

1689

Abstract :

In this paper we report on speaker verification experiments using branched vocal tract model estimates of alveolar nasal (/n/) stops. While the discriminatory potential of nasal acoustics has long been established, their acoustic properties have so far mostly been characterized using spectral features. Here, we used a Bayesian estimation technique to obtain reflection coefficients of a branched-tube model of the combined nasal and oral tract. Parameters were then modeled using probabilistic linear discriminant analysis to calculate likelihood ratios for speaker verification trials. Performance was assessed on normal and high vocal effort speech using high-quality and mobile-telephone-transmitted recordings taken from the German-language Pool2010 corpus. Results are compared with those of systems based on mel-frequency cepstral coefficients (MFCC). Vocal tract parameter based systems outperform MFCC based systems in matched conditions, but lack robustness under mismatch, while being readily interpretable with respect to a physical speech production model.

Keywords :

speaker recognition; Bayesian estimation technique; Bayesian vocal tract model estimates; German-language Pool2010 corpus; MFCC based systems; alveolar nasal; branched vocal tract model; branched-tube model; discriminatory potential; mel-frequency cepstral coefficients; mobile-telephone-transmitted recordings; nasal acoustics; nasal stops; physical speech production model; probabilistic linear discriminant analysis; reflection coefficients; speaker verification; speaker verification trials; spectral features; vocal tract parameter based systems; Bayes methods; Electron tubes; Estimation; Mel frequency cepstral coefficient; Speaker recognition; Speech; Bayesian estimation; Nasals; likelihood ratio; speaker verification; vocal tract modeling;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853885

Filename :

6853885

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178045