Regional Information Center for Science and Technology - Combining cross-stream and time dimensions in phonetic speaker recognition

DocumentCode :

3495211

Title :

Combining cross-stream and time dimensions in phonetic speaker recognition

Author :

Jin, Qin ; Navratil, Jiri ; Reynolds, Douglas A. ; Campbell, Joseph P. ; Andrews, Walter D. ; Abramson, Joy S.

Year :

2003

Date :

6-10 April 2003

Abstract :

Recent studies show that phonetic sequences from multiple languages can provide effective features for speaker recognition. So far, only pronunciation dynamics in the time dimension, i.e., n-gram modeling on each of the phone sequences, have been examined. In the JHU 2002 Summer Workshop, we explored modeling the statistical pronunciation dynamics across streams in multiple languages (cross-stream dimension) as an additional component to the time dimension. We found that bigram modeling in the cross-stream dimension achieves improved performance over that in the time dimension on the NIST 2001 Speaker Recognition Evaluation Extended Data Task. Moreover, a linear combination of information from both dimensions at the score level further improves the performance, showing that the two dimensions contain complementary information.
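The two modeling dimensions and the score-level combination named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the phone streams from two languages are already frame-aligned (equal length, one phone label per frame), and the function names, the toy phone labels, and the fusion weight are all hypothetical choices made for illustration.

```python
from collections import Counter


def time_bigrams(stream):
    """Count bigrams of consecutive phones within one stream (time dimension)."""
    return Counter(zip(stream, stream[1:]))


def cross_stream_bigrams(stream_a, stream_b):
    """Count pairs of phones that co-occur at the same frame in two streams
    (cross-stream dimension). Assumes the streams are frame-aligned."""
    if len(stream_a) != len(stream_b):
        raise ValueError("streams must be frame-aligned (equal length)")
    return Counter(zip(stream_a, stream_b))


def fuse_scores(score_time, score_cross, weight=0.5):
    """Linear score-level combination; `weight` is a hypothetical tuning value."""
    return weight * score_time + (1.0 - weight) * score_cross


# Toy phone labels, invented for illustration only.
english = ["a", "b", "a", "b"]
german = ["x", "y", "x", "y"]

time_counts = time_bigrams(english)
cross_counts = cross_stream_bigrams(english, german)
fused = fuse_scores(1.0, 3.0, weight=0.25)
```

In practice the bigram counts would feed per-speaker and background models that produce the two scores. The sketch stops at counting and fusion because the record gives no further detail.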

Keywords :

feature extraction; linguistics; natural languages; speaker recognition; speech processing; statistical analysis; time-domain analysis; NIST 2001 Speaker Recognition Evaluation Extended Data Task; NIST Evaluation Extended Data; bigram modeling; cross-stream dimension; feature extraction; multiple languages; phonetic speaker recognition; statistical pronunciation dynamics; time dimension; Acoustic noise; Data mining; Humans; Loudspeakers; NIST; Natural languages; Scanning probe microscopy; Speaker recognition; Speech recognition; Testing

Language :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-7663-3

Type :

conf

DOI :

10.1109/ICASSP.2003.1202764

Filename :

1202764

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3495211