Regional Information Center for Science and Technology (RICEST) - Comparing Data-driven and Phonetic N-gram Systems for Text-Independent Speaker Verification

DocumentCode :

2371492

Title :

Comparing Data-driven and Phonetic N-gram Systems for Text-Independent Speaker Verification

Author :

El Hannani, Asmaa ; Petrovska-Delacrétaz, Dijana

Author_Institution :

Fribourg Univ., Fribourg

fYear :

2007

fDate :

27-29 Sept. 2007

Firstpage :

Lastpage :

Abstract :

Recognition of speaker identity based on modeling the streams produced by phonetic decoders (phonetic speaker recognition) has gained popularity during the past few years. Two major problems that arise when phone-based systems are developed are possible mismatches between the development and evaluation data and the lack of transcribed databases. Data-driven segmentation techniques provide a potential solution to these problems because they do not use transcribed data and can easily be applied to development data, minimizing the mismatches. In this paper we compare speaker recognition results using phonetic and data-driven decoders. To this end, we have compared the results obtained with two sets of speaker verification systems: the first based on data-driven units and the second on phonetic units. Results obtained on the NIST 2006 Speaker Recognition Evaluation data show that the data-driven approach is comparable to the phonetic one and that further improvements can be achieved by combining both approaches.

Keywords :

audio databases; decoding; speaker recognition; speech coding; data-driven segmentation technique; phonetic N-gram system; phonetic decoder; phonetic speaker identity recognition; text-independent speaker verification; transcribed databases; Boosting; Databases; Decoding; Employee welfare; Loudspeakers; NIST; Natural languages; Speaker recognition; Speech; Statistics;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Biometrics: Theory, Applications, and Systems, 2007. BTAS 2007. First IEEE International Conference on

Conference_Location :

Crystal City, VA

Print_ISBN :

978-1-4244-1596-0

Electronic_ISBN :

978-1-4244-1597-7

Type :

conf

DOI :

10.1109/BTAS.2007.4401945

Filename :

4401945

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2371492