Title :
Comparing Data-driven and Phonetic N-gram Systems for Text-Independent Speaker Verification
Author :
El Hannani, Asmaa ; Petrovska-Delacrétaz, Dijana
Author_Institution :
Fribourg Univ., Fribourg
Abstract :
Recognition of speaker identity based on modeling the streams produced by phonetic decoders(phonetic speaker recognition) has gained popularity during the past few years. Two of the major problems that arise when phone based systems are being developed are the possible mismatches between the development and evaluation data and the lack of transcribed databases. Data-driven segmentation techniques provide a potential solution to these problems because they do not use transcribed data and can easily be applied on development data minimizing the mismatches. In this paper we compare speaker recognition results using phonetic and data-driven decoders. To this end, we have compared the results obtained with two sets of speaker verification systems; the first one based on data-driven units and the second one on phonetic units. Results obtained on the NIST 2006 Speaker Recognition Evaluation data show that the data-driven approach is comparable to the phonetic one and that further improvements can be achieved by combining both approaches.
Keywords :
audio databases; decoding; speaker recognition; speech coding; data-driven segmentation technique; phonetic N-gram system; phonetic decoder; phonetic speaker identity recognition; text-independent speaker verification; transcribed databases; Boosting; Databases; Decoding; Employee welfare; Loudspeakers; NIST; Natural languages; Speaker recognition; Speech; Statistics;
Conference_Titel :
Biometrics: Theory, Applications, and Systems, 2007. BTAS 2007. First IEEE International Conference on
Conference_Location :
Crystal City, VA
Print_ISBN :
978-1-4244-1596-0
Electronic_ISBN :
978-1-4244-1597-7
DOI :
10.1109/BTAS.2007.4401945