DocumentCode
2773358
Title
Multi-variability speech database for robust speaker recognition
Author
Haris, B.C. ; Pradhan, G. ; Misra, A. ; Shukla, S. ; Sinha, R. ; Prasanna, S.R.M.
Author_Institution
Dept. of Electron. & Commun. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear
2011
fDate
28-30 Jan. 2011
Firstpage
1
Lastpage
5
Abstract
In this paper, we present our initial study with the recently collected speech database for developing robust speaker recognition systems in Indian context. The database contains the speech data collected across different sensors, languages, speaking styles, and environments, from 200 speakers. The speech data is collected across five different sensors in parallel, in English and multiple Indian languages, in reading and conversational speaking styles, and in office and uncontrolled environments such as laboratories, hostel rooms and corridors etc. The collected database is evaluated using adapted Gaussian mixture model based speaker verification system following the NIST 2003 speaker recognition evaluation protocol and gives comparable performance to those obtained using NIST data sets. Our initial study exploring the impact of mismatch in training and test conditions with collected data finds that the mismatch in sensor, speaking style, and environment result in significant degradation in performance compared to the matched case whereas for language mismatch case the degradation is found to be relatively smaller.
Keywords
Gaussian processes; database management systems; speaker recognition; English languages; Gaussian mixture model; NIST data sets; multiple Indian languages; multivariability speech database; robust speaker recognition; speaker verification system; speaking styles; Databases; Mobile handsets; NIST; Sensors; Speaker recognition; Speech; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications (NCC), 2011 National Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-61284-090-1
Type
conf
DOI
10.1109/NCC.2011.5734775
Filename
5734775
Link To Document