DocumentCode
3713041
Title
An open/free database and Benchmark for Uyghur speaker recognition
Author
Askar Rozi; Dong Wang; Zhiyong Zhang; Thomas Fang Zheng
Author_Institution
Center for Speech and Language Technologies, Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, China
fYear
2015
Firstpage
81
Lastpage
85
Abstract
Little research has been conducted on Uyghur speaker recognition. In the few existing studies, researchers typically collect small speech databases and publish results based on their own private data. This "closed-door evaluation" makes most of the published results questionable. This paper publishes an open and free speech database, THUYG-20 SRE, and a benchmark for Uyghur speaker recognition. The database is based on the THUYG-20 speech corpus we recently released, and the benchmark involves recognition tasks with various training/enrollment/test conditions. We provide a complete description of the database and the benchmark, and present an i-vector baseline system constructed using the Kaldi toolkit.
Keywords
"Speech","Databases","Speaker recognition","Signal to noise ratio","Benchmark testing","Speech recognition","Training"
Publisher
ieee
Conference_Titel
Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015 International Conference
Type
conf
DOI
10.1109/ICSDA.2015.7357869
Filename
7357869
Link To Document