An open/free database and Benchmark for Uyghur speaker recognition

Author

Askar Rozi; Dong Wang; Zhiyong Zhang;Thomas Fang Zheng

Author_Institution

Center for Speech and Language Technologies, Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, China

fYear

2015

Firstpage

Lastpage

Abstract

Few research has been conducted on Uyghur speaker recognition. Among the limited works, researchers usually collect small speech databases and publish results based on their own private data. This `close-door evaluation´ makes most of the publications doubtable. This paper publishes an open and free speech database THUYG-20 SRE and a benchmark for Uyghur speaker recognition. The database is based on the THUYG-20 speech corpus we recently released, and the benchmark involves recognition tasks with various training/enrollment/test conditions. We provide a complete description for the database as well as the benchmark, and present an i-vector baseline system constructed using the Kaldi toolkit.

Keywords

"Speech","Databases","Speaker recognition","Signal to noise ratio","Benchmark testing","Speech recognition","Training"

Publisher

ieee

Conference_Titel

Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015 International Conference

Type

conf

DOI

10.1109/ICSDA.2015.7357869

Filename

7357869

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3713041