• DocumentCode
    3713041
  • Title

    An open/free database and Benchmark for Uyghur speaker recognition

  • Author

    Askar Rozi; Dong Wang; Zhiyong Zhang; Thomas Fang Zheng

  • Author_Institution
    Center for Speech and Language Technologies, Division of Technical Innovation and Development, Tsinghua National Laboratory for Information Science and Technology, China
  • fYear
    2015
  • Firstpage
    81
  • Lastpage
    85
  • Abstract
    Little research has been conducted on Uyghur speaker recognition. In the limited work that exists, researchers usually collect small speech databases and publish results based on their own private data. This "closed-door evaluation" makes most of the published results questionable. This paper publishes an open and free speech database, THUYG-20 SRE, and a benchmark for Uyghur speaker recognition. The database is based on the THUYG-20 speech corpus we recently released, and the benchmark involves recognition tasks with various training/enrollment/test conditions. We provide a complete description of the database and the benchmark, and present an i-vector baseline system constructed using the Kaldi toolkit.
  • Keywords
    "Speech","Databases","Speaker recognition","Signal to noise ratio","Benchmark testing","Speech recognition","Training"
  • Publisher
    IEEE
  • Conference_Titel
    Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015 International Conference
  • Type

    conf

  • DOI
    10.1109/ICSDA.2015.7357869
  • Filename
    7357869