DocumentCode :
134203
Title :
Corpus and transcription system of Chinese Lecture Room
Author :
Sheng Li ; Akita, Yuya ; Kawahara, Toshio
Author_Institution :
Sch. of Inf., Kyoto Univ., Kyoto, Japan
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
442
Lastpage :
445
Abstract :
The paper introduces our project on automatic speech recognition (ASR) of Chinese lectures. For a comprehensive study on spontaneous Chinese, we compile a corpus of Chinese Lecture Room (CCLR), which has faithful transcripts and caption texts. Based on the annotated alignment of these texts, we conduct analysis on linguistic phenomena of spontaneous Chinese speech. We also develop a baseline ASR system with this corpus, and refine it with the DNN-HMM framework. By exploiting the lecture data without faithful transcripts and conducting unsupervised speaker adaptation, significant improvement of ASR accuracy is achieved.
Keywords :
computer aided instruction; hidden Markov models; linguistics; natural language processing; neural nets; speech recognition; text analysis; ASR accuracy; CCLR; DNN-HMM framework; annotated alignment; automatic speech recognition; baseline ASR system; caption texts; corpus of Chinese Lecture Room; deep neural network; faithful transcripts; hidden Markov models; linguistic phenomena; spontaneous Chinese speech; transcription system; unsupervised speaker adaptation; Acoustics; Adaptation models; Data models; Neural networks; Speech; Speech recognition; Training; acoustic model; lecture; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936595
Filename :
6936595
Link To Document :
بازگشت