DocumentCode :
454718
Title :
The CU-HTK Mandarin Broadcast News Transcription System
Author :
Sinha, R. ; Gales, M.J.F. ; Kim, D.Y. ; Liu, X.A. ; Sim, K.C. ; Woodland, P.C.
Author_Institution :
Dept. of Eng., Cambridge Univ.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This paper discusses the development of the CU-HTK Mandarin broadcast news (BN) transcription system. The Mandarin BN task includes a significant amount of English data. Hence, techniques have been investigated to allow the same system to handle both Mandarin and English by augmenting the Mandarin training sets with English acoustic and language model training data. A range of acoustic models were built, including models based on Gaussianised features, speaker adaptive training and feature-space MPE. A multi-branch system architecture is described in which multiple acoustic model types, alternate phone sets and segmentations can be used in a system combination framework to generate the final output. The final system shows state-of-the-art performance over a range of test sets.
Keywords :
Gaussian processes; acoustics; natural languages; speech processing; CU-HTK Mandarin broadcast news; English acoustic; English data; Gaussianised features; acoustic models; broadcast news transcription system; language model training data; multibranch system architecture; speaker adaptive training; Acoustic testing; Broadcasting; Gaussian processes; Loudspeakers; Natural languages; Performance evaluation; Speech; System testing; Telephony; Training data;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660211
Filename :
1660211