DocumentCode
454718
Title
The CU-HTK Mandarin Broadcast News Transcription System
Author
Sinha, R. ; Gales, M.J.F. ; Kim, D.Y. ; Liu, X.A. ; Sim, K.C. ; Woodland, P.C.
Author_Institution
Dept. of Eng., Cambridge Univ.
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
This paper discusses the development of the CU-HTK Mandarin broadcast news (BN) transcription system. The Mandarin BN task includes a significant amount of English data. Hence, techniques have been investigated to allow the same system to handle both Mandarin and English by augmenting the Mandarin training sets with English acoustic and language model training data. A range of acoustic models was built, including models based on Gaussianised features, speaker adaptive training and feature-space MPE. A multi-branch system architecture is described in which multiple acoustic model types, alternate phone sets and segmentations can be used in a system combination framework to generate the final output. The final system shows state-of-the-art performance over a range of test sets.
Keywords
Gaussian processes; acoustics; natural languages; speech processing; CU-HTK Mandarin broadcast news; English acoustic; English data; Gaussianised features; acoustic models; broadcast news transcription system; language model training data; multibranch system architecture; speaker adaptive training; Acoustic testing; Broadcasting; Gaussian processes; Loudspeakers; Natural languages; Performance evaluation; Speech; System testing; Telephony; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660211
Filename
1660211