Title :
A baseline for the transcription of Italian broadcast news
Author :
Brugnara, E. ; Cettolo, M. ; Federico, M. ; Giuliani, D.
Author_Institution :
Centro per la Ricerca Sci. e Tecnol., ITC-irst, Trento, Italy
Abstract :
The paper presents the first achievements in the development of a broadcast news transcription system to be applied for the processing of huge audio archives. In particular, the Italian broadcast news corpus under collection is introduced, and the first implemented baseline system is outlined. The baseline system consists of an audio segmentation module and a speech recognizer featuring a recursive Viterbi beam search, a 64k word lexicon, a tree-based trigram LM representation, and MLLR adaptation. The word error rate of the baseline was 20.9% on planned studio speech and 28.8% on the whole test set
Keywords :
audio signal processing; speech recognition; Italian broadcast news transcription; MLLR adaptation; audio archives; audio segmentation module; baseline system; planned studio speech; recursive Viterbi beam search; speech recognizer; tree-based trigram LM representation; word error rate; word lexicon; Acoustic beams; Acoustic signal detection; Acoustic testing; Adaptation model; Decoding; Loudspeakers; Maximum likelihood linear regression; Radio broadcasting; Speech recognition; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
Print_ISBN :
0-7803-6293-4
DOI :
10.1109/ICASSP.2000.862070