DocumentCode :
542160
Title :
Transcribing audio-video archives
Author :
Barras, Claude ; Allauzen, Alexandre ; Lamel, Lori ; Gauvain, Jean-Luc
Author_Institution :
Spoken Language Processing Group, LIMSI-CNRS, B.P. 133, 91403 Orsay cedex, France
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper addresses the automatic transcription of audio-video archives using a state-of-the-art broadcast news speech transcription system. A 9-hour corpus spanning the latter half of the 20th century (1945–1995) has been transcribed and an analysis of the transcription quality carried out. In addition to the challenges of transcribing heterogeneous broadcast news data, we are faced with changing properties of the archive over time, such as the audio quality, the speaking style, vocabulary items and manner of expression. After assessing the performance of the transcription system, several paths are explored in an attempt to reduce the mismatch between the acoustic and language models and the archived data.
Keywords :
Computational modeling; Computers; Distance measurement; Gold; Hidden Markov models; Speech; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743642
Filename :
5743642