DocumentCode
1732978
Title
An investigation into VTLN for improved transcription of Czech broadcast programs
Author
Cerva, Petr ; Palecek, Karel ; Silovsky, Jan ; Nouza, Jan
Author_Institution
Inst. of Inf. Technol. & Electron., Tech. Univ. of Liberec, Liberec, Czech Republic
fYear
2011
Firstpage
201
Lastpage
204
Abstract
This paper deals with the Vocal Tract Length Normalization (VTLN) method. The aim is to investigate the best way how to utilize this technique for improving recognition accuracy of a LVCRS system that has been developed for broadcast program transcription at our lab in recent years. For this purpose, VTLN is evaluated experimentally in several configurations during testing as well as in speaker adaptive training scheme. In the former case, we employ VTLN as unsupervised for each testing utterance without the knowledge of transcription of adaptation data. Our results on different types of broadcast programs show that the resulting approach for VTLN reduces the Word Error Rate (WER) of our system significantly - by 7 % relatively.
Keywords
broadcasting; speaker recognition; unsupervised learning; Czech broadcast programs; LVCRS system; VTLN; broadcast program transcription; recognition accuracy; speaker adaptive training; testing utterance; unsupervised; vocal tract length normalization; word error rate; Acoustics; Hidden Markov models; Optimized production technology; Speech; Speech recognition; Testing; Training; Transcription of broadcast programs; Unsupervised speaker adaptation; Vocal tract length normalization;
fLanguage
English
Publisher
ieee
Conference_Titel
ELMAR, 2011 Proceedings
Conference_Location
Zadar
ISSN
1334-2630
Print_ISBN
978-1-61284-949-2
Electronic_ISBN
1334-2630
Type
conf
Filename
6044296
Link To Document