DocumentCode :
1689923
Title :
Rapid bootstrapping of a Ukrainian large vocabulary continuous speech recognition system
Author :
Schlippe, Tim ; Volovyk, Mykola ; Yurchenko, Kateryna ; Schultz, Tanja
Author_Institution :
Cognitive Syst. Lab., Karlsruhe Inst. of Technol. (KIT), Karlsruhe, Germany
fYear :
2013
Firstpage :
7329
Lastpage :
7333
Abstract :
We report on our efforts toward an LVCSR system for the Slavic language Ukrainian. We describe the Ukrainian text and speech database recently collected as part of our GlobalPhone corpus [1] with our Rapid Language Adaptation Toolkit [2]. The data was complemented by a large collection of text data crawled from various Ukrainian websites. For the production of the pronunciation dictionary, we investigate strategies using grapheme-to-phoneme (g2p) models derived from existing dictionaries of other languages, thereby severely reducing the necessary manual effort. Russian and Bulgarian g2p models even reduce the number of pronunciation rules to one fifth. We achieve a significant improvement by applying state-of-the-art techniques for acoustic modeling and our day-wise text collection and language model interpolation strategy [3]. Our best system achieves a word error rate of 11.21% on the test set of read newspaper speech.
Keywords :
database languages; dictionaries; interpolation; speech recognition; Bulgarian g2p model; GlobalPhone corpus; LVCSR system; Russian g2p model; Slavic language Ukrainian; Ukrainian speech database; Ukrainian text database; Ukrainian website; acoustic modeling technique; day-wise text collection; grapheme-to-phoneme model; language model interpolation strategy; large vocabulary continuous speech recognition system; pronunciation dictionary production; rapid bootstrapping; rapid language adaptation toolkit; read newspaper speech; word error rate; Abstracts; Adaptation models; Gold; Optimization; Speech; Speech recognition; Vocabulary; Slavic language; Ukrainian; pronunciation dictionary; rapid language adaptation; speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639086
Filename :
6639086