مرکز منطقه ای اطلاع رساني علوم و فناوري - Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil

DocumentCode :

2177079

Title :

Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil

Author :

Vu, Ngoc Thang ; Kraus, Franziska ; Schultz, Tanja

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

5000

Lastpage :

5003

Abstract :

This paper presents our work on rapid language adaptation of acoustic models based on multilingual cross-language bootstrapping and unsupervised training. We used Automatic Speech Recognition (ASR) systems in English, French, German, and Spanish to build a Czech ASR system from scratch. System building was performed without using any transcribed audio data by applying three consecutive steps, i.e. cross-language transfer, unsupervised training based on the "multilingual A-stabil" confidence score, and boot strapping. Based on the confidence score we selected 72% (16.6 hours) of the available audio data with a transcription WER of less than 14.5%. The cross-language bootstrap achieves a word error rate of 23.3% on the Czech development set and 22.4% on the evaluation set. These results are very promising as the performance compares favorably to the Czech ASR system which was trained on 23 hours of manually transcribed data (21.8% on the development set and 21.3% on the evaluation set).

Keywords :

bootstrapping; speech recognition; Czech ASR system; audio data; automatic speech recognition; multilingual a-stabil; multilingual cross-language bootstrapping; unsupervised training; Acoustics; Adaptation models; Data models; Hidden Markov models; Speech recognition; Training; Training data; multilingual A-Stabil; rapid language adaptation of ASR; unsupervised training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947479

Filename :

5947479

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2177079