مرکز منطقه ای اطلاع رساني علوم و فناوري - Probabilistic lexical modeling and unsupervised training for zero-resourced ASR

DocumentCode :

672394

Title :

Probabilistic lexical modeling and unsupervised training for zero-resourced ASR

Author :

Rasipuram, Ramya ; Razavi, Mohsen ; Magimai-Doss, Mathew

Author_Institution :

Idiap Res. Inst., Martigny, Switzerland

fYear :

2013

fDate :

8-12 Dec. 2013

Firstpage :

446

Lastpage :

451

Abstract :

Standard automatic speech recognition (ASR) systems rely on transcribed speech, language models, and pronunciation dictionaries to achieve state-of-the-art performance. The unavailability of these resources constrains the ASR technology to be available for many languages. In this paper, we propose a novel zero-resourced ASR approach to train acoustic models that only uses list of probable words from the language of interest. The proposed approach is based on Kullback-Leibler divergence based hidden Markov model (KL-HMM), grapheme subword units, knowledge of grapheme-to-phoneme mapping, and graphemic constraints derived from the word list. The approach also exploits existing acoustic and lexical resources available in other resource rich languages. Furthermore, we propose unsupervised adaptation of KL-HMM acoustic model parameters if untranscribed speech data in the target language is available. We demonstrate the potential of the proposed approach through a simulated study on Greek language.

Keywords :

hidden Markov models; natural language processing; speech recognition; Greek language; KL-HMM; Kullback-Leibler divergence based hidden Markov model; acoustic models; acoustic resources; automatic speech recognition systems; grapheme subword units; grapheme-to-phoneme mapping; graphemic constraints; language models; lexical resources; probabilistic lexical modeling; pronunciation dictionaries; resource rich languages; transcribed speech; unsupervised training; untranscribed speech data; zero-resourced ASR; Acoustics; Artificial neural networks; Context modeling; Hidden Markov models; Knowledge based systems; Speech; Training; Kullback-Leibler divergence based hidden Markov model; graphemes; phonemes; probabilistic lexical modeling; unsupervised adaptation; zero-resourced speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on

Conference_Location :

Olomouc

Type :

conf

DOI :

10.1109/ASRU.2013.6707771

Filename :

6707771

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=672394