Automatic language identification with recurrent neural networks

Author

Braun, Jerome ; Levkowitz, Haim

Author_Institution

Dept. of Comput. Sci., Massachusetts Univ., Lowell, MA, USA

Volume

3

fYear

1998

fDate

4-9 May 1998

Firstpage

2184

Abstract

Automatic language identification (LID), an important domain in speech processing, means the capability of a machine to determine a natural language from a spoken utterance. We present a novel approach to LID, which involves recurrent neural networks (RNN) as the main mechanism. We propose that, because of acoustical context issues, RNNs are particularly suitable for the LID task. Our approach also introduces perceptually guided training (PGT), a novel training method, that exploits the concept of perceptually significant regions (PSR) postulated by our approach. We present our overall approach and describe LIREN/PGT, the have we have developed, implementing our approach. We also discuss our LID experiments with English, German, and Mandarin. In the paper, we concentrate on the architecture and training of RNNs for automatic language identification

Keywords

acoustic signal processing; learning (artificial intelligence); natural languages; recurrent neural nets; speech recognition; English; German; Mandarin; automatic language identification; natural language; perceptually guided training; perceptually significant regions; recurrent neural networks; spoken utterance; Computer science; Hidden Markov models; Humans; Multimedia systems; Natural languages; Recurrent neural networks; Routing; Speech processing; Speech recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on

Conference_Location

Anchorage, AK

ISSN

1098-7576

Print_ISBN

0-7803-4859-1

Type

conf

DOI

10.1109/IJCNN.1998.687199

Filename

687199