Title :
A segment-based approach to automatic language identification
Author :
Muthusamy, Yeshwant K. ; Cole, Ronald A. ; Gopalakrishnan, Murali
Author_Institution :
Dept. of Comput. Sci. & Eng., Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA
Abstract :
A segment-based approach to automatic language identification is discussed which is based on the idea that the acoustic structure of languages can be estimated by segmenting speech into broad phonetic categories. Automatic language identification can then be achieved by computing features that describe the phonetic and prosodic characteristics of the language, and using these feature measurements to train a classifier to distinguish between languages. As a first step in this approach, a multilanguage neural-network-based segmentation and broad classification algorithm using seven broad phonetic categories has been built. The algorithm was trained and tested on separate sets of speakers of American English, Japanese, Mandarin Chinese, and Tamil. It currently performs with an accuracy of 82.3% on the utterances of the test set
Keywords :
natural languages; neural nets; speech recognition; American English; Japanese; Mandarin Chinese; Tamil; acoustic structure; automatic language identification; broad classification algorithm; broad phonetic categories; language classifier; multilanguage neural-network-based segmentation; phonetic characteristics; prosodic characteristics; segment-based approach; speech recognition; Acoustic measurements; Acoustic testing; Computer science; Databases; Loudspeakers; Low pass filters; Natural languages; Neural networks; Signal processing algorithms; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150349