Title :
Analysis of phoneme-based features for language identification
Author :
Berkling, Kay M. ; Arai, Takayuki ; Barnard, Etienne
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Abstract :
This paper presents an analysis of the phonemic language identification system introduced previously (see Eurospeech, vol.2, p.1307, 1993), now extended to recognize German in addition to English and Japanese. In this system language identification is based on features derived from a superset of phonemes of all three languages. As we increase the number of languages, the need to reduce the feature space becomes apparent. Practical analysis of single-feature statistics in conjunction with linguistic knowledge leads to 90% reduction of the feature space with only a 5% loss in performance. Thus, the system discriminates between Japanese and English with 84.1% accuracy based on only 15 features compared to 84.6% based on the complete set of 318 phonemic features (or 83.6% using 333 broad-category features). Results indicate that a language identification system may be designed based on linguistic knowledge and then implemented with a neural network of appropriate complexity
Keywords :
feature extraction; natural languages; pattern classification; speech processing; English; German; Japanese; broad-category features; feature space; language identification; linguistic knowledge; neural network; performance; phoneme-based features; single-feature statistics; Dentistry; Feature extraction; Joining processes; Merging; Natural languages; Neural networks; Performance loss; Production; Statistics;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389298