Title :
Language identification using parallel sub-word recognition
Author :
Ramasubramanian, V. ; Sreenivas, T.V.
Abstract :
Parallel sub-word recognition (PSWR) is a new model that has been proposed for language identification (LID) which does not need elaborate phonetic labeling of the speech data in a foreign language. The new approach performs a front-end tokenization in terms of sub-word units which are designed by automatic segmentation, segment clustering and segment HMM modeling. We develop PSWR based LID in a framework similar to the parallel phone recognition (PPR) approach in the literature. This includes a front-end tokenizer and a back-end language model, for each language to be identified. Considering various combinations of the statistical evaluation scores, it is found that PSWR can perform as well as PPR, even with broad acoustic sub-word tokenization, thus making it an efficient alternative to the PPR system.
Keywords :
hidden Markov models; natural languages; speech processing; speech recognition; automatic segmentation; back-end language model; front-end tokenizer; language identification; parallel phone recognition; parallel sub-word recognition; phonetic labeling; segment HMM; segment clustering; speech data; statistical evaluation; Acoustic testing; Automatic speech recognition; Data engineering; Databases; Hidden Markov models; Labeling; Natural languages; Performance analysis; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198709