DocumentCode :
3003259
Title :
Language identification using noisy speech
Author :
Foil, Jerry T.
Author_Institution :
GTE Government Systems Corporation, Mountain View, California, USA
Volume :
11
fYear :
1986
fDate :
31503
Firstpage :
861
Lastpage :
864
Abstract :
This paper describes experiments in automatic identification of spoken languages using recordings of noisy radio signals as a data base. Prior efforts used uncorrupted speech; we selected techniques that we believed would be robust in noise. One technique attempted to distinguish languages by applying a classical quadratic classifier to prosodic features extracted from pitch and energy contours. Another was designed to exploit the frequency of occurrence of characteristic sounds using formant locations to represent the sounds, and using a vector-quantization distortion measure as the basis for language decisions. The techniques were required to make decisions based on speech segments of a few seconds duration. Our final tests were conducted on over 4 hours of previously unprocessed speech. Three languages, each from a different major language group, were used for development and testing. Allowing 11% false rejection (no decision), we achieved 64% correct identification with short speech segments. Our plans include the application of Markov modeling techniques to language identification.
Keywords :
Acoustic noise; Data mining; Distortion measurement; Feature extraction; Frequency; Natural languages; Noise robustness; Signal processing; Speech enhancement; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
Type :
conf
DOI :
10.1109/ICASSP.1986.1168879
Filename :
1168879
Link To Document :
بازگشت