• DocumentCode
    3003259
  • Title

    Language identification using noisy speech

  • Author

    Foil, Jerry T.

  • Author_Institution
    GTE Government Systems Corporation, Mountain View, California, USA
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    861
  • Lastpage
    864
  • Abstract
    This paper describes experiments in automatic identification of spoken languages using recordings of noisy radio signals as a data base. Prior efforts used uncorrupted speech; we selected techniques that we believed would be robust in noise. One technique attempted to distinguish languages by applying a classical quadratic classifier to prosodic features extracted from pitch and energy contours. Another was designed to exploit the frequency of occurrence of characteristic sounds using formant locations to represent the sounds, and using a vector-quantization distortion measure as the basis for language decisions. The techniques were required to make decisions based on speech segments of a few seconds duration. Our final tests were conducted on over 4 hours of previously unprocessed speech. Three languages, each from a different major language group, were used for development and testing. Allowing 11% false rejection (no decision), we achieved 64% correct identification with short speech segments. Our plans include the application of Markov modeling techniques to language identification.
  • Keywords
    Acoustic noise; Data mining; Distortion measurement; Feature extraction; Frequency; Natural languages; Noise robustness; Signal processing; Speech enhancement; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1168879
  • Filename
    1168879