• DocumentCode
    290062
  • Title

    Perceptual benchmarks for automatic language identification

  • Author

    Muthusamy, Yeshwant K. ; Jain, Neena ; Cole, Ronald A.

  • Author_Institution
    Comput. Sci. Lab., Texas Instrum. Inc., Dallas, TX, USA
  • Volume
    i
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    There has been renewed interest in the field of automatic language identification over the past two years. The advent of a public-domain ten-language corpus of telephone speech has made the evaluation of different approaches to automatic language identification feasible. In an effort to provide benchmarks for evaluating machine performance, we conducted perceptual experiments on 1-, 2-, 4- and 6-second excerpts of telephone speech excised from spontaneous speech utterances in this corpus. The subject population consisted of 10 native speakers of English and 2 speakers from each of the remaining 9 languages. Statistical analyses of our results indicate that duration of the excerpt, familiarity with the language, and number of languages known are important factors affecting a subject´s performance on the identification task
  • Keywords
    natural languages; speech recognition; statistical analysis; automatic language identification; machine performance; perceptual benchmarks; public-domain ten-language corpus; spontaneous speech utterances; statistical analyses; telephone speech; Computer science; Humans; Instruments; NIST; Natural languages; Search problems; Speech analysis; Statistical analysis; Surges; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389288
  • Filename
    389288