• DocumentCode
    312315
  • Title

    On using prosodic cues in automatic language identification

  • Author

    Thymé-Gobbel, Ann E. ; Hutchins, Sandra E.

  • Author_Institution
    Natural Speech Technol. Inc., USA
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1768
  • Abstract
    Presents an effort to explore the utility of prosodic information in language identification/discrimination (LID) tasks. We present our model and results from pair-wise LID tasks with English, Spanish, Japanese and Mandarin using multi-speaker elicited spontaneous speech and a selected set of prosodic parameters. These languages represent four different types of languages, varying in pitch use and timing. Parameters were designed to capture pitch and amplitude contours on a syllable-by-syllable basis, and to be insensitive to overall amplitude, pitch and speaking rate. Results show that prosodic cues alone can distinguish between some language pairs with results comparable to other systems, indicating that prosodic parameters are highly useful in automatic LID. However, the statistical relationships between a number of individual features deduced from timing and pitch measurements are needed to begin to capture such complex perceptual events as rhythm. Strengths of individual prosodic parameters and classes of parameters (primarily pitch, and secondarily duration and location) reflect differences between the four languages, mostly as expected based on the linguistic literature, suggesting that effective use of prosodic parameters is aided by an understanding of the relationships between physical measurements and perceived linguistic events.
  • Keywords
    languages; linguistics; speech recognition; timing; English; Japanese; Mandarin; Spanish; amplitude contours; automatic language identification; complex perceptual events; duration; language discrimination tasks; language pairs; location; multi-speaker elicited spontaneous speech; perceived linguistic events; physical measurements; pitch use; prosodic cues; prosodic parameters; rhythm; speaking rate; statistical relationships; syllables; Natural languages
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607971
  • Filename
    607971