• DocumentCode
    1085042
  • Title

    Interaction between segmental and nonsegmental factors in speech recognition

  • Author

    Lindblom, Björn E F ; Svensson, Stig-Goran

  • Author_Institution
    Stockholm University, Fack, Stockholm, Sweden
  • Volume
    21
  • Issue
    6
  • fYear
    1973
  • fDate
    12/1/1973 12:00:00 AM
  • Firstpage
    536
  • Lastpage
    545
  • Abstract
    The present study demonstrates that spectrograms of Swedish utterances can be read with great accuracy under nontrivial conditions. This result is to be attributed primarily to the development of a formalized strategy that was designed to make it possible for spectrogram readers to derive information on certain grammatical features of an utterance such as word class, word boundaries, endings, and function elements. The input to this strategy consists of segmental phonetic features that the subjects extract from the spectrographic display and of information on prosodic features such as stress and tonal accent. The latter information is specified on the spectrogram for each syllable. An experimental situation is thus created that differs from the informal recognition of unknown utterances from spectrograms. A subject can base his final identification of lexical items not only on segmental phonetic features but also on an error-free specification of prosodic features and, in so far as he has been able to use the strategy successfully, on grammatical information. Experimental results are reported indicating that subjects improve their performance markedly with the aid of the strategy. In conclusion, attention is drawn to the important role that grammar and prosody appear to play in the present experiments and to the implications of the findings for future work on automatic speech recognition and speech perception.
  • Keywords
    Acoustic distortion; Data mining; Degradation; Displays; Humans; Natural languages; Oral communication; Spectrogram; Speech recognition; Stress;
  • fLanguage
    English
  • Journal_Title
    Audio and Electroacoustics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9278
  • Type

    jour

  • DOI
    10.1109/TAU.1973.1162527
  • Filename
    1162527