• DocumentCode
    792292
  • Title

    Robust auditory-based speech processing using the average localized synchrony detection

  • Author

    Ali, Ahmed M Abdelatty ; Van der Spiegel, Jan ; Mueller, Paul

  • Author_Institution
    Res. & Dev., Texas Instruments, Inc, Warren, NJ, USA
  • Volume
    10
  • Issue
    5
  • fYear
    2002
  • fDate
    7/1/2002 12:00:00 AM
  • Firstpage
    279
  • Lastpage
    292
  • Abstract
    A new auditory-based speech processing system based on the biologically rooted property of the average localized synchrony detection (ALSD) is proposed. The system detects periodicity in the speech signal at Bark-scaled frequencies while reducing the response´s spurious peaks and sensitivity to implementation mismatches, and hence presents a consistent and robust representation of the formants. The system is evaluated for its formant extraction ability while reducing spurious peaks. It is compared with other auditory-based and traditional systems in the tasks of vowel and consonant recognition on clean speech from the TIMIT database and in the presence of noise. The results illustrate the advantage of the ALSD system in extracting the formants and reducing the spurious peaks. They also indicate the superiority of the synchrony measures over the mean-rate in the presence of noise.
  • Keywords
    feature extraction; hearing; reviews; signal detection; speech processing; speech recognition; Bark-scaled frequencies; TIMIT database; auditory-based systems; automatic speech recognition; average localized synchrony detection; biologically rooted property; clean speech; consonant recognition; formant extraction; human auditory system; mean-rate; noise; robust auditory-based speech processing; robust formants representation; speech signal periodicity detection; spurious peaks reduction; synchrony measures; vowel recognition; Auditory system; Automatic speech recognition; Cepstral analysis; Frequency synchronization; Humans; Mel frequency cepstral coefficient; Robustness; Speech enhancement; Speech processing; Speech recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.800556
  • Filename
    1021071