• DocumentCode
    1739532
  • Title

    Low SNR robust Chinese tone extraction based human auditory model

  • Author

    Dai, Mingyang ; Kai Yu ; Xu, Boling ; Chongzhi Yu

  • Author_Institution
    Inst. of Acoust., Nanjing Univ., China
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    752
  • Abstract
    This paper proposes a robust Chinese tone extraction algorithm based on the human auditory mechanism and short-term stationarity of Chinese speech. In this method, we use the pooled-correlogram based on human auditory model to extract the pitch of speech. An unsupervised lateral inhibitory network is used to get the peak position, which simulates the lateral inhibitory phenomenon in the human auditory system. The pitch restriction between successive frames of speech is imposed to get rid of a miscarriage of justice in the output of the lateral inhibitory network. As shown in the experiments, the method can extract Chinese tone quite well even in rather low SNR cases. It can separate the individual tone clearly as two speakers talk simultaneously
  • Keywords
    acoustic signal detection; correlation methods; feature extraction; hearing; natural languages; neural nets; noise; speech processing; unsupervised learning; human auditory model; lateral inhibitory phenomenon; low SNR; peak position; pitch restriction; pooled-correlogram; robust Chinese tone extraction algorithm; short-term stationarity; speech frames; unsupervised lateral inhibitory network; Acoustics; Auditory system; Autocorrelation; Band pass filters; Bandwidth; Channel bank filters; Frequency; Humans; Robustness; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-5747-7
  • Type

    conf

  • DOI
    10.1109/ICOSP.2000.891620
  • Filename
    891620