Title :
Low SNR robust Chinese tone extraction based human auditory model
Author :
Dai, Mingyang ; Kai Yu ; Xu, Boling ; Chongzhi Yu
Author_Institution :
Inst. of Acoust., Nanjing Univ., China
Abstract :
This paper proposes a robust Chinese tone extraction algorithm based on the human auditory mechanism and short-term stationarity of Chinese speech. In this method, we use the pooled-correlogram based on human auditory model to extract the pitch of speech. An unsupervised lateral inhibitory network is used to get the peak position, which simulates the lateral inhibitory phenomenon in the human auditory system. The pitch restriction between successive frames of speech is imposed to get rid of a miscarriage of justice in the output of the lateral inhibitory network. As shown in the experiments, the method can extract Chinese tone quite well even in rather low SNR cases. It can separate the individual tone clearly as two speakers talk simultaneously
Keywords :
acoustic signal detection; correlation methods; feature extraction; hearing; natural languages; neural nets; noise; speech processing; unsupervised learning; human auditory model; lateral inhibitory phenomenon; low SNR; peak position; pitch restriction; pooled-correlogram; robust Chinese tone extraction algorithm; short-term stationarity; speech frames; unsupervised lateral inhibitory network; Acoustics; Auditory system; Autocorrelation; Band pass filters; Bandwidth; Channel bank filters; Frequency; Humans; Robustness; Speech recognition;
Conference_Titel :
Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-5747-7
DOI :
10.1109/ICOSP.2000.891620