DocumentCode :
1739532
Title :
Low SNR robust Chinese tone extraction based human auditory model
Author :
Dai, Mingyang ; Kai Yu ; Xu, Boling ; Chongzhi Yu
Author_Institution :
Inst. of Acoust., Nanjing Univ., China
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
752
Abstract :
This paper proposes a robust Chinese tone extraction algorithm based on the human auditory mechanism and short-term stationarity of Chinese speech. In this method, we use the pooled-correlogram based on human auditory model to extract the pitch of speech. An unsupervised lateral inhibitory network is used to get the peak position, which simulates the lateral inhibitory phenomenon in the human auditory system. The pitch restriction between successive frames of speech is imposed to get rid of a miscarriage of justice in the output of the lateral inhibitory network. As shown in the experiments, the method can extract Chinese tone quite well even in rather low SNR cases. It can separate the individual tone clearly as two speakers talk simultaneously
Keywords :
acoustic signal detection; correlation methods; feature extraction; hearing; natural languages; neural nets; noise; speech processing; unsupervised learning; human auditory model; lateral inhibitory phenomenon; low SNR; peak position; pitch restriction; pooled-correlogram; robust Chinese tone extraction algorithm; short-term stationarity; speech frames; unsupervised lateral inhibitory network; Acoustics; Auditory system; Autocorrelation; Band pass filters; Bandwidth; Channel bank filters; Frequency; Humans; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-5747-7
Type :
conf
DOI :
10.1109/ICOSP.2000.891620
Filename :
891620
Link To Document :
بازگشت