Title :
Auditory scene analysis based on time-frequency integration of shared FM and AM
Author :
Abe, Mototsugu ; Ando, Shigeru
Author_Institution :
Dept. of Math. Eng. & Inf. Phys., Tokyo Univ., Japan
Abstract :
This paper describes a new method for computational auditory scene analysis which is based on (1) waveform operators to extract instantaneous frequency (IF), frequency change (FM), and amplitude change (AM) from subband signals, and (2) the introduction of a voting method into a probability distribution function to extract coherency (shared fundamental frequency, shared FM, and shared AM) involved in them. We introduce non-parametric Kalman filtering for the time-axis integration. A consistent AM operator which is independent of frequency change is newly defined. The sharpness of the resultant probability distribution is examined with relation to the definition of the operators and subband bandwidth. We evaluate the performance of the algorithm by using several speech sounds
Keywords :
Kalman filters; amplitude modulation; coherence; feature extraction; filtering theory; frequency modulation; hearing; mathematical operators; probability; speech processing; time-frequency analysis; waveform analysis; wavelet transforms; AM; FM; algorithm performance; amplitude change; auditory scene analysis; coherency; frequency change; instantaneous frequency extraction; nonparametric Kalman filtering; probability distribution; shared fundamental frequency; speech sounds; subband bandwidth; subband signals; time-axis integration; time-frequency integration; voting method; waveform operators; wavelet transform; Bandwidth; Data mining; Distributed computing; Filters; Image analysis; Interference; Physics; Scattering; Time frequency analysis; Voting;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.681639