DocumentCode
2787601
Title
A psychoacoustic spectral subtraction method for noise suppression in automatic speech recognition
Author
Haque, Serajul ; Togneri, Roberto
Author_Institution
Sch. of Electr., Electron. & Comput. Eng., Univ. of Western Australia, Crawley, WA, Australia
fYear
2010
fDate
14-19 March 2010
Firstpage
1618
Lastpage
1621
Abstract
A time-frequency spectral subtraction method based on the knowledge of several psychoacoustic properties of human perception is presented. These effects are the critical band filtering, synaptic adaptation which also introduces temporal forward masking, equal loudness preemphasis, power law of hearing, and simultaneous masking effect. The perceptual speech and noise is estimated separately by a detailed psychoacoustic non-linear transformation undergoing in the human auditory system. The spectral subtraction using a over-subtraction factor and a spectral floor is measured by a speech recognition front-end using a continuous density HMM recognizer. The method shows reduced residual noise and improved word recognition performance in broadband Gaussian noise conditions compared to conventional spectral subtraction method.
Keywords
Gaussian noise; acoustic signal processing; hidden Markov models; interference suppression; speech enhancement; speech intelligibility; speech recognition; automatic speech recognition; broadband Gaussian noise; continuous density HMM recognizer; critical band filtering; equal loudness preemphasis; hearing power law; human auditory system; human perception; noise suppression; nonlinear transformation; psychoacoustic spectral subtraction method; simultaneous masking effect; speech recognition front-end; synaptic adaptation; temporal forward masking; word recognition performance; Auditory system; Automatic speech recognition; Density measurement; Filtering; Gaussian noise; Humans; Psychology; Speech enhancement; Speech recognition; Time frequency analysis; Auditory system; acoustic signal processing; speech communication; speech enhancement; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5494888
Filename
5494888
Link To Document