Author :
Srinivasan, S.H.
Author_Institution :
Appl. Res. Group, Satyam Comput. Services Ltd, Bangalore, India
Abstract :
Auditory scene analysis (ASA) tries to segment an auditory signal (scene) into objects. Most of the ASA-based intermediate representations proposed so far are difficult to compute. We propose auditory strands and blobs as intermediate representations. Auditory blobs are parts of an audio signal that share the same onset. By the principles of computational auditory scene analysis, they belong to the same object. We show how auditory blobs can be extracted and define harmonicity, dynamics, and onset features for auditory blobs. We also demonstrate their application to audio separation.
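The common-onset idea in the abstract can be illustrated with a minimal sketch. This is not the paper's extraction method, which the abstract does not describe. It assumes that each frequency channel is already available as a list of per-frame envelope values, that an onset is the first frame at or above a fixed threshold, and that channels whose onsets fall within a small frame tolerance are grouped together. The names `first_onset`, `group_by_onset`, `threshold`, and `tolerance` are hypothetical and chosen only for this illustration.

```python
from typing import List, Optional, Sequence


def first_onset(envelope: Sequence[float], threshold: float) -> Optional[int]:
    """Return the first frame whose value is >= threshold, or None if there is none."""
    for frame, value in enumerate(envelope):
        if value >= threshold:
            return frame
    return None


def group_by_onset(
    envelopes: Sequence[Sequence[float]],
    threshold: float = 0.5,
    tolerance: int = 1,
) -> List[List[int]]:
    """Group channel indices whose onsets lie within `tolerance` frames
    of the earliest onset in the group (an assumed grouping rule)."""
    onsets = []
    for channel, envelope in enumerate(envelopes):
        frame = first_onset(envelope, threshold)
        if frame is not None:  # channels that never cross the threshold are skipped
            onsets.append((frame, channel))
    onsets.sort()

    groups: List[List[int]] = []
    group_start: Optional[int] = None
    for frame, channel in onsets:
        if group_start is None or frame - group_start > tolerance:
            groups.append([channel])  # start a new group at this onset
            group_start = frame
        else:
            groups[-1].append(channel)  # same onset, same group
    return groups


# Toy data: channels 0 and 1 start together, channel 2 starts later,
# and channel 3 stays silent.
envelopes = [
    [0.0, 0.0, 1.0, 1.0, 1.0],
    [0.0, 0.0, 0.9, 1.0, 1.0],
    [0.0, 0.0, 0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 0.0, 0.0],
]
blobs = group_by_onset(envelopes)
print(blobs)  # [[0, 1], [2]]
```

In this toy example each inner list plays the role of one blob: a set of channels treated as belonging to the same object because they begin together. A real system would need a filter bank front end and a more robust onset detector than a fixed threshold.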
Keywords :
audio signal processing; feature extraction; signal representation; source separation; audio separation; audio signal segmentation; auditory blobs; auditory scene analysis; auditory strands; dynamics; harmonicity; onset features; Cepstral analysis; Cepstrum; Filter bank; Heart; Image analysis; Independent component analysis; Layout; Signal analysis; Signal processing; Speech coding
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326826