Title :
The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry's Experiment after Sixty Years
Author :
Marchegiani, Letizia ; Karadogan, Seliz G. ; Andersen, Tobias ; Larsen, Jan ; Hansen, Lars Kai
Author_Institution :
DTU Inf., Tech. Univ. of Denmark, Lyngby, Denmark
Abstract :
We investigate the role of top-down, task-driven attention in the cocktail party problem. A recently proposed computational model of top-down attention makes it possible to simulate the cocktail party problem and to predict sensitivity to confounders under different levels of attention. Based on such simulations, we expect pattern recognition to improve under strong top-down attention, because the model can compensate for noise and confounders. We then investigate the relation between temporal and spectral overlaps and speech intelligibility in humans, and how the presence of a task influences that relation. For this purpose, we perform behavioral experiments inspired by Cherry's classic experiments, carried out almost sixty years ago. Participants listen to a mono signal consisting of two different narratives spoken by a speech synthesizer, under two different conditions. In the first condition, participants listen with no specific task; in the second, they are asked to follow one of the stories. Participants report the words they heard by choosing from a list that also includes terms not present in either narrative. We define temporal and spectral overlaps using ideal binary masks (IBMs) as a gauge, and we analyze the correlation between the overlaps and the number of reported words. We observe a significant negative correlation when there is no task, while no correlation is detected when a task is involved. These results are well aligned with the simulation results of our computational top-down attention model.
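The abstract does not spell out how the overlaps are computed from the ideal binary masks, so the following is only a minimal sketch under stated assumptions. It uses the common textbook definition of an ideal binary mask (a time-frequency unit is 1 when the target's local energy exceeds the interferer's by a threshold, here 0 dB). The function names, the frame parameters, and the two overlap scores (share of interferer-dominated units per frame and per frequency bin) are hypothetical illustrations, not the authors' definitions.

```python
import numpy as np


def stft_power(x, frame_len=256, hop=128):
    """Power spectrogram (frequency bins x frames) from Hann-windowed frames.

    frame_len and hop are illustrative choices, not values from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop:i * hop + frame_len] * window for i in range(n_frames)],
        axis=1,
    )
    return np.abs(np.fft.rfft(frames, axis=0)) ** 2


def ideal_binary_mask(target_power, interferer_power, lc_db=0.0):
    """Return 1 where the target exceeds the interferer by more than lc_db."""
    eps = 1e-12  # avoids division by zero and log of zero in silent units
    snr_db = 10.0 * np.log10((target_power + eps) / (interferer_power + eps))
    return (snr_db > lc_db).astype(int)


def overlap_scores(mask):
    """Hypothetical overlap measures derived from the mask.

    temporal: per frame, share of frequency bins dominated by the interferer.
    spectral: per frequency bin, share of frames dominated by the interferer.
    """
    dominated = 1 - mask
    return dominated.mean(axis=0), dominated.mean(axis=1)


# Synthetic stand-ins for the two narratives: two pure tones at 8 kHz sampling.
fs = 8000
t = np.arange(fs) / fs
target = np.sin(2 * np.pi * 500 * t)
interferer = np.sin(2 * np.pi * 2000 * t)

mask = ideal_binary_mask(stft_power(target), stft_power(interferer))
temporal, spectral = overlap_scores(mask)
```

With per-stimulus overlap scores and per-stimulus counts of reported words in hand, the correlation analysis described in the abstract could then be run with a standard routine such as `numpy.corrcoef`; no experimental data are reproduced here.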
Keywords :
pattern recognition; speech synthesis; cocktail party problem; computational model; computational top-down attention model; ideal binary mask; speech intelligibility; speech synthesizer; top-down task drive attention; Correlation; Humans; Interference; Mathematical model; Noise; Speech; Time frequency analysis; cocktail party; speech; top-down attention
Conference_Title :
Machine Learning and Applications and Workshops (ICMLA), 2011 10th International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
978-1-4577-2134-2
DOI :
10.1109/ICMLA.2011.143