DocumentCode :
3594331
Title :
The relation between speech segment selectivity and source localization accuracy
Author :
Aarabi, Parham ; Mahdavi, Alborz
Author_Institution :
The Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, 10 Kings College Road, Ontario, Canada, M5S 3G4
Volume :
1
fYear :
2002
Abstract :
An experimental analysis of the relation between speech signal segment power and source direction-of-arrival (DOA) estimation accuracy is conducted. Speech from 10 different speakers, both male and female, totaling approximately 2 hours, is used to analyze the performance of the Phase Transform, Maximum Likelihood, and Unfiltered Cross Correlation time-delay estimation techniques. For female speakers, the Phase Transform technique is found to have a lower percentage of anomalies and a lower DOA root-mean-square error (RMSE). Conversely, for male speakers, the Unfiltered Cross Correlation technique is found to have a lower percentage of anomalies, although the Phase Transform has a lower DOA RMSE. The spatial distribution of the errors and the relation between speech segment power and the errors are also presented.
Keywords :
Artificial neural networks; Signal to noise ratio
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743707
Filename :
5743707