DocumentCode
2934456
Title
Speech control in surgery: A field analysis and strategies
Author
Schuller, Björn ; Can, Salman ; Feussner, Hubertus ; Wöllmer, Martin ; Arsić, Dejan ; Hörnler, Benedikt
Author_Institution
Inst. for Human-Machine Commun., Germany
fYear
2009
fDate
June 28 2009-July 3 2009
Firstpage
1214
Lastpage
1217
Abstract
This work introduces a robot-driven camera controlled by speech. The SIMIS database of 20 recordings of real-life surgical operations serves as the basis for analyses and noise modelling. To overcome low recognition performance due to high noise levels during operations, the vocabulary was kept highly limited and multiple noise reduction methods were investigated. We show that feature enhancement techniques, such as Histogram Equalization (HEQ) or a Switching Linear Dynamic Model (SLDM) capturing the dynamics of speech, markedly improve recognition accuracy. Under a severe usage condition in which all occurring noise types are present, the mean accuracy can be raised from 89.67% to 91.16% with SLDM, and to 95.50% with HEQ enhancement.
Keywords
cameras; interference suppression; medical robotics; noise abatement; robot vision; speech enhancement; speech recognition; surgery; SIMIS database; feature enhancement techniques; histogram equalization; multiple noise reduction methods; real life surgical operations; robot driven camera; speech control; surgery; switching linear dynamic model; Cameras; Databases; Histograms; Noise level; Noise reduction; Robot control; Robot vision systems; Speech analysis; Surgery; Vocabulary; Acoustic noise; Biomedical equipment safety; Robustness; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
Conference_Location
New York, NY
ISSN
1945-7871
Print_ISBN
978-1-4244-4290-4
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2009.5202719
Filename
5202719