• DocumentCode
    2934456
  • Title

    Speech control in surgery: A field analysis and strategies

  • Author

    Schuller, Björn ; Can, Salman ; Feussner, Hubertus ; Wöllmer, Martin ; Arisc, Dejan ; Hörnler, Benedikt

  • Author_Institution
    Inst. for Human-Machine Commun., Germany
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    1214
  • Lastpage
    1217
  • Abstract
    This work introduces a robot driven camera controlled by speech. The SIMIS database of 20 recordings of real life surgical operations serves as basis for analyses and noise modelling. To overcome low recognition performance due to high noise levels during operations, the vocabulary was chosen to be highly limited and multiple noise reduction methods have been investigated. We show that the use of feature enhancement techniques, such as Histogram Equalization or a Switching Linear Dynamic Model capturing the dynamics of speech show a remarkable improvement in recognition accuracy. Considering a severe condition of usage of the recognition system with all appearing noise types, the mean accuracy can be raised from 89.67% to 91.16% with SLDM, and to 95.50% with HEQ enhancement.
  • Keywords
    cameras; interference suppression; medical robotics; noise abatement; robot vision; speech enhancement; speech recognition; surgery; SIMIS database; feature enhancement techniques; histogram equalization; multiple noise reduction methods; real life surgical operations; robot driven camera; speech control; surgery; switching linear dynamic model; Cameras; Databases; Histograms; Noise level; Noise reduction; Robot control; Robot vision systems; Speech analysis; Surgery; Vocabulary; Acoustic noise; Biomedical equipment safety; Robustness; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202719
  • Filename
    5202719