• DocumentCode
    1755340
  • Title

    Intelligent Acoustic Interfaces With Multisensor Acquisition for Immersive Reproduction

  • Author

    Comminiello, Danilo ; Cecchi, Stefania ; Scarpiniti, Michele ; Gasparini, Michele ; Romoli, Laura ; Piazza, Francesco ; Uncini, Aurelio

  • Author_Institution
    Dept. of Inf. Eng., Electron. & Telecommun., “Sapienza” Univ. of Rome, Rome, Italy
  • Volume
    17
  • Issue
    8
  • fYear
    2015
  • fDate
    Aug. 2015
  • Firstpage
    1262
  • Lastpage
    1272
  • Abstract
    Immersive speech communication systems have been gaining increasing attention due to their ability to reproduce enhanced acoustic images, and thus achieving good performance in terms of sound quality and accuracy. In this context , a fundamental role is played by intelligent acoustic interfaces (IAIs), which aim at acquiring and/or reproducing desired acoustic information with enhanced perception. The recent widespread availability of multimedia devices, equipped with different kind of sensors, has broadened the range of data processing methods, thus giving a chance for developing advanced IAIs. In this paper, we propose an immersive communication system composed of two IAIs: the first one exploits microphones and cameras, together with a signal processing system, to reduce unwanted noise and enhance the speech quality of the desired information in the transmitting room; the second one is an advanced reproduction system based on a loudspeaker array and on an effective wave field synthesis technique capable of reproducing the spatial perception of the desired speech source in the receiving room. The whole system has been assessed in simulated and real immersive communication scenarios: objective and subjective evaluations have been shown the effectiveness of the proposed system.
  • Keywords
    acoustic signal processing; sensor fusion; signal denoising; speech enhancement; voice communication; IAI; cameras; immersive communication system; immersive reproduction; immersive speech communication systems; intelligent acoustic interfaces; loudspeaker array; microphones; multisensor acquisition; signal processing system; speech quality enhancement; speech source spatial perception reproduction; unwanted noise reduction; wave field synthesis technique; Acoustics; Arrays; Cameras; Microphones; Sensors; Speech; Immersive communication; intelligent acoustic interface; kinect sensor; multichannel reproduction; multimodal interaction; noise reduction;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2015.2442151
  • Filename
    7118180