• DocumentCode
    112269
  • Title

    Primary-Ambient Extraction Using Ambient Phase Estimation with a Sparsity Constraint

  • Author

    Jianjun He ; Woon-Seng Gan ; Ee-Leng Tan

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    22
  • Issue
    8
  • fYear
    2015
  • fDate
    Aug. 2015
  • Firstpage
    1127
  • Lastpage
    1131
  • Abstract
    Spatial audio reproduction addresses the growing commercial need to recreate an immersive listening experience of digital media content, such as movies and games. Primary-ambient extraction (PAE) is one of the key approaches to facilitate flexible and optimal rendering in spatial audio reproduction. Existing approaches, such as principal component analysis and time-frequency masking, often suffer from severe extraction error. This problem is more evident when the sound scene contains a relatively strong ambient component, which is frequently encountered in digital media. In this Letter, we propose a novel PAE approach by estimating the ambient phase with a sparsity constraint (APES). This approach exploits the equal magnitude of the uncorrelated ambient components in the two channels of a stereo signal and reformulates the PAE problem as an ambient phase estimation problem, which is then solved using the criterion that the primary component is sparse. Our experimental results demonstrate that the proposed approach significantly outperforms existing approaches, especially when the ambient component is relatively strong.
  • Keywords
    audio coding; feature extraction; phase estimation; principal component analysis; APES; PAE; ambient phase estimation; digital media content; primary-ambient extraction; principal component analysis; sparsity constraint; spatial audio reproduction; stereo signal; time-frequency masking; Data mining; Indexes; Media; Motion pictures; Phase estimation; Rendering (computer graphics); Time-frequency analysis; Ambient phase; primary-ambient extraction (PAE); sparsity; spatial audio;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2014.2387021
  • Filename
    7000561