• DocumentCode
    1757011
  • Title

    Robust Segments Detector for De-Synchronization Resilient Audio Watermarking

  • Author

    Chi-Man Pun ; Xiao-Chen Yuan

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Univ. of Macau, Macau, China
  • Volume
    21
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    2412
  • Lastpage
    2424
  • Abstract
    A robust feature points detector for invariant audio watermarking is proposed in this paper. The audio segments centering at the detected feature points are extracted for both watermark embedding and extraction. These feature points are invariant to various attacks and will not be changed much for maintaining high auditory quality. Besides, high robustness and inaudibility can be achieved by embedding the watermark into the approximation coefficients of Stationary Wavelet Transform (SWT) domain, which is shift invariant. The spread spectrum communication technique is adopted to embed the watermark. Experimental results show that the proposed Robust Audio Segments Extractor (RASE) and the watermarking scheme are not only robust against common audio signal processing, such as low-pass filtering, MP3 compression, echo addition, volume change, and normalization; and distortions introduced in Stir-mark benchmark for Audio; but also robust against synchronization geometric distortions simultaneously, such as resample time-scale modification (TSM) with scaling factors up to ±50%, pitch invariant TSM by ±50%, and tempo invariant pitch shifting by ±50%. In general, the proposed scheme can well resist various attacks by the joint RASE and SWT approach, which performs much better comparing with the existing state-of-the art methods.
  • Keywords
    audio signal processing; audio watermarking; data compression; low-pass filters; spread spectrum communication; synchronisation; wavelet transforms; MP3 compression; RASE; SWT domain; Stir-mark benchmark; approximation coefficient; audio segment extraction; audio signal processing; auditory quality; de-synchronization resilient audio watermarking; distortions; echo addition; invariant audio watermarking; joint RASE-SWT approach; low-pass filtering; normalization; pitch invariant TSM; resample TSM; resample time-scale modification; robust audio segment extractor; robust feature point detector; robust segment detector; spread spectrum communication technique; stationary wavelet transform domain; synchronization geometric distortions; tempo invariant pitch shifting; volume change; watermark embedding; watermark extraction; Digital audio players; Distortion; Feature extraction; Robustness; Synchronization; Watermarking; Robust audio segments extractor (RASE); pitch shifting; stationary wavelet transform (SWT); synchronization geometric distortions; time-scale modification (TSM);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2279312
  • Filename
    6583994