arxiv:2401.02031

Spy-Watermark: Robust Invisible Watermarking for Backdoor Attack

Published on Jan 4, 2024

Authors:

Abstract

Backdoor attack aims to deceive a victim model when facing backdoor instances while maintaining its performance on benign data. Current methods use manual patterns or special perturbations as triggers, while they often overlook the <PRE_TAG>robustness</POST_TAG> against data corruption, making backdoor attacks easy to defend in practice. To address this issue, we propose a novel backdoor attack method named Spy-Watermark, which remains effective when facing data collapse and backdoor defense. Therein, we introduce a learnable watermark embedded in the latent domain of images, serving as the trigger. Then, we search for a watermark that can withstand collapse during image decoding, cooperating with several anti-collapse operations to further enhance the resilience of our trigger against data corruption. Extensive experiments are conducted on CIFAR10, GTSRB, and ImageNet datasets, demonstrating that Spy-Watermark overtakes ten state-of-the-art methods in terms of <PRE_TAG>robustness</POST_TAG> and stealthiness.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

No model linking this paper

Cite arxiv.org/abs/2401.02031 in a model README.md to link it from this page.

No dataset linking this paper

Cite arxiv.org/abs/2401.02031 in a dataset README.md to link it from this page.

No Space linking this paper

Cite arxiv.org/abs/2401.02031 in a Space README.md to link it from this page.

No Collection including this paper

Add this paper to a collection to link it from this page.