SFW: Symmetric Fourier Watermarking

Abstract

Semantic watermarking techniques for latent diffusion models (LDMs) are robust against regeneration attacks, but often suffer from detection performance degradation due to the loss of frequency integrity. To tackle this problem, we propose a novel embedding method called Hermitian Symmetric Fourier Watermarking (SFW), which maintains frequency integrity by enforcing Hermitian symmetry. Additionally, we introduce a center-aware embedding strategy that reduces the vulnerability of semantic watermarking due to cropping attacks by ensuring robust information retention. To validate our approach, we apply these techniques to existing semantic watermarking schemes, enhancing their frequency-domain structures for better robustness and retrieval accuracy. Extensive experiments demonstrate that our methods achieve state-of-the-art verification and identification performance, surpassing previous approaches across various attack scenarios. Ablation studies confirm the impact of SFW on detection capabilities, the effectiveness of the center-aware embedding against cropping, and how message capacity influences identification accuracy. Notably, our method achieves the highest detection accuracy while maintaining superior image fidelity, as evidenced by FID and CLIP scores. Conclusively, our proposed SFW is shown to be an effective framework for balancing robustness and image fidelity, addressing the inherent trade-offs in semantic watermarking. Code available at github.com/thomas11809/SFWMark.

Motivation

The rise of latent diffusion models has made image generation widely accessible, but it also introduces challenges in content attribution and copyright. Semantic watermarking in the latent space offers a robust solution, especially due to its resilience against regeneration attacks, which often break pixel-level watermarks.
However, existing methods often discard the imaginary part in the Fourier domain, breaking the frequency integrity required to maintain the statistical structure of latent noise. In particular, this violates the Hermitian symmetry necessary for real-valued latent signals, leading to distorted frequency patterns that simultaneously weaken detection robustness and degrade generative quality.
To address this issue, we propose a new approach that preserves frequency integrity while enabling reliable and high-quality watermarking.

Missing imaginary patterns reveal frequency loss in baselines.

Our Approach

We introduce three key techniques to overcome the limitations of existing semantic watermarking methods.

Hermitian Symmetric Fourier Watermarking (SFW):
By enforcing Hermitian symmetry in the latent Fourier domain, our method preserves frequency integrity and fully utilizes both real and imaginary components. This leads to stronger detection performance without sacrificing image quality.
Center-Aware Embedding Strategy:
We embed watermarks only in the central region of the latent space, which remains stable under spatial transformations. This design greatly improves robustness against cropping attacks.
HSQR: Hermitian Symmetric QR Code:
We extend SFW to structured binary watermarks by splitting a QR code across the real and imaginary parts of the Fourier domain. This approach ensures high detection accuracy and message capacity, while preserving generative quality.

Additional Resources

End-to-end pipeline of semantic watermarking using the merged-in-generation scheme. Watermarks are embedded in the latent Fourier domain and later detected by analyzing the reconstructed latent query via DDIM inversion.

Taxonomy of watermarking methods categorized by message type (bitstream vs. pattern), embedding strategy (post-hoc vs. in-generation), and method family (classical vs. deep learning vs semantic watermarking). Semantic methods are further classified by their watermark patterns into Gaussian-based and structured encodings.

Processing time and detection performance of different watermarking methods. Merged-in-generation schemes require no additional time during generation, while the post-hoc based semantic method Zodiac incurs excessive processing time per image (7.36 m/img).

Normality assessment of latent distributions. By preserving Gaussianity through SFW, our method maintains the statistical structure of the latent space, enabling HSTR to achieve better generative quality than Tree-Ring despite using the same pattern structure.

BibTeX

@inproceedings{lee2025semantic, title={Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity}, author={Lee, Sung Ju and Cho, Nam Ik}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision}, pages={18759--18769}, year={2025} }

Semantic Watermarking Reinvented: Enhancing Robustness and Generation Quality with Fourier Integrity

Summary of watermarking performance across different semantic watermarking methods, following the merged-in-generation scheme with no additional processing time. The proposed approaches achieve the best balance between detection robustness and image fidelity.

Abstract

Motivation

Our Approach

Experiments

Detection Performance – Verification Task

Detection Performance – Identification Task

Generative Quality – Visual Comparison

Generative Quality – Quantitative Comparison

Ablation Study – Impact of SFW on Detection Performance

Ablation Study – Robustness to Cropping Attacks

Ablation Study – Impact of Capacity on Identification

Additional Resources

End-to-end pipeline of semantic watermarking using the merged-in-generation scheme. Watermarks are embedded in the latent Fourier domain and later detected by analyzing the reconstructed latent query via DDIM inversion.

Processing time and detection performance of different watermarking methods. Merged-in-generation schemes require no additional time during generation, while the post-hoc based semantic method Zodiac incurs excessive processing time per image (7.36 m/img).

Normality assessment of latent distributions. By preserving Gaussianity through SFW, our method maintains the statistical structure of the latent space, enabling HSTR to achieve better generative quality than Tree-Ring despite using the same pattern structure.

More Qualitative Results

BibTeX