MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder (original) (raw)

Authors

DOI:

https://doi.org/10.1609/aaai.v39i6.32606

Abstract

Image forgeries can entirely change the semantic information of an image, and can be used for unscrupulous purposes. In this paper, we propose a novel image forgery localization network named as MUN, which consists of an M^3 encoder and a UN decoder. Firstly, the M^3 encoder is constructed based on a Multi-scale Max-pooling query module to extract Multi-clue forged features. Noiseprint++ is adopted to assist the RGB clue, and its deployment methodology is discussed. A Multi-scale Max-pooling Query (MMQ) module is proposed to integrate RGB and noise features. Secondly, a novel UN decoder is proposed to extract hierarchical features from both top-down and bottom-up directions, reconstructing both high-level and low-level features at the same time. Thirdly, we formulate an IoU-recalibrated Dynamic Cross-Entropy (IoUDCE) loss to dynamically adjust the weights on forged regions according to IoU which can adaptively balance the influence of authentic and forged regions. Last but not least, we propose a data augmentation method, i.e., Deviation Noise Augmentation (DNA), which acquires accessible prior knowledge of RGB distribution to improve the generalization ability. Extensive experiments on publicly available datasets show that MUN outperforms the state-of-the-art works.

AAAI-25 / IAAI-25 / EAAI-25 Proceedings Cover

How to Cite

Liu, Y., Chen, S., Shi, H., Zhang, X.-Y., Xiao, S., & Cai, Q. (2025). MUN: Image Forgery Localization Based on M³ Encoder and UN Decoder. Proceedings of the AAAI Conference on Artificial Intelligence, 39(6), 5685-5693. https://doi.org/10.1609/aaai.v39i6.32606

Issue

Section

AAAI Technical Track on Computer Vision V