Raising the Bar of AI-generated Image Detection with CLIP

Davide Cozzolino, Giovanni Poggi, Riccardo Corvi, Matthias Nießner, Luisa Verdoliva

Publikation: Beitrag in Buch/Bericht/KonferenzbandKonferenzbeitragBegutachtung

11 Zitate (Scopus)

Abstract

The aim of this work is to explore the potential of pre-trained vision-language models (VLMs) for universal detection of AI-generated images. We develop a lightweight detection strategy based on CLIP features and study its performance in a wide variety of challenging scenarios. We find that, contrary to previous beliefs, it is neither necessary nor convenient to use a large domain-specific dataset for training. On the contrary, by using only a handful of example images from a single generative model, a CLIP-based detector exhibits surprising generalization ability and high robustness across different architectures, including recent commercial tools such as Dalle-3, Midjourney v5, and Firefly. We match the state-of-the-art (SoTA) on in-distribution data and significantly improve upon it in terms of generalization to out-of-distribution data (+6% AUC) and robustness to impaired/laundered data (+13%). Our project is available at https://grip-unina.github.io/ClipBased-SyntheticImageDetection/

OriginalspracheEnglisch
TitelProceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Herausgeber (Verlag)IEEE Computer Society
Seiten4356-4366
Seitenumfang11
ISBN (elektronisch)9798350365474
DOIs
PublikationsstatusVeröffentlicht - 2024
Veranstaltung2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024 - Seattle, USA/Vereinigte Staaten
Dauer: 16 Juni 202422 Juni 2024

Publikationsreihe

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
ISSN (Print)2160-7508
ISSN (elektronisch)2160-7516

Konferenz

Konferenz2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2024
Land/GebietUSA/Vereinigte Staaten
OrtSeattle
Zeitraum16/06/2422/06/24

Fingerprint

Untersuchen Sie die Forschungsthemen von „Raising the Bar of AI-generated Image Detection with CLIP“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren