Generating multimedia contents, particularly images, by using AI-based tool is becoming an everyday practice generally just for fair applications but always more for malevolent aims such as disinformation, defamation and blaming. Reliably detecting synthetic generated pictures is becoming crucial. However, new generative models are emerging at a much faster pace than the development of accurate deepfake detectors. According to this, it is fundamental to improve generalization capabilities in order to preserve performance in front of content created by unknown and new-emerging generation methods. This paper investigates in this direction proposing the idea to select a reduced set of CLIP-based features by resorting to a sparse autoencoder (SAE); such a set should ideally be termed the features of fake and could provide an improved generalization capacity. Experimental results carried out on extended datasets highlight this notable behavior and diversely designed detectors, based on this approach, achieve state-of-the-art performances.

CLIP Feature Selection Mechanism via Sparse Autoencoder for Generalized Deepfake Detection

Caldelli R.;
2025-01-01

Abstract

Generating multimedia contents, particularly images, by using AI-based tool is becoming an everyday practice generally just for fair applications but always more for malevolent aims such as disinformation, defamation and blaming. Reliably detecting synthetic generated pictures is becoming crucial. However, new generative models are emerging at a much faster pace than the development of accurate deepfake detectors. According to this, it is fundamental to improve generalization capabilities in order to preserve performance in front of content created by unknown and new-emerging generation methods. This paper investigates in this direction proposing the idea to select a reduced set of CLIP-based features by resorting to a sparse autoencoder (SAE); such a set should ideally be termed the features of fake and could provide an improved generalization capacity. Experimental results carried out on extended datasets highlight this notable behavior and diversely designed detectors, based on this approach, achieve state-of-the-art performances.
2025
CLIP
Deepfake detection
feature selection
generalization
mutual information
sparse autoencoder
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12606/46426
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
social impact