CLIP Feature Selection Mechanism via Sparse Autoencoder for Generalized Deepfake Detection

IRIS

Generating multimedia contents, particularly images, by using AI-based tool is becoming an everyday practice generally just for fair applications but always more for malevolent aims such as disinformation, defamation and blaming. Reliably detecting synthetic generated pictures is becoming crucial. However, new generative models are emerging at a much faster pace than the development of accurate deepfake detectors. According to this, it is fundamental to improve generalization capabilities in order to preserve performance in front of content created by unknown and new-emerging generation methods. This paper investigates in this direction proposing the idea to select a reduced set of CLIP-based features by resorting to a sparse autoencoder (SAE); such a set should ideally be termed the features of fake and could provide an improved generalization capacity. Experimental results carried out on extended datasets highlight this notable behavior and diversely designed detectors, based on this approach, achieve state-of-the-art performances.

CLIP Feature Selection Mechanism via Sparse Autoencoder for Generalized Deepfake Detection

Berlincioni L.;Ciamarra A.;Pala P.;Caldelli R.;Del Bimbo A.

2025-01-01

Abstract

Generating multimedia contents, particularly images, by using AI-based tool is becoming an everyday practice generally just for fair applications but always more for malevolent aims such as disinformation, defamation and blaming. Reliably detecting synthetic generated pictures is becoming crucial. However, new generative models are emerging at a much faster pace than the development of accurate deepfake detectors. According to this, it is fundamental to improve generalization capabilities in order to preserve performance in front of content created by unknown and new-emerging generation methods. This paper investigates in this direction proposing the idea to select a reduced set of CLIP-based features by resorting to a sparse autoencoder (SAE); such a set should ideally be termed the features of fake and could provide an improved generalization capacity. Experimental results carried out on extended datasets highlight this notable behavior and diversely designed detectors, based on this approach, achieve state-of-the-art performances.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Parole chiave
	
				CLIP
Deepfake detection
feature selection
generalization
mutual information
sparse autoencoder
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12606/46426

Citazioni

ND

0

social impact