We introduce SLIMP (Skin Lesion Image-Metadata Pre-training) for learningrich representations of skin lesions through a novel nested contrastivelearning approach that captures complex relationships between images andmetadata. Melanoma detection and skin lesion classification based solely onimages, pose significant challenges due to large variations in imagingconditions (lighting, color, resolution, distance, etc.) and lack of clinicaland phenotypical context. Clinicians typically follow a holistic approach forassessing the risk level of the patient and for deciding which lesions may bemalignant and need to be excised, by considering the patient's medical historyas well as the appearance of other lesions of the patient. Inspired by this,SLIMP combines the appearance and the metadata of individual skin lesions withpatient-level metadata relating to their medical record and other clinicallyrelevant information. By fully exploiting all available data modalitiesthroughout the learning process, the proposed pre-training strategy improvesperformance compared to other pre-training strategies on downstream skinlesions classification tasks highlighting the learned representations quality.

Skin Lesion Phenotyping via Nested Multi-modal Contrastive Learning

Valsamis Ntouskos;
2025-01-01

Abstract

We introduce SLIMP (Skin Lesion Image-Metadata Pre-training) for learningrich representations of skin lesions through a novel nested contrastivelearning approach that captures complex relationships between images andmetadata. Melanoma detection and skin lesion classification based solely onimages, pose significant challenges due to large variations in imagingconditions (lighting, color, resolution, distance, etc.) and lack of clinicaland phenotypical context. Clinicians typically follow a holistic approach forassessing the risk level of the patient and for deciding which lesions may bemalignant and need to be excised, by considering the patient's medical historyas well as the appearance of other lesions of the patient. Inspired by this,SLIMP combines the appearance and the metadata of individual skin lesions withpatient-level metadata relating to their medical record and other clinicallyrelevant information. By fully exploiting all available data modalitiesthroughout the learning process, the proposed pre-training strategy improvesperformance compared to other pre-training strategies on downstream skinlesions classification tasks highlighting the learned representations quality.
2025
Computer Science - Computer Vision and Pattern Recognition
Computer Science - Artificial Intelligence
Computer Science - Learning
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12606/26692
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
social impact