ARFON : Adversarial Robustness of Foundation Models

Date 07/03/2025 - 02/11/2025
Type Device & System Security, Machine Learning
Partner armasuisse
Partner contact Gérôme Bovet
EPFL Laboratory Signal Processing Laboratory

State-of-the-art architectures in many software applications and critical infrastructures are based on deep learning models. These models have been shown to be quite vulnerable to very small and carefully crafted perturbations, which pose fundamental questions in terms of safety, security, or performance guarantees at large. Several defense mechanisms have been developed in the last years to make models more robust against data perturbations or targeted attacks, with the best one being adversarial training, in which the model is fine-tuned by adding properly modified samples in the training set.