Privacy-preserving and distributed processing of public data in hybrid trust networks

Date 02/07/2025 - 21/11/2025
Type Machine Learning
Partner armasuisse
Partner contact Gérôme Bovet
EPFL Laboratory Scalable Computing Systems Laboratory

One of the increasingly popular paradigms for managing the growing size and complexity of modern ML models is the adoption of collaborative and decentralized approaches. While this has enabled new possibilities in privacy-preserving and scalable frameworks for distributed data analytics and model training over large-scale real-world models, current approaches often assume a uniform trust-levels among participating nodes and emphasise on the privatization of the data locally held by each node. These assumptions overlook realistic scenarios involving varying degrees of trust and differing privacy requirements between nodes. In real-world deployments, it is common for noes in a network to partially use public datasets to perform analytics or train models tailored to their specific needs.