TY - GEN
T1 - Equivariant Differentially Private Deep Learning
T2 - 16th ACM Workshop on Artificial Intelligence and Security, AISec 2023, co-located with CCS 2023
AU - Hölzl, Florian A.
AU - Rueckert, Daniel
AU - Kaissis, Georgios
N1 - Publisher Copyright:
© 2023 Owner/Author.
PY - 2023/11/30
Y1 - 2023/11/30
N2 - Differentially Private Stochastic Gradient Descent (DP-SGD) limits the amount of private information deep learning models can memorize during training. This is achieved by clipping and adding noise to the model's gradients, and thus networks with more parameters require proportionally stronger perturbation. As a result, large models have difficulties learning useful information, rendering training with DP-SGD exceedingly difficult on more challenging training tasks. Recent research has focused on combating this challenge through training adaptations such as heavy data augmentation and large batch sizes. However, these techniques further increase the computational overhead of DP-SGD and reduce its practical applicability. In this work, we propose using the principle of sparse model design to solve precisely such complex tasks with fewer parameters, higher accuracy, and in less time, thus serving as a promising direction for DP-SGD. We achieve such sparsity by design by introducing equivariant convolutional networks for model training with Differential Privacy. Using equivariant networks, we show that small and efficient architecture design can outperform current state-of-The-Art with substantially lower computational requirements. On CIFAR-10, we achieve an increase of up to 9% in accuracy while reducing the computation time by more than 85%. Our results are a step towards efficient model architectures that make optimal use of their parameters and bridge the privacy-utility gap between private and non-private deep learning for computer vision.
AB - Differentially Private Stochastic Gradient Descent (DP-SGD) limits the amount of private information deep learning models can memorize during training. This is achieved by clipping and adding noise to the model's gradients, and thus networks with more parameters require proportionally stronger perturbation. As a result, large models have difficulties learning useful information, rendering training with DP-SGD exceedingly difficult on more challenging training tasks. Recent research has focused on combating this challenge through training adaptations such as heavy data augmentation and large batch sizes. However, these techniques further increase the computational overhead of DP-SGD and reduce its practical applicability. In this work, we propose using the principle of sparse model design to solve precisely such complex tasks with fewer parameters, higher accuracy, and in less time, thus serving as a promising direction for DP-SGD. We achieve such sparsity by design by introducing equivariant convolutional networks for model training with Differential Privacy. Using equivariant networks, we show that small and efficient architecture design can outperform current state-of-The-Art with substantially lower computational requirements. On CIFAR-10, we achieve an increase of up to 9% in accuracy while reducing the computation time by more than 85%. Our results are a step towards efficient model architectures that make optimal use of their parameters and bridge the privacy-utility gap between private and non-private deep learning for computer vision.
KW - designed sparsity
KW - differential privacy
KW - equivariant convolutions
KW - image classification
UR - http://www.scopus.com/inward/record.url?scp=85179587341&partnerID=8YFLogxK
U2 - 10.1145/3605764.3623902
DO - 10.1145/3605764.3623902
M3 - Conference contribution
AN - SCOPUS:85179587341
T3 - AISec 2023 - Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security
SP - 11
EP - 22
BT - AISec 2023 - Proceedings of the 16th ACM Workshop on Artificial Intelligence and Security
PB - Association for Computing Machinery, Inc
Y2 - 30 November 2023
ER -