TY - GEN
T1 - Enhancing the Utility of Privacy-Preserving Cancer Classification Using Synthetic Data
AU - Osuala, Richard
AU - Lang, Daniel M.
AU - Riess, Anneliese
AU - Kaissis, Georgios
AU - Szafranowska, Zuzanna
AU - Skorupko, Grzegorz
AU - Diaz, Oliver
AU - Schnabel, Julia A.
AU - Lekadir, Karim
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - Deep learning holds immense promise for aiding radiologists in breast cancer detection. However, achieving optimal model performance is hampered by limitations in availability and sharing of data commonly associated to patient privacy concerns. Such concerns are further exacerbated, as traditional deep learning models can inadvertently leak sensitive training information. This work addresses these challenges exploring and quantifying the utility of privacy-preserving deep learning techniques, concretely, (i) differentially private stochastic gradient descent (DP-SGD) and (ii) fully synthetic training data generated by our proposed malignancy-conditioned generative adversarial network. We assess these methods via downstream malignancy classification of mammography masses using a transformer model. Our experimental results depict that synthetic data augmentation can improve privacy-utility tradeoffs in differentially private model training. Further, model pretraining on synthetic data achieves remarkable performance, which can be further increased with DP-SGD fine-tuning across all privacy guarantees. With this first in-depth exploration of privacy-preserving deep learning in breast imaging, we address current and emerging clinical privacy requirements and pave the way towards the adoption of private high-utility deep diagnostic models. Our reproducible codebase is publicly available at https://github.com/RichardObi/mammo_dp.
AB - Deep learning holds immense promise for aiding radiologists in breast cancer detection. However, achieving optimal model performance is hampered by limitations in availability and sharing of data commonly associated to patient privacy concerns. Such concerns are further exacerbated, as traditional deep learning models can inadvertently leak sensitive training information. This work addresses these challenges exploring and quantifying the utility of privacy-preserving deep learning techniques, concretely, (i) differentially private stochastic gradient descent (DP-SGD) and (ii) fully synthetic training data generated by our proposed malignancy-conditioned generative adversarial network. We assess these methods via downstream malignancy classification of mammography masses using a transformer model. Our experimental results depict that synthetic data augmentation can improve privacy-utility tradeoffs in differentially private model training. Further, model pretraining on synthetic data achieves remarkable performance, which can be further increased with DP-SGD fine-tuning across all privacy guarantees. With this first in-depth exploration of privacy-preserving deep learning in breast imaging, we address current and emerging clinical privacy requirements and pave the way towards the adoption of private high-utility deep diagnostic models. Our reproducible codebase is publicly available at https://github.com/RichardObi/mammo_dp.
KW - Breast Imaging
KW - Differential Privacy
KW - Generative Models
UR - http://www.scopus.com/inward/record.url?scp=85219207908&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-77789-9_6
DO - 10.1007/978-3-031-77789-9_6
M3 - Conference contribution
AN - SCOPUS:85219207908
SN - 9783031777882
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 54
EP - 64
BT - Artificial Intelligence and Imaging for Diagnostic and Treatment Challenges in Breast Care - 1st Deep Breast Workshop, Deep-Breath 2024, Held in Conjunction with MICCAI 2024, Proceedings
A2 - Mann, Ritse M.
A2 - Zhang, Tianyu
A2 - Han, Luyi
A2 - Litjens, Geert
A2 - Tan, Tao
A2 - Truhn, Danial
A2 - Li, Shuo
A2 - Gao, Yuan
A2 - Doyle, Shannon
A2 - Martí Marly, Robert
A2 - Kather, Jakob Nikolas
A2 - Pinker-Domenig, Katja
A2 - Wu, Shandong
PB - Springer Science and Business Media Deutschland GmbH
T2 - 1st Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, Deep-Breath 2024
Y2 - 10 October 2024 through 10 October 2024
ER -