Enhancing the Utility of Privacy-Preserving Cancer Classification Using Synthetic Data

Richard Osuala, Daniel M. Lang, Anneliese Riess, Georgios Kaissis, Zuzanna Szafranowska, Grzegorz Skorupko, Oliver Diaz, Julia A. Schnabel, Karim Lekadir

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep learning holds immense promise for aiding radiologists in breast cancer detection. However, achieving optimal model performance is hampered by limitations in availability and sharing of data commonly associated to patient privacy concerns. Such concerns are further exacerbated, as traditional deep learning models can inadvertently leak sensitive training information. This work addresses these challenges exploring and quantifying the utility of privacy-preserving deep learning techniques, concretely, (i) differentially private stochastic gradient descent (DP-SGD) and (ii) fully synthetic training data generated by our proposed malignancy-conditioned generative adversarial network. We assess these methods via downstream malignancy classification of mammography masses using a transformer model. Our experimental results depict that synthetic data augmentation can improve privacy-utility tradeoffs in differentially private model training. Further, model pretraining on synthetic data achieves remarkable performance, which can be further increased with DP-SGD fine-tuning across all privacy guarantees. With this first in-depth exploration of privacy-preserving deep learning in breast imaging, we address current and emerging clinical privacy requirements and pave the way towards the adoption of private high-utility deep diagnostic models. Our reproducible codebase is publicly available at https://github.com/RichardObi/mammo_dp.

Original languageEnglish
Title of host publicationArtificial Intelligence and Imaging for Diagnostic and Treatment Challenges in Breast Care - 1st Deep Breast Workshop, Deep-Breath 2024, Held in Conjunction with MICCAI 2024, Proceedings
EditorsRitse M. Mann, Tianyu Zhang, Luyi Han, Geert Litjens, Tao Tan, Danial Truhn, Shuo Li, Yuan Gao, Shannon Doyle, Robert Martí Marly, Jakob Nikolas Kather, Katja Pinker-Domenig, Shandong Wu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages54-64
Number of pages11
ISBN (Print)9783031777882
DOIs
StatePublished - 2025
Event1st Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, Deep-Breath 2024 - Marrakesh, Morocco
Duration: 10 Oct 202410 Oct 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15451 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Deep Breast Workshop on AI and Imaging for Diagnostic and Treatment Challenges in Breast Care, Deep-Breath 2024
Country/TerritoryMorocco
CityMarrakesh
Period10/10/2410/10/24

Keywords

  • Breast Imaging
  • Differential Privacy
  • Generative Models

Fingerprint

Dive into the research topics of 'Enhancing the Utility of Privacy-Preserving Cancer Classification Using Synthetic Data'. Together they form a unique fingerprint.

Cite this