Skip to main navigation Skip to search Skip to main content

1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy

  • Technical University of Munich

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

The study of privacy-preserving Natural Language Processing (NLP) has gained rising attention in recent years. One promising avenue studies the integration of Differential Privacy in NLP, which has brought about innovative methods in a variety of application settings. Of particular note areword-level Metric Local Differential Privacy (MLDP) mechanisms, which work to obfuscate potentially sensitive input text by performing word-by-wordperturbations. Although these methods have shown promising results in empirical tests, there are two major drawbacks: (1) the inevitable loss of utility due to addition of noise, and (2) the computational expensiveness of running these mechanisms on high-dimensional word embeddings. In this work, we aim to address these challenges by proposing 1-Diffractor, a new mechanism that boasts high speedups in comparison to previous mechanisms, while still demonstrating strong utility-and privacy-preserving capabilities. We evaluate 1-Diffractor for utility on several NLP tasks, for theoretical and task-based privacy, and for efficiency in terms of speed and memory. 1-Diffractor shows significant improvements in efficiency, while still maintaining competitive utility and privacy scores across all conducted comparative tests against previous MLDP mechanisms. Our code is made available at: https://github.com/sjmeis/Diffractor.

Original languageEnglish
Title of host publicationIWSPA 2024 - Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics
PublisherAssociation for Computing Machinery, Inc
Pages23-33
Number of pages11
ISBN (Electronic)9798400705557
DOIs
StatePublished - 21 Jun 2024
Event10th ACM International Workshop on Security and Privacy Analytics, IWSPA 2024 - Porto, Portugal
Duration: 21 Jun 2024 → …

Publication series

NameIWSPA 2024 - Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics

Conference

Conference10th ACM International Workshop on Security and Privacy Analytics, IWSPA 2024
Country/TerritoryPortugal
CityPorto
Period21/06/24 → …

Keywords

  • data privacy
  • differential privacy
  • natural language processing

Fingerprint

Dive into the research topics of '1-Diffractor: Efficient and Utility-Preserving Text Obfuscation Leveraging Word-Level Metric Differential Privacy'. Together they form a unique fingerprint.

Cite this