TY - GEN
T1 - 1-Diffractor
T2 - 10th ACM International Workshop on Security and Privacy Analytics, IWSPA 2024
AU - Meisenbacher, Stephen
AU - Chevli, Maulik
AU - Matthes, Florian
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/6/21
Y1 - 2024/6/21
N2 - The study of privacy-preserving Natural Language Processing (NLP) has gained rising attention in recent years. One promising avenue studies the integration of Differential Privacy in NLP, which has brought about innovative methods in a variety of application settings. Of particular note areword-level Metric Local Differential Privacy (MLDP) mechanisms, which work to obfuscate potentially sensitive input text by performing word-by-wordperturbations. Although these methods have shown promising results in empirical tests, there are two major drawbacks: (1) the inevitable loss of utility due to addition of noise, and (2) the computational expensiveness of running these mechanisms on high-dimensional word embeddings. In this work, we aim to address these challenges by proposing 1-Diffractor, a new mechanism that boasts high speedups in comparison to previous mechanisms, while still demonstrating strong utility-and privacy-preserving capabilities. We evaluate 1-Diffractor for utility on several NLP tasks, for theoretical and task-based privacy, and for efficiency in terms of speed and memory. 1-Diffractor shows significant improvements in efficiency, while still maintaining competitive utility and privacy scores across all conducted comparative tests against previous MLDP mechanisms. Our code is made available at: https://github.com/sjmeis/Diffractor.
AB - The study of privacy-preserving Natural Language Processing (NLP) has gained rising attention in recent years. One promising avenue studies the integration of Differential Privacy in NLP, which has brought about innovative methods in a variety of application settings. Of particular note areword-level Metric Local Differential Privacy (MLDP) mechanisms, which work to obfuscate potentially sensitive input text by performing word-by-wordperturbations. Although these methods have shown promising results in empirical tests, there are two major drawbacks: (1) the inevitable loss of utility due to addition of noise, and (2) the computational expensiveness of running these mechanisms on high-dimensional word embeddings. In this work, we aim to address these challenges by proposing 1-Diffractor, a new mechanism that boasts high speedups in comparison to previous mechanisms, while still demonstrating strong utility-and privacy-preserving capabilities. We evaluate 1-Diffractor for utility on several NLP tasks, for theoretical and task-based privacy, and for efficiency in terms of speed and memory. 1-Diffractor shows significant improvements in efficiency, while still maintaining competitive utility and privacy scores across all conducted comparative tests against previous MLDP mechanisms. Our code is made available at: https://github.com/sjmeis/Diffractor.
KW - data privacy
KW - differential privacy
KW - natural language processing
UR - http://www.scopus.com/inward/record.url?scp=85197534271&partnerID=8YFLogxK
U2 - 10.1145/3643651.3659896
DO - 10.1145/3643651.3659896
M3 - Conference contribution
AN - SCOPUS:85197534271
T3 - IWSPA 2024 - Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics
SP - 23
EP - 33
BT - IWSPA 2024 - Proceedings of the 10th ACM International Workshop on Security and Privacy Analytics
PB - Association for Computing Machinery, Inc
Y2 - 21 June 2024
ER -