Towards Non-adversarial Algorithmic Recourse

Tobias Leemann, Martin Pawelczyk, Bardh Prenkaj, Gjergji Kasneci

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The streams of research on adversarial examples and counterfactual explanations have largely been growing independently. This has led to several recent works trying to elucidate their similarities and differences. Most prominently, it has been argued that adversarial examples, as opposed to counterfactual explanations, have a unique characteristic in that they lead to a misclassification compared to the ground truth. However, the computational goals and methodologies employed in existing counterfactual explanation and adversarial example generation methods often lack alignment with this requirement. Using formal definitions of adversarial examples and counterfactual explanations, we introduce non-adversarial algorithmic recourse and outline why in high-stakes situations, it is imperative to obtain counterfactual explanations that do not exhibit adversarial characteristics. We subsequently investigate how different components in the objective functions, e.g., the machine learning model or cost function used to measure distance, determine whether the outcome can be considered an adversarial example or not. Our experiments on common datasets highlight that these design choices are often more critical in deciding whether recourse is non-adversarial than whether recourse or attack algorithms are used. Furthermore, we show that choosing a robust and accurate machine learning model results in less adversarial recourse desired in practice.

Original languageEnglish
Title of host publicationExplainable Artificial Intelligence - Second World Conference, xAI 2024, Proceedings
EditorsLuca Longo, Sebastian Lapuschkin, Christin Seifert
PublisherSpringer Science and Business Media Deutschland GmbH
Pages395-419
Number of pages25
ISBN (Print)9783031637995
DOIs
StatePublished - 2024
Event2nd World Conference on Explainable Artificial Intelligence, xAI 2024 - Valletta, Malta
Duration: 17 Jul 202419 Jul 2024

Publication series

NameCommunications in Computer and Information Science
Volume2155 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference2nd World Conference on Explainable Artificial Intelligence, xAI 2024
Country/TerritoryMalta
CityValletta
Period17/07/2419/07/24

Keywords

  • Adversarials
  • Algorithmic Recourse
  • Counterfactuals

Fingerprint

Dive into the research topics of 'Towards Non-adversarial Algorithmic Recourse'. Together they form a unique fingerprint.

Cite this