CausalCLIPSeg: Unlocking CLIP’s Potential in Referring Medical Image Segmentation with Causal Intervention

  • Yaxiong Chen
  • , Minghong Wei
  • , Zixuan Zheng
  • , Jingliang Hu
  • , Yilei Shi
  • , Shengwu Xiong
  • , Xiao Xiang Zhu
  • , Lichao Mou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Referring medical image segmentation targets delineating lesions indicated by textual descriptions. Aligning visual and textual cues is challenging due to their distinct data properties. Inspired by largescale pre-trained vision-language models, we propose CausalCLIPSeg, an end-to-end framework for referring medical image segmentation that leverages CLIP. Despite not being trained on medical data, we enforce CLIP’s rich semantic space onto the medical domain by a tailored crossmodal decoding method to achieve text-to-pixel alignment. Furthermore, to mitigate confounding bias that may cause the model to learn spurious correlations instead of meaningful causal relationships, CausalCLIPSeg introduces a causal intervention module which self-annotates confounders and excavates causal features from inputs for segmentation judgments. We also devise an adversarial min-max game to optimize causal features while penalizing confounding ones. Extensive experiments demonstrate the state-of-the-art performance of our proposed method. Code is available at https://github.com/WUTCM-Lab/CausalCLIPSeg.

Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention - MICCAI 2024 - 27th International Conference, Proceedings
EditorsMarius George Linguraru, Aasa Feragen, Ben Glocker, Julia A. Schnabel, Qi Dou, Stamatia Giannarou, Karim Lekadir
PublisherSpringer Science and Business Media Deutschland GmbH
Pages76-87
Number of pages12
ISBN (Print)9783031723834
DOIs
StatePublished - 2024
Event27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024 - Marrakesh, Morocco
Duration: 6 Oct 202410 Oct 2024

Publication series

NameLecture Notes in Computer Science
Volume15003 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI 2024
Country/TerritoryMorocco
CityMarrakesh
Period6/10/2410/10/24

Keywords

  • causal intervention
  • CLIP
  • cross-modal decoding
  • referring medical image segmentation

Fingerprint

Dive into the research topics of 'CausalCLIPSeg: Unlocking CLIP’s Potential in Referring Medical Image Segmentation with Causal Intervention'. Together they form a unique fingerprint.

Cite this