One-Shot Medical Video Object Segmentation via Temporal Contrastive Memory Networks

Yaxiong Chen, Junjian Hu, Chunlei Li, Zixuan Zheng, Jingliang Hu, Yilei Shi, Shengwu Xiong, Xiao Xiang Zhu, Lichao Mou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Video object segmentation is crucial for the efficient analysis of complex medical video data, yet it faces significant challenges in data availability and annotation. We introduce the task of one-shot medical video object segmentation, which requires separating foreground and background pixels throughout a video given only the mask annotation of the first frame. To address this problem, we propose a temporal contrastive memory network comprising image and mask encoders to learn feature representations, a temporal contrastive memory bank that aligns embeddings from adjacent frames while pushing apart distant ones to explicitly model inter-frame relationships and stores these features, and a decoder that fuses encoded image features and memory readouts for segmentation. We also collect a diverse, multi-source medical video dataset spanning various modalities and anatomies to benchmark this task. Extensive experiments demonstrate state-of-the-art performance in segmenting both seen and unseen structures from a single exemplar, showing ability to generalize from scarce labels. This highlights the potential to alleviate annotation burdens for medical video analysis. Code is available at https://github.com/MedAITech/TCMN.

Original languageEnglish
Title of host publicationApplications of Medical Artificial Intelligence - 3rd International Workshop, AMAI 2024, Held in Conjunction with MICCAI 2024, Proceedings
EditorsShandong Wu, Behrouz Shabestari, Lei Xing
PublisherSpringer Science and Business Media Deutschland GmbH
Pages241-251
Number of pages11
ISBN (Print)9783031820069
DOIs
StatePublished - 2025
Event3rd International Workshop on Applications of Medical Artificial Intelligence, AMAI 2024 held in conjunction with the 27th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2024 - Marrakesh, Morocco
Duration: 6 Oct 20246 Oct 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15384 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd International Workshop on Applications of Medical Artificial Intelligence, AMAI 2024 held in conjunction with the 27th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2024
Country/TerritoryMorocco
CityMarrakesh
Period6/10/246/10/24

Keywords

  • medical imaging
  • memory network
  • one-shot learning
  • temporal contrastive learning
  • video object segmentation

Fingerprint

Dive into the research topics of 'One-Shot Medical Video Object Segmentation via Temporal Contrastive Memory Networks'. Together they form a unique fingerprint.

Cite this