Video-based multi-target multi-camera tracking for postoperative phase recognition

Franziska Jurosch, Janik Zeller, Lars Wagner, Ege Özsoy, Alissa Jell, Sven Kolb, Dirk Wilhelm

Research output: Contribution to journalArticlepeer-review

Abstract

Purpose: Deep learning methods are commonly used to generate context understanding to support surgeons and medical professionals. By expanding the current focus beyond the operating room (OR) to postoperative workflows, new forms of assistance are possible. In this article, we propose a novel multi-target multi-camera tracking (MTMCT) architecture for postoperative phase recognition, location tracking, and automatic timestamp generation. Methods: Three RGB cameras were used to create a multi-camera data set containing 19 reenacted postoperative patient flows. Patients and beds were annotated and used to train the custom MTMCT architecture. It includes bed and patient tracking for each camera and a postoperative patient state module to provide the postoperative phase, current location of the patient, and automatically generated timestamps. Results: The architecture demonstrates robust performance for single- and multi-patient scenarios by embedding medical domain-specific knowledge. In multi-patient scenarios, the state machine representing the postoperative phases has a traversal accuracy of 84.9±6.0%, 91.4±1.5% of timestamps are generated correctly, and the patient tracking IDF1 reaches 92.0±3.6%. Comparative experiments show the effectiveness of using AFLink for matching partial trajectories in postoperative settings. Conclusion: As our approach shows promising results, it lays the foundation for real-time surgeon support, enhancing clinical documentation and ultimately improving patient care.

Original languageEnglish
Article number102306
JournalInternational Journal of Computer Assisted Radiology and Surgery
DOIs
StateAccepted/In press - 2025

Keywords

  • Patient tracking
  • Surgical data science
  • Surgical workflow analysis

Fingerprint

Dive into the research topics of 'Video-based multi-target multi-camera tracking for postoperative phase recognition'. Together they form a unique fingerprint.

Cite this