TY - JOUR
T1 - Video-based multi-target multi-camera tracking for postoperative phase recognition
AU - Jurosch, Franziska
AU - Zeller, Janik
AU - Wagner, Lars
AU - Özsoy, Ege
AU - Jell, Alissa
AU - Kolb, Sven
AU - Wilhelm, Dirk
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025
Y1 - 2025
N2 - Purpose: Deep learning methods are commonly used to generate context understanding to support surgeons and medical professionals. By expanding the current focus beyond the operating room (OR) to postoperative workflows, new forms of assistance are possible. In this article, we propose a novel multi-target multi-camera tracking (MTMCT) architecture for postoperative phase recognition, location tracking, and automatic timestamp generation. Methods: Three RGB cameras were used to create a multi-camera data set containing 19 reenacted postoperative patient flows. Patients and beds were annotated and used to train the custom MTMCT architecture. It includes bed and patient tracking for each camera and a postoperative patient state module to provide the postoperative phase, current location of the patient, and automatically generated timestamps. Results: The architecture demonstrates robust performance for single- and multi-patient scenarios by embedding medical domain-specific knowledge. In multi-patient scenarios, the state machine representing the postoperative phases has a traversal accuracy of 84.9±6.0%, 91.4±1.5% of timestamps are generated correctly, and the patient tracking IDF1 reaches 92.0±3.6%. Comparative experiments show the effectiveness of using AFLink for matching partial trajectories in postoperative settings. Conclusion: As our approach shows promising results, it lays the foundation for real-time surgeon support, enhancing clinical documentation and ultimately improving patient care.
AB - Purpose: Deep learning methods are commonly used to generate context understanding to support surgeons and medical professionals. By expanding the current focus beyond the operating room (OR) to postoperative workflows, new forms of assistance are possible. In this article, we propose a novel multi-target multi-camera tracking (MTMCT) architecture for postoperative phase recognition, location tracking, and automatic timestamp generation. Methods: Three RGB cameras were used to create a multi-camera data set containing 19 reenacted postoperative patient flows. Patients and beds were annotated and used to train the custom MTMCT architecture. It includes bed and patient tracking for each camera and a postoperative patient state module to provide the postoperative phase, current location of the patient, and automatically generated timestamps. Results: The architecture demonstrates robust performance for single- and multi-patient scenarios by embedding medical domain-specific knowledge. In multi-patient scenarios, the state machine representing the postoperative phases has a traversal accuracy of 84.9±6.0%, 91.4±1.5% of timestamps are generated correctly, and the patient tracking IDF1 reaches 92.0±3.6%. Comparative experiments show the effectiveness of using AFLink for matching partial trajectories in postoperative settings. Conclusion: As our approach shows promising results, it lays the foundation for real-time surgeon support, enhancing clinical documentation and ultimately improving patient care.
KW - Patient tracking
KW - Surgical data science
KW - Surgical workflow analysis
UR - http://www.scopus.com/inward/record.url?scp=105002358922&partnerID=8YFLogxK
U2 - 10.1007/s11548-025-03344-x
DO - 10.1007/s11548-025-03344-x
M3 - Article
AN - SCOPUS:105002358922
SN - 1861-6410
JO - International Journal of Computer Assisted Radiology and Surgery
JF - International Journal of Computer Assisted Radiology and Surgery
M1 - 102306
ER -