TY - JOUR
T1 - Know your sensors — a modality study for surgical action classification
AU - Bastian, Lennart
AU - Czempiel, Tobias
AU - Heiliger, Christian
AU - Karcz, Konrad
AU - Eck, Ulrich
AU - Busam, Benjamin
AU - Navab, Nassir
N1 - Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2023
Y1 - 2023
N2 - The surgical operating room (OR) presents many opportunities for automation and optimisation. Videos from various sources in the OR are becoming increasingly available. The medical community seeks to leverage this wealth of data to develop automated methods to advance interventional care, lower costs, and improve overall patient outcomes. Existing datasets from OR room cameras are thus far limited in size or modalities acquired, leaving it unclear which sensor modalities are best suited for tasks such as recognising surgical action from videos. This study demonstrates that the task of surgical workflow classification is highly dependent on the sensor modalities used. We perform a systematic analysis on several commonly available sensor modalities, evaluating two commonly used fusion approaches that can improve classification performance. Our findings are consistent across model architectures as well as separate camera views. The analyses are carried out on a set of multi-view RGB-D video recordings of 16 laparoscopic interventions.
AB - The surgical operating room (OR) presents many opportunities for automation and optimisation. Videos from various sources in the OR are becoming increasingly available. The medical community seeks to leverage this wealth of data to develop automated methods to advance interventional care, lower costs, and improve overall patient outcomes. Existing datasets from OR room cameras are thus far limited in size or modalities acquired, leaving it unclear which sensor modalities are best suited for tasks such as recognising surgical action from videos. This study demonstrates that the task of surgical workflow classification is highly dependent on the sensor modalities used. We perform a systematic analysis on several commonly available sensor modalities, evaluating two commonly used fusion approaches that can improve classification performance. Our findings are consistent across model architectures as well as separate camera views. The analyses are carried out on a set of multi-view RGB-D video recordings of 16 laparoscopic interventions.
KW - Surgical workflow analysis
KW - aware operating room
KW - video action recognition
UR - http://www.scopus.com/inward/record.url?scp=85144280383&partnerID=8YFLogxK
U2 - 10.1080/21681163.2022.2152377
DO - 10.1080/21681163.2022.2152377
M3 - Article
AN - SCOPUS:85144280383
SN - 2168-1163
VL - 11
SP - 1113
EP - 1121
JO - Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization
JF - Computer Methods in Biomechanics and Biomedical Engineering: Imaging and Visualization
IS - 4
ER -