HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal
  data improve recognition?

Abstract

This paper presents the design and results of the "PEg TRAnsfert Workflowrecognition" (PETRAW) challenge whose objective was to develop surgicalworkflow recognition methods based on one or several modalities, among video,kinematic, and segmentation data, in order to study their added value. ThePETRAW challenge provided a data set of 150 peg transfer sequences performed ona virtual simulator. This data set was composed of videos, kinematics, semanticsegmentation, and workflow annotations which described the sequences at threedifferent granularity levels: phase, step, and activity. Five tasks wereproposed to the participants: three of them were related to the recognition ofall granularities with one of the available modalities, while the othersaddressed the recognition with a combination of modalities. Averageapplication-dependent balanced accuracy (AD-Accuracy) was used as evaluationmetric to take unbalanced classes into account and because it is moreclinically relevant than a frame-by-frame score. Seven teams participated in atleast one task and four of them in all tasks. Best results are obtained withthe use of the video and the kinematics data with an AD-Accuracy between 93%and 90% for the four teams who participated in all tasks. The improvementbetween video/kinematic-based methods and the uni-modality ones was significantfor all of the teams. However, the difference in testing execution time betweenthe video/kinematic-based and the kinematic-based methods has to be taken intoconsideration. Is it relevant to spend 20 to 200 times more computing time forless than 3% of improvement? The PETRAW data set is publicly available atwww.synapse.org/PETRAW to encourage further research in surgical workflowrecognition.

Benchmarks

BenchmarkMethodologyMetrics
kinematic-based-workflow-recognition-onMedAIR
Average AD-Accuracy: 90.72
kinematic-based-workflow-recognition-onHutom
Average AD-Accuracy: 84.31
kinematic-based-workflow-recognition-onNCC Next
Average AD-Accuracy: 90.32
kinematic-based-workflow-recognition-onJHU-CIRL
Average AD-Accuracy: 86.45
kinematic-based-workflow-recognition-onMediCIS
Average AD-Accuracy: 89.71
kinematic-based-workflow-recognition-onSK
Average AD-Accuracy: 89.66
segmentation-based-workflow-recognition-onMediCIS
Average AD-Accuracy: 87.22
segmentation-based-workflow-recognition-onNCC Next
Average AD-Accuracy: 87.71
segmentation-based-workflow-recognition-onSK
Average AD-Accuracy: 88.51
segmentation-based-workflow-recognition-onHutom
Average AD-Accuracy: 60.28
semantic-segmentation-on-petrawSK
Mean IoU (class): 96.4
semantic-segmentation-on-petrawHutom
Mean IoU (class): 85
semantic-segmentation-on-petrawMediCIS
Mean IoU (class): 94
semantic-segmentation-on-petrawNCC Next
Mean IoU (class): 96.9
video-based-workflow-recognition-on-petrawSK
Average AD-Accuracy: 90.77
video-based-workflow-recognition-on-petrawMediCIS
Average AD-Accuracy: 89.15
video-based-workflow-recognition-on-petrawHutom
Average AD-Accuracy: 90.51
video-based-workflow-recognition-on-petrawNCC Next
Average AD-Accuracy: 87.77
video-based-workflow-recognition-on-petrawMedAIR
Average AD-Accuracy: 84.31
video-kinematic-base-workflow-recognition-onNCC Next
Average AD-Accuracy: 93.09
video-kinematic-base-workflow-recognition-onMMLAB
Average AD-Accuracy: 84.8
video-kinematic-base-workflow-recognition-onMedAIR
Average AD-Accuracy: 86.98
video-kinematic-base-workflow-recognition-onSK
Average AD-Accuracy: 91.61
video-kinematic-base-workflow-recognition-onHutom
Average AD-Accuracy: 91.33
video-kinematic-base-workflow-recognition-onMediCIS
Average AD-Accuracy: 90.18
video-kinematic-segmentation-base-workflowMediCIS Task 5
Average AD-Accuracy: 89.81
video-kinematic-segmentation-base-workflowSK
Average AD-Accuracy: 91.37
video-kinematic-segmentation-base-workflowNCC Next
Average AD-Accuracy: 93.09
video-kinematic-segmentation-base-workflowHutom
Average AD-Accuracy: 91.27

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition? | Papers | HyperAI