5 months ago

PEg TRAnsfer Workflow recognition challenge report: Does multi-modal data improve recognition?

Huaulmé Arnaud ; Harada Kanako ; Nguyen Quang-Minh ; Park Bogyu ; Hong Seungbum ; Choi Min-Kook ; Peven Michael ; Li Yunshuang ; Long Yonghao ; Dou

Abstract

This paper presents the design and results of the "PEg TRAnsfert Workflowrecognition" (PETRAW) challenge whose objective was to develop surgicalworkflow recognition methods based on one or several modalities, among video,kinematic, and segmentation data, in order to study their added value. ThePETRAW challenge provided a data set of 150 peg transfer sequences performed ona virtual simulator. This data set was composed of videos, kinematics, semanticsegmentation, and workflow annotations which described the sequences at threedifferent granularity levels: phase, step, and activity. Five tasks wereproposed to the participants: three of them were related to the recognition ofall granularities with one of the available modalities, while the othersaddressed the recognition with a combination of modalities. Averageapplication-dependent balanced accuracy (AD-Accuracy) was used as evaluationmetric to take unbalanced classes into account and because it is moreclinically relevant than a frame-by-frame score. Seven teams participated in atleast one task and four of them in all tasks. Best results are obtained withthe use of the video and the kinematics data with an AD-Accuracy between 93%and 90% for the four teams who participated in all tasks. The improvementbetween video/kinematic-based methods and the uni-modality ones was significantfor all of the teams. However, the difference in testing execution time betweenthe video/kinematic-based and the kinematic-based methods has to be taken intoconsideration. Is it relevant to spend 20 to 200 times more computing time forless than 3% of improvement? The PETRAW data set is publicly available atwww.synapse.org/PETRAW to encourage further research in surgical workflowrecognition.

Benchmarks

Benchmark	Methodology	Metrics
kinematic-based-workflow-recognition-on	MedAIR	Average AD-Accuracy: 90.72
kinematic-based-workflow-recognition-on	Hutom	Average AD-Accuracy: 84.31
kinematic-based-workflow-recognition-on	NCC Next	Average AD-Accuracy: 90.32
kinematic-based-workflow-recognition-on	JHU-CIRL	Average AD-Accuracy: 86.45
kinematic-based-workflow-recognition-on	MediCIS	Average AD-Accuracy: 89.71
kinematic-based-workflow-recognition-on	SK	Average AD-Accuracy: 89.66
segmentation-based-workflow-recognition-on	MediCIS	Average AD-Accuracy: 87.22
segmentation-based-workflow-recognition-on	NCC Next	Average AD-Accuracy: 87.71
segmentation-based-workflow-recognition-on	SK	Average AD-Accuracy: 88.51
segmentation-based-workflow-recognition-on	Hutom	Average AD-Accuracy: 60.28
semantic-segmentation-on-petraw	SK	Mean IoU (class): 96.4
semantic-segmentation-on-petraw	Hutom	Mean IoU (class): 85
semantic-segmentation-on-petraw	MediCIS	Mean IoU (class): 94
semantic-segmentation-on-petraw	NCC Next	Mean IoU (class): 96.9
video-based-workflow-recognition-on-petraw	SK	Average AD-Accuracy: 90.77
video-based-workflow-recognition-on-petraw	MediCIS	Average AD-Accuracy: 89.15
video-based-workflow-recognition-on-petraw	Hutom	Average AD-Accuracy: 90.51
video-based-workflow-recognition-on-petraw	NCC Next	Average AD-Accuracy: 87.77
video-based-workflow-recognition-on-petraw	MedAIR	Average AD-Accuracy: 84.31
video-kinematic-base-workflow-recognition-on	NCC Next	Average AD-Accuracy: 93.09
video-kinematic-base-workflow-recognition-on	MMLAB	Average AD-Accuracy: 84.8
video-kinematic-base-workflow-recognition-on	MedAIR	Average AD-Accuracy: 86.98
video-kinematic-base-workflow-recognition-on	SK	Average AD-Accuracy: 91.61
video-kinematic-base-workflow-recognition-on	Hutom	Average AD-Accuracy: 91.33
video-kinematic-base-workflow-recognition-on	MediCIS	Average AD-Accuracy: 90.18
video-kinematic-segmentation-base-workflow	MediCIS Task 5	Average AD-Accuracy: 89.81
video-kinematic-segmentation-base-workflow	SK	Average AD-Accuracy: 91.37
video-kinematic-segmentation-base-workflow	NCC Next	Average AD-Accuracy: 93.09
video-kinematic-segmentation-base-workflow	Hutom	Average AD-Accuracy: 91.27

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette