Challenge Results Are Not Reproducible

07/14/2023
by   Annika Reinke, et al.
0

While clinical trials are the state-of-the-art methods to assess the effect of new medication in a comparative manner, benchmarking in the field of medical image analysis is performed by so-called challenges. Recently, comprehensive analysis of multiple biomedical image analysis challenges revealed large discrepancies between the impact of challenges and quality control of the design and reporting standard. This work aims to follow up on these results and attempts to address the specific question of the reproducibility of the participants methods. In an effort to determine whether alternative interpretations of the method description may change the challenge ranking, we reproduced the algorithms submitted to the 2019 Robust Medical Image Segmentation Challenge (ROBUST-MIS). The leaderboard differed substantially between the original challenge and reimplementation, indicating that challenge rankings may not be sufficiently reproducible.

READ FULL TEXT
research
10/09/2019

BIAS: Transparent reporting of biomedical image analysis challenges

The number of biomedical image analysis challenges organized per year is...
research
11/19/2019

A Framework for Challenge Design: Insight and Deployment Challenges to Address Medical Image Analysis Problems

In this paper we aim to refine the concept of grand challenges in medica...
research
06/17/2021

How can we learn (more) from challenges? A statistical approach to driving future algorithm development.

Challenges have become the state-of-the-art approach to benchmark image...
research
10/11/2019

Methods and open-source toolkit for analyzing and visualizing challenge results

Biomedical challenges have become the de facto standard for benchmarking...
research
10/26/2018

Beyond the Leaderboard: Insight and Deployment Challenges to Address Research Problems

In the medical image analysis field, organizing challenges with associat...
research
06/06/2018

Is the winner really the best? A critical analysis of common research practice in biomedical image analysis competitions

International challenges have become the standard for validation of biom...
research
11/09/2022

Reproducibility in medical image radiomic studies: contribution of dynamic histogram binning

The de facto standard of dynamic histogram binning for radiomic feature ...

Please sign up or login with your details

Forgot password? Click here to reset