CohortFinder: an open-source tool for data-driven partitioning of biomedical image cohorts to yield robust machine learning models

07/17/2023
by   Fan Fan, et al.
0

Batch effects (BEs) refer to systematic technical differences in data collection unrelated to biological variations whose noise is shown to negatively impact machine learning (ML) model generalizability. Here we release CohortFinder, an open-source tool aimed at mitigating BEs via data-driven cohort partitioning. We demonstrate CohortFinder improves ML model performance in downstream medical image processing tasks. CohortFinder is freely available for download at cohortfinder.com.

READ FULL TEXT

page 5

page 6

page 17

page 18

page 20

page 23

page 24

page 25

research
04/08/2023

SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data

Training sophisticated machine learning (ML) models requires large datas...
research
06/29/2023

ManimML: Communicating Machine Learning Architectures with Animation

There has been an explosion in interest in machine learning (ML) in rece...
research
07/09/2019

The What-If Tool: Interactive Probing of Machine Learning Models

A key challenge in developing and deploying Machine Learning (ML) system...
research
07/08/2021

Data-Driven Extract Method Recommendations: A Study at ING

The sound identification of refactoring opportunities is still an open p...
research
04/10/2020

MRQy: An Open-Source Tool for Quality Control of MR Imaging Data

Even as public data repositories such as The Cancer Imaging Archive (TCI...
research
01/25/2022

Design choice and machine learning model performances

An increasing number of publications present the joint application of De...
research
05/09/2022

On Designing Data Models for Energy Feature Stores

The digitization of the energy infrastructure enables new, data driven, ...

Please sign up or login with your details

Forgot password? Click here to reset