Table 2 Benchmarking user roles and responsibilities

From: Federated benchmarking of medical artificial intelligence with MedPerf

Role name

Role definition

Role responsibilities

Benchmark committee

Benchmark committee includes regulatory bodies, groups of experts (for example, clinicians, patient representative groups), and data or model owners wishing to drive evaluation of their model or data.

Authors the benchmark, manages all benchmark assets, and produces some assets (for example, dataset preparation).

Recruits model owners and data owners, makes an open benchmark for model owners and approves applicants.

Controls access to the aggregated statistical results.

Data owner

Data owners may include hospitals, medical practices, research organizations and healthcare insurance providers that ‘own’ medical data, register medical data and execute benchmark requests.

Registers data with benchmarking platform.

Performs data labelling.

Downloads and executes a data preparation processor to prepare data.

Downloads and periodically uses platform client to approve and serve requests, and to approve and upload results to or from benchmarking platform.

Model owner

Model owners include AI researchers and software vendors that own a trained medical AI model and want to evaluate its performance.

Registers model with benchmarking platform

Views results of their model on the benchmark

Has the option to approve sharing of results of that benchmark with other model/data owners or the public if allowed by benchmark group

Platform provider

Organizations such as MLCommons, which operate a platform that enables benchmark groups to run benchmarks by connecting data owners with model owners.

Manages user accounts and provides a website for registering and discovering benchmarks, datasets, models, and for overall workflow management

Coordinates active benchmarks by sending requests, aggregating results and managing result access