Extended Data Fig. 1: Test performance of models predicting 72 h oxygen treatment trained on local data only versus the performance of the best global model available on the server.
From: Federated learning for predicting clinical outcomes in patients with COVID-19

a, Test performance of models predicting 72 h oxygen treatment trained on local data only (Local) versus the performance of the best global model available on the server (FL (gl. best)). b, Generalizability (average performance on other sites’ test data) as a function of a site’s dataset size (# cases). Compared with locally trained models, the best global model improved average test performance by 18% (from 0.760 to 0.899, a gain of 13.9 percentage points) and average generalizability by 34% (from 0.669 to 0.899, a gain of 23.0 percentage points).
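
The caption reports each gain two ways: as a relative improvement (in %) and as an absolute difference (in percentage points). The short sketch below reproduces both figures from the before/after values quoted in the caption. The helper name `improvement` is hypothetical and is not taken from the paper or its code.

```python
def improvement(before: float, after: float) -> tuple[float, float]:
    """Return (relative gain in %, absolute gain in percentage points)."""
    absolute_pp = (after - before) * 100            # difference of the two scores
    relative_pct = (after - before) / before * 100  # difference relative to the baseline
    return relative_pct, absolute_pp


# Before/after values as quoted in the caption.
perf_rel, perf_pp = improvement(0.760, 0.899)  # local vs. global, own test data
gen_rel, gen_pp = improvement(0.669, 0.899)    # local vs. global, other sites' test data

print(f"Test performance: {perf_rel:.0f}% relative, {perf_pp:.1f} percentage points")
print(f"Generalizability: {gen_rel:.0f}% relative, {gen_pp:.1f} percentage points")
```

The relative gain divides by the locally trained baseline, which is why the same global score of 0.899 yields a larger percentage for generalizability, where the baseline is lower.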