Extended Data Table 5 Performance comparison on COVID-19 diagnostic task

Evaluating Ark⁺’s transferability to a novel disease, COVID-19, that Ark⁺ has never encountered in its pretraining. Via linear-probing, Ark⁺ performs significantly better than MIM-CXR (p-value: 1.98E-04), on a par with RAD-DINO (p-value: 1.38E-01), and significantly inferior to CXR-FM (p-value: 3.44E-06) in Accuracy (ACC), while Ark⁺ outperforms MIM-CXR and RAD-DINO via fine-tuning and overtakes CXR-FM’s linear-probing performance. However, once Ark⁺ has been incrementally and continually pretrained with the COVID-19 diagnostic task, the resultant upgraded model, named Ark⁺_+covid, surpasses CXR-FM via linear-probing. The findings underscore the importance of openness in foundation models for continual pretraining, which endows Ark⁺ with the adaptability and scalability necessary to address novel diseases and potential future pandemics.
*Medical-MAE³⁷ achieved an accuracy of 97.3% and COVIDNet³⁵ holds an official accuracy score of 98.3% with their best-performing models.

Quick links

Search