Fig. 2: Model performance in the complex clinical case diagnosis task under automatic evaluation. | npj Digital Medicine