Figure 4

Evaluation of the trained model ensemble on the test dataset. Each CT scan was annotated by three different readers and additionally a standard of reference was created by majority voting.
Evaluation of the trained model ensemble on the test dataset. Each CT scan was annotated by three different readers and additionally a standard of reference was created by majority voting.