Fig. 3: Six readers averaged performance with Baseline DL models and MRP on internal test sets across the NAT care.

(Top) ROC curves with 95% CIs in bracket calculated with boot-strapping. (Bottom) PRCs with 95% CIs. From left to right: Pre-NAT(Staging), Mid-NAT, Post-NAT(Pre-surgical). rhpc refers to the model trained by radiological assessments (r), histopathological assessments (h), personal patient records (p), and clinical data (c), detailed definitions can be found in Methods and Fig. 1. iMGrhpc is based on Pre-NAT mammogram and rhpc data, while iMRrhpc is based on single/longitudinal MRI(s) embedding with temporal information and rhpc data. MRP aggregates and optimizes the outputs of iMGrhpc model and iMRrhpc model.