Figure 7
From: Deploying and scaling distributed parallel deep neural networks on the Tianhe-3 prototype system

Training loss and training time for multi-node training on MT2000+ and FT2000+.