Review of the large scale related optimizations performed on the well known resnet50 training workload on DGX based clusters. The optimization concepts which combine the profiling and modeling of workload’s execution at scale can be applied to other deep learning neural networks running on GPU based clusters.
We Would Love to Hear From You