The "Piz Daint" supercomputer at CSCS provides an ideal platform for supporting intensive deep learning workloads as it comprises thousands of Tesla GPU compute nodes communicating through a high-speed interconnect.

In this two-day course, we looked at how to run distributed deep learning workloads with TensorFlow on Piz Daint. We used simple examples to demonstrate best practices for building efficient input pipelines to maximize the throughput of deep learning models with TensorFlow. 

The course focused on two main subjects: reading data through input pipelines and asynchronous distributed training. 

Here you can watch the video of the "Multi GPU Training with TensorFlow on Piz Daint" course >