PyTorch Distributed and Fault Tolerance
Sponsored Session: PyTorch Distributed and Fault Tolerance - Tristan Rice, Meta

This session covers recent developments in PyTorch Distributed (PTD) communication, including upcoming features and improvements, large-scale training, and fault tolerance. We’ll survey the current PTD landscape, including new libraries such as torchft, and discuss future directions.