Distributed training example for Temporal Fusion Transformer in SageMaker

Asked by Philipp Schmid on 09 September 2022 at 12:01

We’re training a big Temporal Fusion Transformer using PyTorch.

We’re looking into using distributed training to accelerate our training jobs with SageMaker.

Does anyone have any examples of this? Any pattern you can recommend?

There is 1 best solution below

Arun Lokanatha on 15 September 2022 at 04:55

Although there is no direct example for this specific model, you should be able to follow the documentation below for PyTorch Lightning (PL).

Refer to the example below for a full walkthrough of using SageMaker DDP with PyTorch Lightning.
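
For orientation, here is a rough sketch of what the launcher side of such a job might look like. It assumes the SageMaker Python SDK's PyTorch estimator with the SageMaker distributed data parallel library switched on through the `distribution` argument. The script name `train_tft.py`, the `src` directory, the role ARN, the S3 URI, and the framework/Python versions are placeholders I made up, not values from the question, so adjust them to your setup. The Lightning training script itself is not shown.

```python
def build_estimator_kwargs(role_arn, instance_count=2, instance_type="ml.p4d.24xlarge"):
    """Build keyword arguments for sagemaker.pytorch.PyTorch with SageMaker DDP enabled.

    Everything marked "hypothetical" or "assumed" is an illustrative placeholder.
    """
    return {
        "entry_point": "train_tft.py",  # hypothetical Lightning training script
        "source_dir": "src",            # hypothetical directory containing the script
        "role": role_arn,               # IAM role the training job runs under
        "framework_version": "1.12.0",  # assumed PyTorch version; pick one your container supports
        "py_version": "py38",           # assumed Python version matching that container
        "instance_count": instance_count,
        "instance_type": instance_type,  # SageMaker DDP needs a supported multi-GPU instance type
        # Turns on the SageMaker distributed data parallel library for the job.
        "distribution": {"smdistributed": {"dataparallel": {"enabled": True}}},
    }


def launch_training(role_arn, train_s3_uri):
    """Start the training job. Needs the sagemaker SDK installed and AWS credentials."""
    # Imported here so the config helper above can be used without the SDK.
    from sagemaker.pytorch import PyTorch

    estimator = PyTorch(**build_estimator_kwargs(role_arn))
    # The "train" channel name is an assumption; the script reads it from SM_CHANNEL_TRAIN.
    estimator.fit({"train": train_s3_uri})
    return estimator
```

You would call `launch_training("<your-role-arn>", "s3://<your-bucket>/<prefix>")` from a notebook or a CI job. Inside the training script, the Lightning `Trainer` still has to be configured for multi-node DDP as described in the documentation referenced above.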