Training of PyTorch Efficientnet seems extremely slow

Sayantan_S · August 23, 2022, 5:09am

Here are a couple of work arounds you can try:

Perform bernoulli_ of StochasticDepth on CPU
Around this line here,

if 'hpu' in input.device.type:
    dev = 'cpu'
#noise = torch.empty(size, dtype=input.dtype, device=input.device)
noise = torch.empty(size, dtype=input.dtype, device=dev)
noise = noise.bernoulli_(survival_rate)
if 'hpu' in input.device.type:
    noise = noise.to(input.device.type)

Disable inplace Dropout
Around here
Replace nn.Dropout(p=dropout, inplace=True), with nn.Dropout(p=dropout),

Please let me know if you see speedups with these 2 changes.

Thanks
Sayantan

Topic		Replies	Views
PyTorch model works on CPU/CUDA but not on HPU Training pytorch	5	1729	January 19, 2022
Habana Gaudi Hpus Training time improvement TensorFlow	2	647	September 30, 2022
Trainer killed/Segfault PyTorch	6	616	September 1, 2023
Tensors taking time to shift from HPU to CPU Inference pytorch	2	116	July 9, 2024
Gaudi2 slower compared to A100 Training	10	640	June 7, 2023

Training of PyTorch Efficientnet seems extremely slow

Related topics