|
RuntimeError: Input sizes must be equal when doing loss.backward() during the training of a GNN
|
|
3
|
126
|
March 20, 2025
|
|
Habana synergy whl files similar to https://download.pytorch.org/whl/?
|
|
2
|
127
|
January 19, 2025
|
|
Current best inference server implementation for Gaudi2
|
|
3
|
479
|
January 2, 2025
|
|
Training of torch.nn.embedding failed: loss not decreasing
|
|
2
|
80
|
January 2, 2025
|
|
Synapse detected a device critical error that requires a restart. [Compute or dma timeout]
|
|
0
|
123
|
November 12, 2024
|
|
Graph compile failed when torch.repeat
|
|
3
|
142
|
November 3, 2024
|
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
6
|
155
|
October 23, 2024
|
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
1
|
80
|
September 24, 2024
|
|
Reason for segmentation fault
|
|
2
|
171
|
September 24, 2024
|
|
FP8 range for E4M3 dtype
|
|
3
|
373
|
September 4, 2024
|
|
Issue running Llama2 pretraining using megatron deepspeed
|
|
2
|
142
|
August 1, 2024
|
|
Problem with training llama-3-70b with deepspeed
|
|
1
|
243
|
July 18, 2024
|
|
Result of torch.argmax with -inf tensor on hpu is different from that of cpu and gpu
|
|
2
|
214
|
July 9, 2024
|
|
Tensors taking time to shift from HPU to CPU
|
|
2
|
148
|
July 9, 2024
|
|
Running optimum-habana sample on gaudi
|
|
2
|
279
|
June 27, 2024
|
|
Pytorch complex datatype
|
|
1
|
157
|
May 28, 2024
|
|
Does HPU support complex datatype in torch
|
|
1
|
221
|
May 28, 2024
|
|
Linear Layer Inconsistency
|
|
2
|
245
|
April 24, 2024
|
|
Gaudi1 HPU doesn't support long?
|
|
11
|
346
|
April 4, 2024
|
|
Error in installing habanalabs-dkms in ubunti 20.04 based docker image
|
|
0
|
196
|
March 29, 2024
|
|
Support for Mixtral - Optimum Habana
|
|
3
|
556
|
March 22, 2024
|
|
Model.to device faile: "RuntimeError: synStatus=8 [Device not found] Device acquire failed."
|
|
3
|
667
|
March 13, 2024
|
|
How to broadcast each element in float64 into a single vector
|
|
1
|
286
|
December 6, 2023
|
|
Pytorch Empty Tensor error when running Stable Diffusion on optimum-habana
|
|
9
|
696
|
November 14, 2023
|
|
Docker: Error response from daemon: Unknown runtime specified habana
|
|
4
|
1038
|
July 20, 2023
|
|
Import Error
|
|
1
|
461
|
June 20, 2023
|
|
A question about how to use "wrap_in_hpu_graph"
|
|
3
|
687
|
April 25, 2023
|
|
unet2d training crash for 8 gaudis
|
|
2
|
682
|
March 17, 2023
|
|
Gaudi2 PyTorch Container - Device acquire failed
|
|
1
|
1366
|
February 22, 2023
|
|
When to use htcode.mark_step()
|
|
4
|
977
|
January 30, 2023
|