RuntimeError: Input sizes must be equal when doing loss.backward() during the training of a GNN
|
|
3
|
72
|
March 20, 2025
|
Habana synergy whl files similar to https://download.pytorch.org/whl/?
|
|
2
|
71
|
January 19, 2025
|
Current best inference server implementation for Gaudi2
|
|
3
|
423
|
January 2, 2025
|
Training of torch.nn.embedding failed: loss not decreasing
|
|
2
|
40
|
January 2, 2025
|
Synapse detected a device critical error that requires a restart. [Compute or dma timeout]
|
|
0
|
74
|
November 12, 2024
|
Graph compile failed when torch.repeat
|
|
3
|
79
|
November 3, 2024
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
6
|
107
|
October 23, 2024
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
1
|
53
|
September 24, 2024
|
Reason for segmentation fault
|
|
2
|
129
|
September 24, 2024
|
FP8 range for E4M3 dtype
|
|
3
|
236
|
September 4, 2024
|
Issue running Llama2 pretraining using megatron deepspeed
|
|
2
|
114
|
August 1, 2024
|
Problem with training llama-3-70b with deepspeed
|
|
1
|
182
|
July 18, 2024
|
Result of torch.argmax with -inf tensor on hpu is different from that of cpu and gpu
|
|
2
|
180
|
July 9, 2024
|
Tensors taking time to shift from HPU to CPU
|
|
2
|
110
|
July 9, 2024
|
Running optimum-habana sample on gaudi
|
|
2
|
218
|
June 27, 2024
|
Pytorch complex datatype
|
|
1
|
125
|
May 28, 2024
|
Does HPU support complex datatype in torch
|
|
1
|
167
|
May 28, 2024
|
Linear Layer Inconsistency
|
|
2
|
213
|
April 24, 2024
|
Gaudi1 HPU doesn't support long?
|
|
11
|
301
|
April 4, 2024
|
Error in installing habanalabs-dkms in ubunti 20.04 based docker image
|
|
0
|
163
|
March 29, 2024
|
Support for Mixtral - Optimum Habana
|
|
3
|
521
|
March 22, 2024
|
Model.to device faile: "RuntimeError: synStatus=8 [Device not found] Device acquire failed."
|
|
3
|
582
|
March 13, 2024
|
How to broadcast each element in float64 into a single vector
|
|
1
|
255
|
December 6, 2023
|
Pytorch Empty Tensor error when running Stable Diffusion on optimum-habana
|
|
9
|
598
|
November 14, 2023
|
Docker: Error response from daemon: Unknown runtime specified habana
|
|
4
|
889
|
July 20, 2023
|
Import Error
|
|
1
|
428
|
June 20, 2023
|
A question about how to use "wrap_in_hpu_graph"
|
|
3
|
640
|
April 25, 2023
|
unet2d training crash for 8 gaudis
|
|
2
|
650
|
March 17, 2023
|
Gaudi2 PyTorch Container - Device acquire failed
|
|
1
|
1300
|
February 22, 2023
|
When to use htcode.mark_step()
|
|
4
|
915
|
January 30, 2023
|