RuntimeError: Input sizes must be equal when doing loss.backward() during the training of a GNN
|
|
3
|
60
|
March 20, 2025
|
Habana synergy whl files similar to https://download.pytorch.org/whl/?
|
|
2
|
67
|
January 19, 2025
|
Current best inference server implementation for Gaudi2
|
|
3
|
411
|
January 2, 2025
|
Training of torch.nn.embedding failed: loss not decreasing
|
|
2
|
35
|
January 2, 2025
|
Synapse detected a device critical error that requires a restart. [Compute or dma timeout]
|
|
0
|
62
|
November 12, 2024
|
Graph compile failed when torch.repeat
|
|
3
|
70
|
November 3, 2024
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
6
|
102
|
October 23, 2024
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
1
|
48
|
September 24, 2024
|
Reason for segmentation fault
|
|
2
|
116
|
September 24, 2024
|
FP8 range for E4M3 dtype
|
|
3
|
207
|
September 4, 2024
|
Issue running Llama2 pretraining using megatron deepspeed
|
|
2
|
101
|
August 1, 2024
|
Problem with training llama-3-70b with deepspeed
|
|
1
|
169
|
July 18, 2024
|
Result of torch.argmax with -inf tensor on hpu is different from that of cpu and gpu
|
|
2
|
174
|
July 9, 2024
|
Tensors taking time to shift from HPU to CPU
|
|
2
|
104
|
July 9, 2024
|
Running optimum-habana sample on gaudi
|
|
2
|
205
|
June 27, 2024
|
Pytorch complex datatype
|
|
1
|
118
|
May 28, 2024
|
Does HPU support complex datatype in torch
|
|
1
|
156
|
May 28, 2024
|
Linear Layer Inconsistency
|
|
2
|
209
|
April 24, 2024
|
Gaudi1 HPU doesn't support long?
|
|
11
|
295
|
April 4, 2024
|
Error in installing habanalabs-dkms in ubunti 20.04 based docker image
|
|
0
|
155
|
March 29, 2024
|
Support for Mixtral - Optimum Habana
|
|
3
|
515
|
March 22, 2024
|
Model.to device faile: "RuntimeError: synStatus=8 [Device not found] Device acquire failed."
|
|
3
|
548
|
March 13, 2024
|
How to broadcast each element in float64 into a single vector
|
|
1
|
251
|
December 6, 2023
|
Pytorch Empty Tensor error when running Stable Diffusion on optimum-habana
|
|
9
|
580
|
November 14, 2023
|
Docker: Error response from daemon: Unknown runtime specified habana
|
|
4
|
861
|
July 20, 2023
|
Import Error
|
|
1
|
419
|
June 20, 2023
|
A question about how to use "wrap_in_hpu_graph"
|
|
3
|
633
|
April 25, 2023
|
unet2d training crash for 8 gaudis
|
|
2
|
644
|
March 17, 2023
|
Gaudi2 PyTorch Container - Device acquire failed
|
|
1
|
1272
|
February 22, 2023
|
When to use htcode.mark_step()
|
|
4
|
901
|
January 30, 2023
|