About the PyTorch category
|
|
0
|
630
|
December 21, 2020
|
Activation checkpointing modules with kwargs in forward
|
|
1
|
20
|
January 19, 2025
|
AttributeError: module 'habana_frameworks.torch.hpu' has no attribute 'wrap_in_hpu_graph
|
|
4
|
38
|
January 19, 2025
|
Hccl failure to be connected on two nodes with a simple script
|
|
1
|
24
|
January 2, 2025
|
Training of torch.nn.embedding failed: loss not decreasing
|
|
2
|
22
|
January 2, 2025
|
Transferring kNN results from CPU to HPU breaks back propagation
|
|
0
|
18
|
December 3, 2024
|
RuntimeError: [Rank:0] FATAL ERROR :: MODULE:PT_BRIDGE Exception in Lowering thread
|
|
2
|
107
|
December 3, 2024
|
RuntimeError: Input sizes must be equal when doing loss.backward() during the training of a GNN
|
|
2
|
37
|
November 12, 2024
|
Synapse detected a device critical error that requires a restart. [Compute or dma timeout]
|
|
0
|
37
|
November 12, 2024
|
NotImplementedError: Could not run 'aten::_sparse_coo_tensor_with_dims_and_tensors' with arguments from the 'SparseHPU' backend
|
|
1
|
105
|
November 12, 2024
|
GCNConv fails with normalization
|
|
0
|
46
|
November 5, 2024
|
AttributeError : 'HabanaParameterWrapper' object has no attribute 'change_device_placement'
|
|
1
|
38
|
September 24, 2024
|
Issue running Llama2 pretraining using megatron deepspeed
|
|
2
|
88
|
August 1, 2024
|
Hpu_backend not found on torch.compile
|
|
2
|
187
|
July 11, 2024
|
RuntimeError: No backend type associated with device type cpu
|
|
2
|
872
|
April 19, 2024
|
Gaudi1 HPU doesn't support long?
|
|
11
|
273
|
April 4, 2024
|
SyncBatchNorm Error
|
|
5
|
277
|
March 21, 2024
|
Trainer killed/Segfault
|
|
6
|
543
|
September 1, 2023
|
How to set default tensor device as HPU?
|
|
2
|
638
|
February 9, 2023
|
Wrong error message when out of memory
|
|
1
|
543
|
January 30, 2023
|
Gaudi Torch Cummax
|
|
4
|
814
|
November 14, 2022
|
Hugging Face Transformers using all 8 Habana Gaudi Devices
|
|
4
|
1313
|
July 7, 2022
|
Torch c++ frontend support
|
|
3
|
677
|
May 16, 2022
|