pytorch-errors

0. RuntimeError: save_for_backward can only save input or output tensors, but argument 0 doesn't satisfy this condition


When writing a custom Function / Module that needs backward, the inputs passed to it (and saved with save_for_backward) must be Variables, not raw Tensors.
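A minimal sketch using the legacy Variable-era Function API (the class and variable names are made up for illustration):

import torch
from torch.autograd import Function, Variable

class MySquare(Function):
    def forward(self, input):
        # ok: 'input' is an argument of the Function, so it may be saved
        self.save_for_backward(input)
        return input * input

    def backward(self, grad_output):
        input, = self.saved_tensors
        return 2 * input * grad_output

x = Variable(torch.randn(3), requires_grad=True)   # wrap the Tensor in a Variable first
y = MySquare()(x)                                   # per the note above, pass Variables, not raw Tensors
y.sum().backward()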


1. RuntimeError: Assertion `cur_target >= 0 && cur_target < n_classes' failed

The target labels fed to the classification loss must lie in [0, n_classes); clamp or remap out-of-range labels, e.g. lab[lab>=n_classes] = 0
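A quick sanity check along those lines (n_classes and lab are placeholders):

import torch

n_classes = 10
lab = torch.LongTensor([0, 3, 12, 9])   # 12 is out of range for 10 classes
assert lab.min() >= 0                    # negative labels also trigger the assertion
lab[lab >= n_classes] = 0                # or fix the labels in the dataset itself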


2. RuntimeError: std::bad_cast

check the data types of the tensors you pass in

e.g.

Variable( torch.from_numpy(data) ).float().cuda()

Variable( torch.from_numpy(label) ).long().cuda()


3. RuntimeError: tensors are on different GPUs

Some part of the computation (e.g. the model) is not on the GPU while the data is; move both to the same device.
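A sketch of the usual fix (model, images, labels are placeholders):

model = model.cuda()                  # move the model's parameters to the GPU
images = Variable(images.cuda())      # and do the same for every input tensor
labels = Variable(labels.cuda())
output = model(images)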


4. RuntimeError: CUDNN_STATUS_BAD_PARAM

Check the input and output channels of the layers; the incoming tensor's channel count must match the layer's in_channels.
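A hypothetical example of the kind of mismatch to look for:

import torch
import torch.nn as nn
from torch.autograd import Variable

conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3).cuda()
x = Variable(torch.randn(1, 1, 32, 32)).cuda()   # 1 channel, but the layer expects 3
# conv(x)  -> fails; make the tensor's channel count match in_channels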


5. THCudaCheck FAIL file=/b/wheel/pytorch-src/torch/lib/THC/generic/THCStorage.c line=79 error=2 : out of memory
Segmentation fault

https://discuss.pytorch.org/t/segmentation-fault-when-loading-weight/1381/8
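One workaround discussed in threads like the one above (not spelled out in this post) is to load the checkpoint into CPU memory first; 'model' and 'model.pth' below are placeholders:

import torch

state = torch.load('model.pth', map_location=lambda storage, loc: storage)  # keep the weights on the CPU
model.load_state_dict(state)
model.cuda()   # move to the GPU afterwards, once memory allows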


6. RuntimeError: CHECK_ARG(input->nDimension == output->nDimension) failed at torch/csrc/cudnn/Conv.cpp:275

The input data's shape differs from the input shape the model expects.
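For example, a missing batch dimension is a common cause ('model' is assumed to be an nn.Module on the GPU):

import torch
from torch.autograd import Variable

x = torch.randn(3, 224, 224)       # (C, H, W) without the batch dimension
x = x.unsqueeze(0)                 # (1, C, H, W), matching the model's expected 4-D input
out = model(Variable(x.cuda()))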


7. Errors from torch.utils.data Dataset / DataLoader when collating a batch:

File "//anaconda3/lib/python3.6/site-packages/torch/functional.py", line 60, in stack
    return torch.cat(inputs, dim, out=out)
TypeError: cat received an invalid combination of arguments - got (list, int, out=torch.ByteTensor), but expected one of:
 * (sequence[torch.ByteTensor] seq)
 * (sequence[torch.ByteTensor] seq, int dim)


TypeError: cat received an invalid combination of arguments - got (list, int), but expected one of:
 * (sequence[torch.ByteTensor] seq)
 * (sequence[torch.ByteTensor] seq, int dim)
      didn't match because some of the arguments have invalid types: (list, int)


Important: every __getitem__ call should return the same data types.

The default collate function builds a batch with torch.stack / torch.cat, which only accepts a sequence of tensors of the same type, so convert everything to one dtype inside the Dataset and cast to the desired dtype (e.g. .float().cuda()) later in the training loop, as in the sketch below.
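A sketch of a Dataset that always returns the same types (the names are made up):

import numpy as np
import torch
from torch.utils.data import Dataset

class MyDataset(Dataset):
    def __init__(self, images, labels):
        self.images = images   # e.g. a list of HxWxC uint8 numpy arrays
        self.labels = labels

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        img = torch.from_numpy(self.images[idx].astype(np.float32))  # always a FloatTensor
        lab = int(self.labels[idx])                                   # always an int -> LongTensor after collate
        return img, lab

Then cast with .float().cuda() / .long().cuda() inside the training loop as needed.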


8.   File "/home/wenyu/anaconda3/lib/python3.6/site-packages/torch/autograd/variable.py", line 167, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, retain_variables)
  File "/home/wenyu/anaconda3/lib/python3.6/site-packages/torch/autograd/__init__.py", line 99, in backward
    variables, grad_variables, retain_graph)

RuntimeError: CUDNN_STATUS_MAPPING_ERROR


The number of classes the model outputs may not match the targets / loss function.
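One thing to verify (a sketch with made-up sizes):

import torch.nn as nn

n_classes = 21
classifier = nn.Linear(512, n_classes)   # the final layer must output n_classes scores
criterion = nn.CrossEntropyLoss()        # and the targets must lie in [0, n_classes)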


9. RuntimeError: CUDNN_STATUS_INTERNAL_ERROR

The model and the data may be on different GPUs; keep them on the same device.
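A sketch of pinning everything to one device ('model', 'inputs', 'targets' are placeholders):

import torch
from torch.autograd import Variable

torch.cuda.set_device(0)           # pick one GPU
model = model.cuda()
inputs = Variable(inputs.cuda())
targets = Variable(targets.cuda())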



