pytorch-errors

0. RuntimeError: save_for_backward can only save input or output tensors, but argument 0 doesn't satisfy this condition


When writing a custom Function / Module that needs backward, the inputs passed to it (and saved with save_for_backward) must be Variables, not raw Tensors.
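A minimal sketch using the legacy Variable-era Function API (the class and variable names are made up for illustration):

import torch
from torch.autograd import Function, Variable

class MySquare(Function):
    def forward(self, input):
        # ok: 'input' is an argument of the Function, so it may be saved
        self.save_for_backward(input)
        return input * input

    def backward(self, grad_output):
        input, = self.saved_tensors
        return 2 * input * grad_output

x = Variable(torch.randn(3), requires_grad=True)   # wrap the Tensor in a Variable first
y = MySquare()(x)                                   # per the note above, pass Variables, not raw Tensors
y.sum().backward()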


1. RuntimeError: Assertion `cur_target >= 0 && cur_target < n_classes' failed

The target labels fed to the classification loss must lie in [0, n_classes); clamp or remap out-of-range labels, e.g. lab[lab>=n_classes] = 0
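A quick sanity check along those lines (n_classes and lab are placeholders):

import torch

n_classes = 10
lab = torch.LongTensor([0, 3, 12, 9])   # 12 is out of range for 10 classes
assert lab.min() >= 0                    # negative labels also trigger the assertion
lab[lab >= n_classes] = 0                # or fix the labels in the dataset itself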


2. RuntimeError: std::bad_cast

check the data types of the tensors you pass in

e.g.

Variable( torch.from_numpy(data) ).float().cuda()

Variable( torch.from_numpy(label) ).long().cuda()


3. RuntimeError: tensors are on different GPUs

Some part of the computation (e.g. the model) is not on the GPU while the data is; move both to the same device.
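A sketch of the usual fix (model, images, labels are placeholders):

model = model.cuda()                  # move the model's parameters to the GPU
images = Variable(images.cuda())      # and do the same for every input tensor
labels = Variable(labels.cuda())
output = model(images)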


4. RuntimeError: CUDNN_STATUS_BAD_PARAM

Check the input and output channels of the layers; the incoming tensor's channel count must match the layer's in_channels.
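A hypothetical example of the kind of mismatch to look for:

import torch
import torch.nn as nn
from torch.autograd import Variable

conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3).cuda()
x = Variable(torch.randn(1, 1, 32, 32)).cuda()   # 1 channel, but the layer expects 3
# conv(x)  -> fails; make the tensor's channel count match in_channels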


5. THCudaCheck FAIL file=/b/wheel/pytorch-src/torch/lib/THC/generic/THCStorage.c line=79 error=2 : out of memory
Segmentation fault

https://discuss.pytorch.org/t/segmentation-fault-when-loading-weight/1381/8
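One workaround discussed in threads like the one above (not spelled out in this post) is to load the checkpoint into CPU memory first; 'model' and 'model.pth' below are placeholders:

import torch

state = torch.load('model.pth', map_location=lambda storage, loc: storage)  # keep the weights on the CPU
model.load_state_dict(state)
model.cuda()   # move to the GPU afterwards, once memory allows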


6. RuntimeError: CHECK_ARG(input->nDimension == output->nDimension) failed at torch/csrc/cudnn/Conv.cpp:275

The input data's shape differs from the input shape the model expects.
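For example, a missing batch dimension is a common cause ('model' is assumed to be an nn.Module on the GPU):

import torch
from torch.autograd import Variable

x = torch.randn(3, 224, 224)       # (C, H, W) without the batch dimension
x = x.unsqueeze(0)                 # (1, C, H, W), matching the model's expected 4-D input
out = model(Variable(x.cuda()))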


7. Errors from torch.utils.data Dataset / DataLoader when collating a batch:

File "//anaconda3/lib/python3.6/site-packages/torch/functional.py", line 60, in stack
    return torch.cat(inputs, dim, out=out)
TypeError: cat received an invalid combination of arguments - got (list, int, out=torch.ByteTensor), but expected one of:
 * (sequence[torch.ByteTensor] seq)
 * (sequence[torch.ByteTensor] seq, int dim)


TypeError: cat received an invalid combination of arguments - got (list, int), but expected one of:
 * (sequence[torch.ByteTensor] seq)
 * (sequence[torch.ByteTensor] seq, int dim)
      didn't match because some of the arguments have invalid types: (list, int)


Important: every __getitem__ call should return the same data types.

The default collate function builds a batch with torch.stack / torch.cat, which only accepts a sequence of tensors of the same type, so convert everything to one dtype inside the Dataset and cast to the desired dtype (e.g. .float().cuda()) later in the training loop, as in the sketch below.
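A sketch of a Dataset that always returns the same types (the names are made up):

import numpy as np
import torch
from torch.utils.data import Dataset

class MyDataset(Dataset):
    def __init__(self, images, labels):
        self.images = images   # e.g. a list of HxWxC uint8 numpy arrays
        self.labels = labels

    def __len__(self):
        return len(self.images)

    def __getitem__(self, idx):
        img = torch.from_numpy(self.images[idx].astype(np.float32))  # always a FloatTensor
        lab = int(self.labels[idx])                                   # always an int -> LongTensor after collate
        return img, lab

Then cast with .float().cuda() / .long().cuda() inside the training loop as needed.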


8.   File "/home/wenyu/anaconda3/lib/python3.6/site-packages/torch/autograd/variable.py", line 167, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, retain_variables)
  File "/home/wenyu/anaconda3/lib/python3.6/site-packages/torch/autograd/__init__.py", line 99, in backward
    variables, grad_variables, retain_graph)

RuntimeError: CUDNN_STATUS_MAPPING_ERROR


The number of classes the model outputs may not match the targets / loss function.
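One thing to verify (a sketch with made-up sizes):

import torch.nn as nn

n_classes = 21
classifier = nn.Linear(512, n_classes)   # the final layer must output n_classes scores
criterion = nn.CrossEntropyLoss()        # and the targets must lie in [0, n_classes)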


9. RuntimeError: CUDNN_STATUS_INTERNAL_ERROR

The model and the data may be on different GPUs; keep them on the same device.
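A sketch of pinning everything to one device ('model', 'inputs', 'targets' are placeholders):

import torch
from torch.autograd import Variable

torch.cuda.set_device(0)           # pick one GPU
model = model.cuda()
inputs = Variable(inputs.cuda())
targets = Variable(targets.cuda())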



