【ERROR】AssertionError: The NVIDIA driver on your system is too old (found version). Please upd

錯誤信息:

AssertionError:
The NVIDIA driver on your system is too old (found version 10000).
Please update your GPU driver by downloading and installing a new
version from the URL: http://www.nvidia.com/Download/index.aspx
Alternatively, go to: https://pytorch.org to install
a PyTorch version that has been compiled with your version
of the CUDA driver.

 出錯位置:

File "/users4/zsun/pytorch/OpenNMT-py/onmt/train_single.py", line 38, in configure_process
    torch.cuda.set_device(device_id)
  File "/users4/zsun/anaconda3/envs/onmt_summary100W/lib/python3.5/site-packages/torch/cuda/__init__.py", line 300, in set_device
    torch._C._cuda_setDevice(device)
  File "/users4/zsun/anaconda3/envs/onmt_summary100W/lib/python3.5/site-packages/torch/cuda/__init__.py", line 192, in _lazy_init
    _check_driver()
  File "/users4/zsun/anaconda3/envs/onmt_summary100W/lib/python3.5/site-packages/torch/cuda/__init__.py", line 111, in _check_driver
    of the CUDA driver.""".format(str(torch._C._cuda_getDriverVersion())))

由信息可以看出,要麼升級NVIDIA driver,要麼重新安裝本機CUDA driver能夠匹配的pytorh版本,說明我的機器上pytorch版本過新,而cuda版本太老,兩者無法匹配。cuda版本是服務器的我們無法更改,

 

查看一下本機cuda

參考鏈接:https://blog.csdn.net/leviopku/article/details/84851244

(onmt_summary100W) [zsun@gpu09 OpenNMT-py]$ cat /usr/local/cuda/version.txt
CUDA Version 10.0.130
(onmt_summary100W) [zsun@gpu09 OpenNMT-py]$ cat /usr/local/cuda/include/cudnn.h | grep CUDNN_MAJOR -A 2
#define CUDNN_MAJOR 7
#define CUDNN_MINOR 3
#define CUDNN_PATCHLEVEL 0
--
#define CUDNN_VERSION (CUDNN_MAJOR * 1000 + CUDNN_MINOR * 100 + CUDNN_PATCHLEVEL)

#include "driver_types.h"

查看一下pytroch的版本(也可以使用pip list查看)

>>> torch.__version__
'1.3.1'

這兩者經常同時出現,所以順便查一下

>>> torchvision.__version__
'0.4.2'

 

根據報錯信息提示,查看pytorch官網 https://pytorch.org/get-started/previous-versions/ 看到這樣一處

# CUDA 10.0
pip install torch==1.2.0 torchvision==0.4.0 -f https://download.pytorch.org/whl/torch_stable.html

發現可能是torch1.3對於cuda 10.0 來說版本過新,所以按照上述命令嘗試重新安裝一下torch==1.2.0

 

重新運行自己的程序

 

終於

 

錯誤消失!!!

 

 

 

 

 

 

 

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章