1.安裝docker-ce
因爲nvidia-docker2依賴docker-ce,所以得先安裝docker-ce。
1.1 如果主機上已經安裝了docker,則先運行如下命令卸載
service docker stop
yum remove docker
1.2 安裝yum管理依賴包
sudo yum install -y yum-utils device-mapper-persistent-data lvm2
1.3添加docker-ce倉庫命令
sudo yum-config-manager --add-repo https://download.docker.com/linux/centos/docker-ce.repo
如果報以下錯誤
sudo yum-config-manager --add-repo https://download.docker.com/linux/centos/docker-ce.repo
File "/bin/yum-config-manager", line 135
except yum.Errors.RepoError, e:
^
SyntaxError: invalid syntax
則運行如下命令修改/bin/yum-config-manager中的配置
vi /bin/yum-config-manager
按下 i 字母
把文件開頭 #!/usr/bin/python -tt 改爲 #!/usr/bin/python2 -tt
再按下esc鍵,鍵入:wq!保存。最後再次執行添加docker-ce倉庫命令
1.4 運行如下安裝docker-ce命令
sudo yum install docker-ce docker-ce-cli containerd.io
如果報以下錯誤
Transaction check error:
file /usr/bin/docker from install of docker-ce-cli-1:19.03.8-3.el7.x86_64 conflicts with file from package docker-common-2:1.13.1-109.gitcccb291.el7.centos.x86_64
file /usr/bin/dockerd from install of docker-ce-3:19.03.8-3.el7.x86_64 conflicts with file from package docker-common-2:1.13.1-109.gitcccb291.el7.centos.x86_64
Error Summary
則先運行如下命令刪除docker殘餘的組件
sudo yum erase docker-common-2:1.13.1-109.gitcccb291.el7.centos.x86_64
再次運行安裝docker-ce命令
1.5 開機啓動/啓動/查看狀態
systemctl enable docker // 設置開機啓動
systemctl start docker // 啓動docker
systemctl status docker //查看狀態
2.安裝nvidia-docker2
2.1設置倉庫
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.repo | \
sudo tee /etc/yum.repos.d/nvidia-docker.repo
2.2更新倉庫中的key
DIST=$(sed -n 's/releasever=//p' /etc/yum.conf)
DIST=${DIST:-$(. /etc/os-release; echo $VERSION_ID)}
sudo yum makecache
2.3安裝nvidia-docker2
sudo yum install nvidia-docker2
2.4 重新載入docker daemon的設定
sudo pkill -SIGHUP dockerd
2.5 測試是否安裝成功
docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
第一次運行會花幾分鐘下載組件,最後顯示如下結果則表示安裝成功
docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi
Wed Mar 25 04:58:46 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.00 Driver Version: 418.87.00 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce RTX 208... Off | 00000000:1A:00.0 Off | N/A |
| 16% 26C P8 1W / 250W | 0MiB / 10989MiB | 0% Default |
+-------------------------------+----------------------+----------------------+