Kolla-Ansible安裝OpenStack Ocata版ceph osd配置拷貝出錯導致nova-compute啓動超時失敗的問題

Kolla-Ansible安裝OpenStack Ocata版ceph osd配置拷貝失敗導致nova-compute啓動超時失敗的問題

環境配置:

OpenStack版本:Ocata

節點數:4個

各節點宿主操作系統:CentOS7.7

使用Kolla-Ansible安裝OpenStack,Ocata版本,進行多節點部署時候,遇到等待計算節點nova啓動的時候,出現了超時失敗,最終部署終止。錯誤信息如下:

TASK [nova : Waiting for nova-compute service up] *****************************************************************
FAILED - RETRYING: Waiting for nova-compute service up (20 retries left).
...
FAILED - RETRYING: Waiting for nova-compute service up (1 retries left).
fatal: [172.30.220.3 -> 172.30.220.3]: FAILED! => {"attempts": 20, "changed": false, "cmd": ["docker", "exec", "kolla_toolbox", "openstack", "--os-interface", "internal", "--os-auth-url", "http://172.30.230.3:35357", "--os-identity-api-version", "3", "--os-project-domain-name", "default", "--os-tenant-name", "admin", "--os-username", "admin", "--os-password", "3PxtKnvjKDTbPg2QT3llwig08efLoAgkdEY5VVoY", "--os-user-domain-name", "default", "compute", "service", "list", "-f", "json", "--service", "nova-compute"], "delta": "0:00:02.405043", "end": "2017-10-04 16:21:35.742909", "failed": true, "rc": 0, "start": "2017-10-04 16:21:33.337866", "stderr": "", "stderr_lines": [], "stdout": "[]", "stdout_lines": ["[]"]}

往前翻日誌,發現還有一處錯誤,但是部署並沒有因爲這個錯誤終止,錯誤內容如下:

TASK [ceph : Copying over config.json files for services] ******************************************************************************************************
ok: [Controller01] => (item=ceph-mon)
ok: [Compute01] => (item=ceph-mon)
ok: [Compute02] => (item=ceph-mon)
ok: [Compute03] => (item=ceph-mon)
failed: [Compute02] (item=ceph-osd) => {"failed": true, "item": "ceph-osd", "msg": "AnsibleUndefinedVariable: 'dict object' has no attribute 'ipv4'"}
failed: [Compute01] (item=ceph-osd) => {"failed": true, "item": "ceph-osd", "msg": "AnsibleUndefinedVariable: 'dict object' has no attribute 'ipv4'"}
failed: [Compute03] (item=ceph-osd) => {"failed": true, "item": "ceph-osd", "msg": "AnsibleUndefinedVariable: 'dict object' has no attribute 'ipv4'"}
ok: [Controller01] => (item=ceph-osd)
ok: [Compute02] => (item=ceph-rgw)
ok: [Compute01] => (item=ceph-rgw)
ok: [Compute03] => (item=ceph-rgw)
ok: [Controller01] => (item=ceph-rgw)

這個錯誤很奇怪,網上查找很久也沒有專門應對這個問題的解決方案,但是從問題的錯誤信息可以看出,可能是由於網絡方面的問題導致的。後來嘗試各種方法之後,發現各個節點上的NetworkManager服務還在運行,於是禁用了各個節點上的NetworkManager服務,再次重試部署後,上邊的問題就不會出現了,計算節點的nova-compute也可以正常啓動了。

後來查閱資料,OpenStack官方不推薦使用firewalldNetworkManager,因爲OpenStack使用iptables會和firewalld衝突,NetworkManager運行的話,會對網絡進行自動配置,而neutron`無法察覺到,會導致奇怪的現象出現。

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章