http://www.leadcom.com.cn/weibaozhishi/255.html
URL:
http://www.aixchina.net/blogs/mxin/archive/2006/8/5/1460.aspx
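Abstract:
Since IBM HACMP 5.2 was released, it has become fairly stable as of level 5.2.0.5 (written 5205 below), which has been thoroughly tested (see the testing chapter) and proven in practice (several systems have already failed over automatically and successfully). These notes use a fairly complex two-node mutual-takeover configuration as the example; similar setups can be simplified accordingly.
The example environment is AIX 5304 + HACMP 5205.
Chapter 1: Planning
Chapter 2: Installation
1 Installation prerequisites
1) Operating system version requirement:
AIX 5.203 or later
2) Environment requirements
- Verify users
HA requires every user involved in a takeover to exist on both nodes with exactly the same user ID.
For example, the oracle user on tlg1d0 is orarun, so orarun on tlg1a0 must be reserved for takeover, with ID 209 on both nodes.
The oracle user normally used on tlg1a0 is therefore renamed oraclt.
[tlg1a0][root][/]>lsuser -a id ALL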
root id=0
daemon id=1
bin id=2
sys id=3
adm id=4
uucp id=5
nobody id=-2
lpd id=9
lp id=11
invscout id=6
snapp id=200
ipsec id=201
nuucp id=7
radiusd id=202
ldap id=203
oradev id=204
tuxrun id=205
oraclt id=208
orarun id=209
sshd id=211
bsx1 id=210
[tlg1d0][root][/]>lsuser -a id ALL
root id=0
daemon id=1
bin id=2
sys id=3
adm id=4
uucp id=5
nobody id=-2
lpd id=9
lp id=11
invscout id=6
snapp id=200
ipsec id=201
nuucp id=7
radiusd id=202
ldap id=203
oradev id=204
orarun id=209
tuxrun id=205
oraclt id=208
bsx1 id=210
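The two listings above can be compared mechanically rather than by eye. The following is a minimal sketch, assuming the `lsuser -a id ALL` output of each node has been saved to a file; the helper name compare_uids and the file names are illustrative and not part of AIX or HACMP.

```shell
# compare_uids FILE_A FILE_B  (hypothetical helper, not an AIX command)
# Each file holds saved "lsuser -a id ALL" output: one "name id=N" pair per line.
# Reports users whose id differs, and users that exist on only one node.
compare_uids() {
    awk '
        FILENAME == ARGV[1] { a[$1] = $2; next }   # first file: remember name -> id=N
        {
            b[$1] = $2
            if (!($1 in a))        print "ONLY_IN_B", $1, $2
            else if (a[$1] != $2)  print "MISMATCH", $1, a[$1], $2
        }
        END {
            for (u in a) if (!(u in b)) print "ONLY_IN_A", u, a[u]
        }
    ' "$1" "$2"
}

# Example (file names are assumptions):
#   lsuser -a id ALL > /tmp/users.tlg1a0          # on tlg1a0
#   lsuser -a id ALL > /tmp/users.tlg1d0          # on tlg1d0, then copy it over
#   compare_uids /tmp/users.tlg1a0 /tmp/users.tlg1d0
```

Any MISMATCH line for a user that takes part in the takeover (orarun, tuxrun, bsx1 in this example) should be resolved before continuing.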
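- Verify file systems
HA requires that file systems involved in a takeover do not share a name across nodes. For example, the Oracle software directory on tlg1d0 is /ora10msrun, so /ora10msrun on tlg1a0 must be reserved for takeover; the local directory on tlg1a0 is renamed /ora10msclt.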
[tlg1d0][root][/]>df -k
Filesystem 1024-blocks Free %Used Iused %Iused Mounted on
/dev/hd4 524288 487820 7% 3276 3% /
/dev/hd2 7077888 1868516 74% 91290 18% /usr
/dev/hd9var 524288 458364 13% 991 1% /var
/dev/hd3 917504 826700 10% 120 1% /tmp
/dev/hd1 655360 524856 20% 291 1% /home
/proc - - - - - /proc
/dev/hd10opt 1179648 589072 51% 11370 8% /opt
/dev/lv00 131072 126912 4% 18 1% /var/adm/csd
/dev/fslv01 131072 130724 1% 4 1% /tftpboot
/dev/fslv00 6684672 3719036 45% 6 1% /export/eznim
/dev/ora10msrunlvbak 8912896 5165508 43% 24814 3% /ora10msrunbak
/dev/msrundatalvbak 13107200 2895980 78% 32 1% /msrundatabak
/dev/msrunloglvbak 5242880 3094480 41% 8185 2% /msrunlogbak
/dev/mksysblv 13107200 6238868 53% 5 1% /mksysb
/dev/tlg1d0_baklv 26214400 17495848 34% 1111 1% /tlg1d0_bak
/dev/msrundatalv 13107200 2896000 78% 31 1% /msrundata
/dev/msrunloglv 5242880 3112216 41% 7996 2% /msrunlog
/dev/ora10msrunlv 8912896 5173252 42% 23406 2% /ora10msrun
[tlg1a0][root][/]>df -k
Filesystem 1024-blocks Free %Used Iused %Iused Mounted on
/dev/hd4 655360 93968 86% 3450 14% /
/dev/hd2 8650752 3779000 57% 90347 10% /usr
/dev/hd9var 1179648 1001248 16% 1164 1% /var
/dev/hd3 1179648 1136192 4% 143 1% /tmp
/dev/hd1 655360 587592 11% 489 1% /home
/proc - - - - - /proc
/dev/hd10opt 1835008 1244528 33% 11374 4% /opt
/dev/lv00 131072 126912 4% 18 1% /var/adm/csd
/dev/fslv01 131072 118800 10% 7 1% /tftpboot
/dev/fslv00 7733248 960696 88% 27298 12% /export/eznim
/dev/fslv02 15335424 1143804 93% 4525 2% /soft_ins
/dev/tlg1a0_baklv 26214400 20751500 21% 16 1% /tlg1a0_bak
/dev/ora10mscltlvbak 13107200 9386904 29% 16852 1% /ora10mscltbak
/dev/tux81msrunlvbak 655360 407932 38% 2685 3% /tux81msrunbak
/dev/meslvbak 8912896 4815640 46% 6349 1% /BSMesWare/BSX1bak
/dev/xcomlvbak 5242880 5186260 2% 4178 1% /xcombak
/dev/ora10mscltlv 13107200 9386748 29% 16852 1% /ora10msclt
/dev/tux81msrunlv 786432 538952 32% 2685 3% /tux81msrun
/dev/meslv 14155776 10057876 29% 6343 1% /BSMesWare/BSX1
/dev/xcomlv 5242880 5186280 2% 4178 1% /xcom
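3) Package requirements:
RSCT 2.3.6.0 or later. Check with: lslpp -l | grep rsct
4) The following filesets must also be installed (the commands below can be copied and run as-is):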
lslpp -l rsct.*
lslpp -l bos.adt.lib
lslpp -l bos.adt.libm
lslpp -l bos.adt.syscalls
lslpp -l bos.net.tcp.client
lslpp -l bos.net.tcp.server
lslpp -l bos.rte.SRC
lslpp -l bos.rte.libc
lslpp -l bos.rte.libcfg
lslpp -l bos.rte.libcur
lslpp -l bos.rte.libpthreads
lslpp -l bos.rte.odm
5) Results on tlg1a0:
[tlg1a0][root][/]>lslpp -l rsct.*
lslpp -l bos.adt.lib
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
rsct.basic.hacmp 2.4.4.0 COMMITTED RSCT Basic Function (HACMP/ES
Support)
rsct.basic.rte 2.4.4.0 COMMITTED RSCT Basic Function
rsct.basic.sp 2.4.4.0 COMMITTED RSCT Basic Function (PSSP
Support)
rsct.clients.rte 99.99.999.999 COMMITTED Supersede Entry - Not really
installed
rsct.compat.basic.hacmp 2.4.4.0 COMMITTED RSCT Event Management Basic
Function (HACMP/ES Support)
rsct.compat.basic.rte 2.4.4.0 COMMITTED RSCT Event Management Basic
Function
rsct.compat.basic.sp 2.4.4.0 COMMITTED RSCT Event Management Basic
Function (PSSP Support)
rsct.compat.clients.hacmp 2.4.4.0 COMMITTED RSCT Event Management Client
Function (HACMP/ES Support)
rsct.compat.clients.rte 2.4.4.0 COMMITTED RSCT Event Management Client
Function
rsct.compat.clients.sp 2.4.4.0 COMMITTED RSCT Event Management Client
Function (PSSP Support)
rsct.core.auditrm 2.4.4.0 COMMITTED RSCT Audit Log Resource
Manager
rsct.core.errm 2.4.4.0 COMMITTED RSCT Event Response Resource
Manager
rsct.core.fsrm 2.4.4.0 COMMITTED RSCT File System Resource
Manager
rsct.core.gui 2.4.4.0 COMMITTED RSCT Graphical User Interface
rsct.core.hostrm 2.4.4.0 COMMITTED RSCT Host Resource Manager
rsct.core.lprm 2.4.4.0 COMMITTED RSCT Least Privilege Resource
Manager
rsct.core.rmc 2.4.4.0 COMMITTED RSCT Resource Monitoring and
Control
rsct.core.sec 2.4.4.0 COMMITTED RSCT Security
rsct.core.sensorrm 2.4.4.0 COMMITTED RSCT Sensor Resource Manager
rsct.core.sr 2.4.4.0 COMMITTED RSCT Registry
rsct.core.utils 2.4.4.0 COMMITTED RSCT Utilities
rsct.lapi.msg.EN_US.rte 2.4.0.0 COMMITTED RSCT LAPI Msgs - U.S. English
(UTF)
rsct.lapi.msg.en_US.rte 2.4.0.0 COMMITTED RSCT LAPI Msgs - U.S. English
rsct.lapi.nam 2.4.2.0 COMMITTED RSCT LAPI NAM Files
rsct.lapi.rte 2.4.2.0 COMMITTED RSCT LAPI Runtime
rsct.lapi.samp 2.4.2.0 COMMITTED RSCT LAPI Samples
rsct.msg.EN_US.core.auditrm
2.4.0.0 COMMITTED RSCT Audit Log RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.errm 2.4.0.0 COMMITTED RSCT Event Response RM Msgs -
U.S. English (UTF)
rsct.msg.EN_US.core.fsrm 2.4.0.0 COMMITTED RSCT File System RM Msgs -
U.S. English (UTF)
rsct.msg.EN_US.core.gui 2.4.0.0 COMMITTED RSCT GUI Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.hostrm
2.4.0.0 COMMITTED RSCT Host RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.lprm 2.4.0.0 COMMITTED RSCT LPRM Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.rmc 2.4.0.0 COMMITTED RSCT RMC Msgs - U.S. English
(UTF)
rsct.msg.EN_US.core.sec 2.4.0.0 COMMITTED RSCT Security Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.sensorrm
2.4.0.0 COMMITTED RSCT Sensor RM Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.sr 2.4.0.0 COMMITTED RSCT Registry Msgs - U.S.
English (UTF)
rsct.msg.EN_US.core.utils 2.4.0.0 COMMITTED RSCT Utilities Msgs - U.S.
English (UTF)
rsct.msg.en_US.basic.rte 2.4.0.0 COMMITTED RSCT Basic Msgs - U.S. English
rsct.msg.en_US.core.auditrm
2.4.0.0 COMMITTED RSCT Audit Log RM Msgs - U.S.
English
rsct.msg.en_US.core.errm 2.4.0.0 COMMITTED RSCT Event Response RM Msgs -
U.S. English
rsct.msg.en_US.core.fsrm 2.4.0.0 COMMITTED RSCT File System RM Msgs -
U.S. English
rsct.msg.en_US.core.gui 2.4.0.0 COMMITTED RSCT GUI Msgs - U.S. English
rsct.msg.en_US.core.gui.com
2.4.0.0 COMMITTED RSCT GUI JAVA Msgs - U.S.
English
rsct.msg.en_US.core.hostrm
2.4.0.0 COMMITTED RSCT Host RM Msgs - U.S.
English
rsct.msg.en_US.core.lprm 2.4.0.0 COMMITTED RSCT LPRM Msgs - U.S. English
rsct.msg.en_US.core.rmc 2.4.0.0 COMMITTED RSCT RMC Msgs - U.S. English
rsct.msg.en_US.core.rmc.com
2.4.0.0 COMMITTED RSCT RMC JAVA Msgs - U.S.
English
rsct.msg.en_US.core.sec 2.4.0.0 COMMITTED RSCT Security Msgs - U.S.
English
rsct.msg.en_US.core.sensorrm
2.4.0.0 COMMITTED RSCT Sensor RM Msgs - U.S.
English
rsct.msg.en_US.core.sr 2.4.0.0 COMMITTED RSCT Registry Msgs - U.S.
English
rsct.msg.en_US.core.utils 2.4.0.0 COMMITTED RSCT Utilities Msgs - U.S.
English
rsct.vsd.cmds 4.1.0.0 COMMITTED VSD Commands
rsct.vsd.rvsd 4.1.0.0 COMMITTED Recoverable VSD
rsct.vsd.vsdd 4.1.0.0 COMMITTED VSD Device Driver
rsct.vsd.vsdrm 4.1.0.0 COMMITTED VSD Resource Manager
Path: /etc/objrepos
rsct.basic.rte 2.4.4.0 COMMITTED RSCT Basic Function
rsct.compat.basic.rte 2.4.1.0 COMMITTED RSCT Event Management Basic
Function
rsct.core.auditrm 2.4.4.0 COMMITTED RSCT Audit Log Resource
Manager
rsct.core.errm 2.4.4.0 COMMITTED RSCT Event Response Resource
Manager
rsct.core.fsrm 2.4.4.0 COMMITTED RSCT File System Resource
Manager
rsct.core.hostrm 2.4.4.0 COMMITTED RSCT Host Resource Manager
rsct.core.lprm 2.4.4.0 COMMITTED RSCT Least Privilege Resource
Manager
rsct.core.rmc 2.4.4.0 COMMITTED RSCT Resource Monitoring and
Control
rsct.core.sec 2.4.4.0 COMMITTED RSCT Security
rsct.core.sensorrm 2.4.4.0 COMMITTED RSCT Sensor Resource Manager
rsct.core.sr 2.4.4.0 COMMITTED RSCT Registry
rsct.core.utils 2.4.4.0 COMMITTED RSCT Utilities
rsct.lapi.nam 2.4.2.0 COMMITTED RSCT LAPI NAM Files
rsct.lapi.rte 2.4.2.0 COMMITTED RSCT LAPI Runtime
rsct.vsd.cmds 4.1.0.0 COMMITTED VSD Commands
rsct.vsd.rvsd 4.1.0.0 COMMITTED Recoverable VSD
rsct.vsd.vsdd 4.1.0.0 COMMITTED VSD Device Driver
rsct.vsd.vsdrm 4.1.0.0 COMMITTED VSD Resource Manager
[tlg1a0][root][/]>lslpp -l bos.adt.lib
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.lib 5.3.0.40 COMMITTED Base Application Development
Libraries
[tlg1a0][root][/]>lslpp -l bos.adt.libm
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.libm 5.3.0.40 COMMITTED Base Application Development
Math Library
[tlg1a0][root][/]>lslpp -l bos.adt.syscalls
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.adt.syscalls 5.3.0.40 COMMITTED System Calls Application
Development Toolkit
[tlg1a0][root][/]>lslpp -l bos.net.tcp.client
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.net.tcp.client 5.3.0.41 COMMITTED TCP/IP Client Support
Path: /etc/objrepos
bos.net.tcp.client 5.3.0.10 COMMITTED TCP/IP Client Support
[tlg1a0][root][/]>lslpp -l bos.net.tcp.server
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.net.tcp.server 5.3.0.40 COMMITTED TCP/IP Server
Path: /etc/objrepos
bos.net.tcp.server 5.3.0.10 COMMITTED TCP/IP Server
[tlg1a0][root][/]>lslpp -l bos.rte.SRC
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.SRC 5.3.0.40 COMMITTED System Resource Controller
Path: /etc/objrepos
bos.rte.SRC 5.3.0.10 COMMITTED System Resource Controller
[tlg1a0][root][/]>lslpp -l bos.rte.libc
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libc 5.3.0.41 COMMITTED libc Library
[tlg1a0][root][/]>lslpp -l bos.rte.libcfg
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libcfg 5.3.0.40 COMMITTED libcfg Library
[tlg1a0][root][/]>lslpp -l bos.rte.libcur
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libcur 5.3.0.40 COMMITTED libcurses Library
[tlg1a0][root][/]>lslpp -l bos.rte.libpthreads
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.libpthreads 5.3.0.40 COMMITTED libpthreads Library
[tlg1a0][root][/]>lslpp -l bos.rte.odm
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
bos.rte.odm 5.3.0.40 COMMITTED Object Data Manager
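The level requirements above (RSCT 2.3.6.0 or later, for instance) are dotted four-field levels, so a plain string comparison gives wrong answers (5.2.0.25 would sort before 5.2.0.3). A minimal sketch of a numeric comparison follows; version_ge is a hypothetical helper name, and the levels are typed in by hand from the lslpp output rather than parsed.

```shell
# version_ge HAVE WANT  (hypothetical helper)
# Exits 0 when dotted level HAVE is greater than or equal to WANT.
# Both arguments are expected to have four numeric fields, like lslpp levels.
version_ge() {
    echo "$1 $2" | awk '{
        split($1, h, "."); split($2, w, ".")
        for (i = 1; i <= 4; i++) {
            if (h[i] + 0 > w[i] + 0) exit 0    # first differing field decides
            if (h[i] + 0 < w[i] + 0) exit 1
        }
        exit 0                                 # all four fields equal
    }'
}

# Example, using the rsct.core.rmc level shown above:
#   version_ge 2.4.4.0 2.3.6.0 && echo "RSCT level OK"
```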
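6) If you plan to configure concurrent resource groups (for example Oracle RAC), the following filesets are also required:
bos.rte.lvm.rte (5.2.0.25 or higher)
lslpp -l bos.rte.lvm.rte
lslpp -l bos.clvm.enh
2 Installation (note: perform on every machine)
1) Install HACMP 5.2
If installing from CD, insert the disc and run smitty install_latest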
Install Software
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* INPUT device / directory for software /dev/cd0
* SOFTWARE to install [_all_latest]
PREVIEW only? (install operation will NOT occur) no
COMMIT software updates? yes
SAVE replaced files? no
AUTOMATICALLY install requisite software? yes
EXTEND file systems if space needed? yes
OVERWRITE same or newer versions? no
VERIFY install and check file sizes? no
Include corresponding LANGUAGE filesets? yes
DETAILED output? no
Process multiple volumes? yes
ACCEPT new license agreements? yes
Preview new LICENSE agreements? no
If installing from a copy of the installation media, cd into the installp/ppc directory and run smitty install_latest
Install Software
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* INPUT device / directory for software .
* SOFTWARE to install [_all_latest]
…..
ACCEPT new license agreements? yes
Preview new LICENSE agreements? no
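When the installation finishes it reports failed. Check that every HACMP package was installed except the following:
cluster.hativoli
cluster.haview
netview
2) Patch to 5205
Note: do not skip patching HACMP. Patches matter a great deal for HACMP, and many known defects have already been fixed in them. When HACMP has been installed and configured strictly by the book and you still see all sorts of odd problems (takeover failing, IP takeover failing, machines going down on their own), the cause is usually the patch level. So pay close attention to this step. This installation uses HACMP 5205, that is, fix IY73937.
Run smitty install_latest and install everything.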
IY53044 - Latest HACMP for AIX R510 Fixes as of January 2004
[tlg1a0][root][/soft_ins/ha52/patch]>ls
.toc cluster.es.cspoc.dsh.5.2.0.21.bff
cluster.adt.es.client.include.5.2.0.3.bff cluster.es.cspoc.rte.5.2.0.3.bff
cluster.adt.es.client.include.5.2.0.31.bff cluster.es.cspoc.rte.5.2.0.31.bff
cluster.adt.es.client.samples.clinfo.5.2.0.3.bff cluster.es.ercmf.cmds.5.2.0.3.bff
cluster.adt.es.client.samples.clinfo.5.2.0.31.bff cluster.es.ercmf.cmds.5.2.0.31.bff
cluster.adt.es.client.samples.clstat.5.2.0.3.bff cluster.es.ercmf.rte.5.2.0.1.bff
cluster.adt.es.client.samples.clstat.5.2.0.31.bff cluster.es.ercmf.rte.5.2.0.11.bff
cluster.adt.es.client.samples.libcl.5.2.0.1.bff cluster.es.pprc.cmds.5.2.0.3.bff
cluster.adt.es.client.samples.libcl.5.2.0.11.bff cluster.es.pprc.cmds.5.2.0.31.bff
cluster.doc.en_US.es.html.5.2.0.1.bff cluster.es.pprc.rte.5.2.0.3.bff
cluster.doc.en_US.es.pdf.5.2.0.1.bff cluster.es.pprc.rte.5.2.0.31.bff
cluster.doc.en_US.es.pdf.5.2.0.11.bff cluster.es.server.cfgast.5.2.0.1.bff
cluster.doc.en_US.pprc.html.5.2.0.1.bff cluster.es.server.cfgast.5.2.0.11.bff
cluster.doc.en_US.pprc.html.5.2.0.11.bff cluster.es.server.diag.5.2.0.4.bff
cluster.doc.en_US.pprc.pdf.5.2.0.1.bff cluster.es.server.diag.5.2.0.41.bff
cluster.doc.en_US.pprc.pdf.5.2.0.11.bff cluster.es.server.events.5.2.0.3.bff
cluster.doc.en_US.websphere.html.5.2.0.1.bff cluster.es.server.events.5.2.0.31.bff
cluster.doc.en_US.websphere.html.5.2.0.11.bff cluster.es.server.rte.5.2.0.5.bff
cluster.doc.en_US.websphere.pdf.5.2.0.1.bff cluster.es.server.rte.5.2.0.51.bff
cluster.doc.en_US.websphere.pdf.5.2.0.11.bff cluster.es.server.testtool.5.2.0.1.bff
cluster.es.cfs.rte.5.2.0.3.bff cluster.es.server.utils.5.2.0.4.bff
cluster.es.cfs.rte.5.2.0.31.bff cluster.es.server.utils.5.2.0.41.bff
cluster.es.client.lib.5.2.0.4.bff cluster.es.svcpprc.cmds.5.2.0.1.bff
cluster.es.client.lib.5.2.0.41.bff cluster.es.svcpprc.cmds.5.2.0.11.bff
cluster.es.client.rte.5.2.0.4.bff cluster.es.svcpprc.rte.5.2.0.1.bff
cluster.es.client.rte.5.2.0.41.bff cluster.es.websphere.5.2.0.1.bff
cluster.es.client.utils.5.2.0.3.bff cluster.es.worksheets.5.2.0.3.bff
cluster.es.client.utils.5.2.0.31.bff cluster.man.en_US.es.data.5.2.0.1.bff
cluster.es.client.wsm.5.2.0.21.bff cluster.man.en_US.es.data.5.2.0.11.bff
cluster.es.cspoc.cmds.5.2.0.3.bff cluster.xd.glvm.5.2.0.1.bff
cluster.es.cspoc.cmds.5.2.0.31.bff cluster.xd.glvm.5.2.0.11.bff
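When the installation finishes it still reports failed. Check that everything was installed except the following:
cluster.doc.en_US.pprc
cluster.doc.en_US.websphere
cluster.es.ercmf
cluster.es.pprc
cluster.es.svcpprc
cluster.es.websphere
cluster.xd.glvm
The patches can be downloaded from the IBM website or requested through the 800-810-1818 hotline.
3) Reboot
Be sure to reboot the machine; otherwise the installation cannot proceed properly.
4) Verify the installation
- Check inittab: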
egrep "HA|clcomd" /etc/inittab
harc:2:wait:/usr/es/sbin/cluster/etc/harc.net # HACMP for AIX network startup
clcomdES:2:once:startsrc -s clcomdES >/dev/console 2>&1
clinit:a:wait:/bin/touch /usr/es/sbin/cluster/.telinit # HACMP for AIX These must be the last entries of run level a in inittab!
pst_clinit:a:wait:/bin/echo Created /usr/es/sbin/cluster/.telinit > /dev/console # HACMP for AIX These must be the last entries of run level a in inittab!
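For security, HACMP 5.1 stopped using the /.rhosts file to control command and data exchange between the two machines and introduced a new daemon, clcomd. If you open /etc/inittab after installing HACMP you will find that this line was added:
clcomdES:2:once:startsrc -s clcomdES >/dev/console 2>&1
- Check the installed filesets and patches (the key one is cluster.es.server.rte):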
lslpp -l cluster.*
Fileset Level State Description
----------------------------------------------------------------------------
Path: /usr/lib/objrepos
cluster.adt.es.client.include
5.2.0.3 COMMITTED ES Client Include Files
cluster.adt.es.client.samples.clinfo
5.2.0.3 COMMITTED ES Client CLINFO Samples
cluster.adt.es.client.samples.clstat
5.2.0.3 COMMITTED ES Client Clstat Samples
cluster.adt.es.client.samples.libcl
5.2.0.1 COMMITTED ES Client LIBCL Samples
cluster.adt.es.java.demo.monitor
5.2.0.0 COMMITTED ES Web Based Monitor Demo
cluster.doc.en_US.es.html 5.2.0.1 COMMITTED HAES Web-based HTML
Documentation - U.S. English
cluster.doc.en_US.es.pdf 5.2.0.1 COMMITTED HAES PDF Documentation - U.S.
English
cluster.es.cfs.rte 5.2.0.3 COMMITTED ES Cluster File System Support
cluster.es.client.lib 5.2.0.4 COMMITTED ES Client Libraries
cluster.es.client.rte 5.2.0.4 COMMITTED ES Client Runtime
cluster.es.client.utils 5.2.0.3 COMMITTED ES Client Utilities
cluster.es.client.wsm 5.2.0.2 COMMITTED Web based Smit
cluster.es.clvm.rte 5.2.0.0 COMMITTED ES for AIX Concurrent Access
cluster.es.cspoc.cmds 5.2.0.3 COMMITTED ES CSPOC Commands
cluster.es.cspoc.dsh 5.2.0.2 COMMITTED ES CSPOC dsh
cluster.es.cspoc.rte 5.2.0.3 COMMITTED ES CSPOC Runtime Commands
cluster.es.plugins.dhcp 5.2.0.0 COMMITTED ES Plugins - dhcp
cluster.es.plugins.dns 5.2.0.0 COMMITTED ES Plugins - Name Server
cluster.es.plugins.printserver
5.2.0.0 COMMITTED ES Plugins - Print Server
cluster.es.server.cfgast 5.2.0.1 COMMITTED ES Two-Node Configuration
Assistant
cluster.es.server.diag 5.2.0.4 COMMITTED ES Server Diags
cluster.es.server.events 5.2.0.3 COMMITTED ES Server Events
cluster.es.server.rte 5.2.0.5 COMMITTED ES Base Server Runtime
cluster.es.server.testtool
5.2.0.1 COMMITTED ES Cluster Test Tool
cluster.es.server.utils 5.2.0.4 COMMITTED ES Server Utilities
cluster.es.worksheets 5.2.0.3 COMMITTED Online Planning Worksheets
cluster.license 5.2.0.0 COMMITTED HACMP Electronic License
cluster.msg.en_US.cspoc 5.2.0.0 COMMITTED HACMP CSPOC Messages - U.S.
English
cluster.msg.en_US.es.client
5.2.0.0 COMMITTED ES Client Messages - U.S.
English
cluster.msg.en_US.es.server
5.2.0.0 COMMITTED ES Recovery Driver Messages -
U.S. English
Path: /etc/objrepos
cluster.es.client.rte 5.2.0.0 COMMITTED ES Client Runtime
cluster.es.clvm.rte 5.2.0.0 COMMITTED ES for AIX Concurrent Access
cluster.es.cspoc.rte 5.2.0.0 COMMITTED ES CSPOC Runtime Commands
cluster.es.server.diag 5.2.0.0 COMMITTED ES Server Diags
cluster.es.server.events 5.2.0.0 COMMITTED ES Server Events
cluster.es.server.rte 5.2.0.5 COMMITTED ES Base Server Runtime
cluster.es.server.utils 5.2.0.0 COMMITTED ES Server Utilities
Path: /usr/share/lib/objrepos
cluster.man.en_US.es.data 5.2.0.1 COMMITTED ES Man Pages - U.S. English
- Check the daemon
ps -ef |grep clcomd
root 115694 65676 0 Jun 07 - 2:09 /usr/es/sbin/cluster/clcomd -d
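The inittab check above can be turned into a small pass/fail helper. This is a sketch only: check_ha_inittab is a hypothetical name, and the four entry identifiers are simply the ones visible in the egrep output above, not an authoritative list for every HACMP level.

```shell
# check_ha_inittab [FILE]  (hypothetical helper; FILE defaults to /etc/inittab)
# Prints one line per expected HACMP entry and returns non-zero if any is missing.
check_ha_inittab() {
    file=${1:-/etc/inittab}
    rc=0
    for id in harc clcomdES clinit pst_clinit; do
        if grep -q "^${id}:" "$file"; then
            echo "ok      $id"
        else
            echo "MISSING $id"
            rc=1
        fi
    done
    return $rc
}

# Example:
#   check_ha_inittab || echo "inittab is missing HACMP entries"
```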
Chapter 3: Preparation
1. Edit .rhosts
vi /.rhosts
tlg1a0 root
tlg1a0_stb
tlg1a0_isvc
tlg1a0_boot
tlg1a0_svc
tlg1a0_stb
tlg1d0_isvc
tlg1d0_boot
tlg1d0_svc
tlg1d0_stb
tlg1a0_boot1
tlg1a0_l1_svc
tlg1a0_l1_stb
tlg1d0_boot1
tlg1d0_l1_svc
tlg1d0_l1_stb
tlg1a0_boot2
tlg1a0_l2_svc
tlg1a0_l2_stb
tlg1d0_boot2
tlg1d0_l2_svc
tlg1d0_l2_stb
tlg1a0_1580_svc
tlg1a0_1580_stb
tlg1d0_1580_svc
tlg1d0_1580_stb
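For security, HACMP 5.2 no longer uses the /.rhosts file to control command and data exchange between the two machines; the file /usr/es/sbin/cluster/etc/rhosts takes over that role.
Note: if communication between the two nodes runs into problems, check that rhosts file, or edit it to add the network information for both nodes. To make troubleshooting easier, keep /.rhosts and the HA rhosts file identical during configuration.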
cp /.rhosts /usr/es/sbin/cluster/etc/rhosts
rcp /.rhosts tlg1d0:/usr/es/sbin/cluster/etc/rhosts
rcp /.rhosts tlg1d0:/
2. Add the shared VGs and JFS2 file systems:
[tlg1d0][root][/]>lspv
hdisk0 00c1fe1f0215b425 rootvg active
hdisk1 00c1fe1f8d700839 rootvg active
hdisk3 none none
hdisk4 none none
hdisk6 00c1eedf177bd321 g1d0msd2vg active
hdisk2 00c1fe1f966aff5a g1d0msd1vg active
hdisk7 00c1eedf177ba8e3 g1d0msd2vg active
hdisk8 00c1eedf220b9697 None
hdisk9 00c1eedf220bd5f6 None
hdisk10 00c1fe1f220e7a99 g1d0msd3vg active
hdisk11 00c1fe1f220e7d28 g1d0msd3vg active
[tlg1a0][root][/]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg active
hdisk0 00c1eedf721dbee3 rootvg active
hdisk1 00c1eedf01de40b3 rootvg active
hdisk2 00c1eedf6ddb9f5e g1a0msa1vg active
hdisk4 00c1eedffa577dca g1a0msa2vg active
hdisk6 none none
hdisk7 none none
hdisk8 00c1eedf220bd5f6 g1a0msa3vg active
hdisk9 00c1eedffaa954cc g1a0msa3vg active
hdisk10 none None
hdisk11 none None
3. Rename the loglv
On tlg1a0 (the VGs of tlg1d0 need the same change)
- Inspect
[tlg1a0][root][/]>varyonvg g1a0msa2vg
[tlg1a0][root][/]>lsvg -l g1a0msa2vg
g1a0msa2vg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
ora10mscltlv jfs2 100 100 2 open/syncd /ora10msclt
tux81msrunlv jfs2 6 6 2 open/syncd /tux81msrun
meslv jfs2 108 108 2 open/syncd /BSMesWare/BSX1
xcomlv jfs2 40 40 2 open/syncd /xcom
loglv06 jfs2log 1 1 1 open/syncd N/A
Unmount every file system in the VG:
umount /ora10msclt
umount /tux81msrun
umount /BSMesWare/BSX1
umount /xcom
- Rename the loglv
[tlg1a0][root][/]> chlv -n g1a0msa2_loglv loglv06
0516-712 chlv: The chlv succeeded, however chfs must now be
run on every filesystem which references the old log name loglv06.
[tlg1a0][root][/]>lsvg -l g1a0msa2vg
g1a0msa2vg:
LV NAME TYPE LPs PPs PVs LV STATE MOUNT POINT
ora10mscltlv jfs2 100 100 2 closed/syncd /ora10msclt
tux81msrunlv jfs2 6 6 2 closed/syncd /tux81msrun
meslv jfs2 108 108 2 closed/syncd /BSMesWare/BSX1
xcomlv jfs2 40 40 2 closed/syncd /xcom
g1a0msa2_loglv jfs2log 1 1 1 closed/syncd N/A
- Update the file system to log mapping
vi /etc/filesystems:
/dev/loglv06 -> /dev/g1a0msa2_loglv
:g/loglv06/s//g1a0msa2_loglv/g
:wq
- Verify
mount /ora10msclt
mount /tux81msrun
mount /BSMesWare/BSX1
mount /xcom
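The manual vi substitution in /etc/filesystems can also be done non-interactively. The sketch below is one possible approach, not the author's method: relabel_jfs2log is a hypothetical helper that writes the edited file to stdout instead of changing it in place, and it is deliberately stricter than the vi command above, since it only rewrites a /dev/OLD reference at the end of a line.

```shell
# relabel_jfs2log OLD NEW [FILE]  (hypothetical helper; FILE defaults to /etc/filesystems)
# Writes FILE to stdout with each trailing "/dev/OLD" replaced by "/dev/NEW".
# FILE itself is not modified; review the output before installing it.
relabel_jfs2log() {
    old=$1 new=$2 file=${3:-/etc/filesystems}
    # Anchor at end of line so that e.g. loglv06 does not match loglv061.
    sed "s|/dev/${old}[[:space:]]*\$|/dev/${new}|" "$file"
}

# Example (file names are assumptions):
#   cp /etc/filesystems /etc/filesystems.bak
#   relabel_jfs2log loglv06 g1a0msa2_loglv > /tmp/filesystems.new
#   diff /etc/filesystems /tmp/filesystems.new    # inspect, then copy into place
```

The chlv message above points at chfs as the way to update each file system; this sketch only mirrors the manual edit used in this walkthrough.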
4. Reorganize the VGs
On each machine, vary off the VGs first, then run the following script (copy it into a text editor and substitute your actual VG names):
varyoffvg g1a0msa2vg
varyoffvg g1d0msd2vg
exportvg g1d0msd2vg
exportvg g1a0msa2vg
chdev -l hdisk6 -a pv=yes
chdev -l hdisk7 -a pv=yes
importvg -V 201 -n -y g1d0msd2vg hdisk6
varyonvg g1d0msd2vg
chvg -an g1d0msd2vg
chdev -l hdisk3 -a pv=yes
chdev -l hdisk4 -a pv=yes
importvg -V 101 -n -y g1a0msa2vg hdisk3
varyonvg g1a0msa2vg
chvg -an g1a0msa2vg
varyoffvg g1a0msa2vg
varyoffvg g1d0msd2vg
Verify:
[tlg1d0][root][/home/scripts/takeover]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg
hdisk0 00c1fe1f0215b425 rootvg active
hdisk1 00c1fe1f8d700839 rootvg active
hdisk4 00c1eedffa577dca g1a0msa2vg
hdisk6 00c1eedf177bd321 g1d0msd2vg
hdisk2 00c1fe1f966aff5a g1d0msd1vg active
hdisk7 00c1eedf177ba8e3 g1d0msd2vg
hdisk5 00c1eedfbf806475 ha_heart
hdisk8 00c1eedf220b9697 None
hdisk9 00c1eedf220bd5f6 None
hdisk10 00c1fe1f220e7a99 g1d0msd3vg active
hdisk11 00c1fe1f220e7d28 g1d0msd3vg active
[tlg1a0][root][/home/scripts/takeover]>lspv
hdisk3 00c1eedffa577b0e g1a0msa2vg
hdisk0 00c1eedf721dbee3 rootvg active
hdisk1 00c1eedf01de40b3 rootvg active
hdisk2 00c1eedf6ddb9f5e g1a0msa1vg active
hdisk4 00c1eedffa577dca g1a0msa2vg
hdisk6 00c1eedf177bd321 g1d0msd2vg
hdisk7 00c1eedf177ba8e3 g1d0msd2vg
hdisk8 00c1eedf220bd5f6 g1a0msa3vg active
hdisk9 00c1eedffaa954cc g1a0msa3vg active
hdisk5 00c1eedfbf806475 ha_heart
hdisk10 none None
hdisk11 none None
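Because hdisk numbers need not match between nodes, the reliable key for a shared disk is its PVID. The following is a minimal sketch for checking that every PVID visible on both nodes carries the same VG name; compare_pvids is a hypothetical helper and it assumes the lspv output of each node has been saved to a file.

```shell
# compare_pvids FILE_A FILE_B  (hypothetical helper)
# Each file holds saved "lspv" output from one node: disk, PVID, VG [, state].
# For every PVID seen in both files, report whether the VG name matches.
compare_pvids() {
    awk '
        $2 == "none" { next }                                  # no PVID, cannot be matched
        FILENAME == ARGV[1] { vg[$2] = $3; disk[$2] = $1; next }
        ($2 in vg) {
            status = (vg[$2] == $3) ? "same" : "DIFF"
            print status, $2, disk[$2] "=" vg[$2], $1 "=" $3
        }
    ' "$1" "$2"
}

# Example (file names are assumptions):
#   compare_pvids /tmp/lspv.tlg1d0 /tmp/lspv.tlg1a0
```

A DIFF line is not necessarily an error: a disk that belongs to a VG on one node and shows None on the other has simply not been imported there.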
5. Change the IP addresses
Change the IP addresses according to the plan with smitty tcpip. The final result:
[tlg1a0][root][/]>netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en0 1500 link#2 0.9.6b.dd.e1.aa 2177624 0 1368126 0 0
en0 1500 192.1.1 192.1.1.1 2177624 0 1368126 0 0
en1 1500 link#3 0.9.6b.dd.e1.ab 1928315 0 1222071 0 0
en1 1500 182.1.1 182.1.1.1 1928315 0 1222071 0 0
en2 1500 link#4 0.11.25.8.cf.23 1532962 0 868518 0 0
en2 1500 192.168.1 192.168.1.1 1532962 0 868518 0 0
en3 1500 link#5 0.11.25.8.cd.fa 1633025 0 969811 0 0
en3 1500 192.168.2 192.168.2.1 1633025 0 969811 0 0
lo0 16896 link#1 1290353 0 1291202 0 0
lo0 16896 127 127.0.0.1 1290353 0 1291202 0 0
lo0 16896 ::1 1290353 0 1291202 0 0
[tlg1d0][root][/]>netstat -in
Name Mtu Network Address Ipkts Ierrs Opkts Oerrs Coll
en0 1500 link#2 0.9.6b.dd.e5.64 1169895 0 1119848 0 0
en0 1500 192.1.1 192.1.1.2 1169895 0 1119848 0 0
en1 1500 link#3 0.9.6b.dd.e5.65 2070363 0 1607863 0 0
en1 1500 182.1.1 182.1.1.2 2070363 0 1607863 0 0
en2 1500 link#4 0.11.25.8.db.20 1647564 0 1178253 0 0
en2 1500 192.168.1 192.168.1.2 1647564 0 1178253 0 0
en3 1500 link#5 0.11.25.8.da.80 1631680 0 998077 0 0
en3 1500 192.168.2 192.168.2.2 1631680 0 998077 0 0
lo0 16896 link#1 1029297 0 1305172 0 0
lo0 16896 127 127.0.0.1 1029297 0 1305172 0 0
lo0 16896 ::1 1029297 0 1305172 0 0
6. Edit /etc/hosts
127.0.0.1 loopback localhost # loopback (lo0) name/address
182.1.1.21 hmc01
182.1.1.22 hmc02
##tlg1a0
182.1.1.1 tlg1a0_isvc tlg1a0_stb
168.120.1.201 tlg1a0_l1_svc
168.121.1.201 tlg1a0_l2_svc
140.100.100.101 tlg1a0_1580_svc
192.1.1.1 tlg1a0_boot
192.168.1.1 tlg1a0_boot1 tlg1a0
192.168.2.1 tlg1a0_boot2
190.2.206.11 tlg1a0_svc
##tlg1d0
182.1.1.2 tlg1d0_isvc tlg1d0_stb
192.1.1.2 tlg1d0_boot
192.168.1.2 tlg1d0_boot1 tlg1d0
192.168.2.2 tlg1d0_boot2
190.2.206.12 tlg1d0_svc
168.120.1.202 tlg1d0_l1_svc
168.121.1.202 tlg1d0_l2_svc
140.100.100.102 tlg1d0_1580_svc
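With this many boot, standby and service labels in one file, a label accidentally assigned to two addresses is easy to miss. The sketch below is an optional sanity check under that assumption; dup_host_labels is a hypothetical helper, not an AIX or HACMP tool.

```shell
# dup_host_labels [FILE]  (hypothetical helper; FILE defaults to /etc/hosts)
# Prints every host name or alias that is assigned to more than one address,
# followed by the first address seen and the conflicting one.
dup_host_labels() {
    awk '
        { sub(/#.*/, "") }                      # drop comments
        NF < 2 { next }                         # skip blank or address-only lines
        {
            for (i = 2; i <= NF; i++) {
                if (!($i in ip)) {
                    ip[$i] = $1                 # first address seen for this label
                } else if (ip[$i] != $1 && !($i in reported)) {
                    print $i, ip[$i], $1
                    reported[$i] = 1
                }
            }
        }
    ' "${1:-/etc/hosts}"
}

# Example: run on both nodes; no output means no label maps to two addresses.
#   dup_host_labels /etc/hosts
```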
7. Write the start and stop scripts
mkdir /usr/sbin/cluster/app
[tlg1a0][root][/usr/sbin/cluster/app]>ls
start_g1a0 start_g1d0 stop_g1a0 stop_g1d0
#start_g1a0
banner start tlg1a0
route delete 0
route add 0 190.2.206.240
route add -net 168.100.100 -netmask 255.255.255.0 168.121.1.240
su - bsx1 -c /home/scripts/tlg1a0/start_bsx1.sh
banner end tlg1a0
# stop_g1a0
banner stop tlg1a0
su - bsx1 -c /home/scripts/tlg1a0/stop_bsx1.sh &
sleep 15
su - bsx1 -c /home/scripts/tlg1a0/tux_shut_force.sh
banner end tlg1a0
# start_g1d0
banner start tlg1d0!
route delete 0
route add 0 190.2.206.240
route add -net 168.100.100.0 168.121.1.240
su - orarun -c /home/scripts/tlg1d0/startdb.sh
banner end start tlg1d0
#stop_g1d0
banner stop tlg1d0
su - orarun -c /home/scripts/tlg1d0/stopdb.sh
banner end tlg1d0
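When the scripts are done, copy them to the other node:
[tlg1a0][root][/usr/sbin/cluster/app]>rcp * tlg1d0:/usr/sbin/cluster/app
Note: make sure /etc/hosts and the start/stop scripts exist and are identical on both nodes.
The gateway (default route) must also be added in the start scripts.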
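8. Configure the tty heartbeat network / disk heartbeat
- Serial heartbeat (add on both nodes)
smitty tty -> Change / add a TTY -> rs232 -> sa -> port number: 0
If a serial adapter card is installed, take care not to use port 0 / tty0.
Verify:
a0: cat /etc/hosts>/dev/tty1
d0: cat </dev/tty1
The contents of /etc/hosts from tlg1a0 should appear on tlg1d0.
- Disk heartbeat
1. Create one shared disk; 1 GB is enough.
2. On both nodes, create an enhanced concurrent VG on this disk, named ha_heart; there is no need to vary it on.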
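9. syncd daemon flush frequency
This value is the interval at which in-memory data is flushed to disk. The default is 60; after installing HACMP it can usually be changed to 10:
smitty hacmp -> HACMP Extended Configuration -> Advanced Performance Tuning Parameters -> Change/Show syncd frequency -> 10
or: /usr/es/sbin/cluster/utilities/clchsyncd 10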
10. Copy users' .profile and other environment files between the nodes
[tlg1d0][root][/home/orarun]>rcp * tlg1a0:/home/orarun
[tlg1d0][root][/home/orarun]>rcp .profile tlg1a0:/home/orarun
[tlg1a0][root][/home/tuxrun]> rcp * tlg1d0:/home/tuxrun
[tlg1a0][root][/home/tuxrun]> rcp .profile tlg1d0:/home/tuxrun
11. Set the ownership and permissions of the file system directories so that both nodes match
[tlg1d0][root][/]>chown bsx1:bea /BSMesWare/BSX1
Chapter 4: Configuration
[tlg1a0][root][/]>smitty hacmp
1. Add the cluster and nodes
Initialization and Standard Configuration ->
Add Nodes to an HACMP Cluster ->
* Cluster Name [lg_cluster]
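New Nodes (via selected communication paths) [tlg1a0 tlg1d0]
Currently Configured Node(s)
This step matters: we normally use each node's boot address as the communication path. New nodes can be added together or one at a time. After you press Enter, the system discovers the HACMP resources on its own and displays the following:
......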
Communication path tlg1a0 discovered a new node. Hostname is tlg1a0. Adding it to the configuration with Nodename tlg1a0.
Communication path tlg1d0 discovered a new node. Hostname is tlg1d0. Adding it to the configuration with Nodename tlg1d0.
Discovering IP Network Connectivity
Retrieving data from available cluster nodes. This could take a few minutes....
Discovered [9] interfaces
IP Network Discovery completed normally
Current cluster configuration:
Cluster Name: lg_cluster
Cluster Connection Authentication Mode: Standard
Cluster Message Authentication Mode: None
Cluster Message Encryption: None
Use Persistent Labels for Communication: No
There are 2 node(s) and 3 network(s) defined
NODE tlg1a0:
Network net_ether_01
tlg1a0_boot 192.1.1.1
Network net_ether_02
tlg1a0_isvc 182.1.1.1
Network net_ether_03
tlg1a0_boot2 192.168.2.1
tlg1a0_boot1 192.168.1.1
NODE tlg1d0:
Network net_ether_01
tlg1d0_boot 192.1.1.2
Network net_ether_02
tlg1d0_isvc 182.1.1.2
Network net_ether_03
tlg1d0_boot1 192.168.1.2
……………………..
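2. Add networks and reassign interfaces
1) Reassign interfaces to networks
After the automatic discovery in the previous step, the networks and devices have generally already been added, so only minor corrections are needed.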
Extended Configuration
Extended Topology Configuration
Configure HACMP Communication Interfaces/Devices
Change/Show Communication Interfaces/Devices
Select en0 tlg1a0_boot
Change/Show a Communication Interface
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
Node Name [tlg1a0]
Network Interface en0
IP Label/Address tlg1a0_boot
Network Type ether
* Network Name [net_ether_01] -> net_ether_02
Change tlg1d0_boot in the same way:
Change/Show a Communication Interface
[Entry Fields]
Node Name [tlg1d0]
Network Interface en0
IP Label/Address tlg1d0_boot
Network Type ether
* Network Name [net_ether_01] -> net_ether_02
After this change, net_ether_01 no longer exists.
2) The heartbeat networks must be added manually:
Extended Configuration
Extended Topology Configuration
Configure HACMP Communication Interfaces/Devices
Add Communication Interfaces/Devices
Add Discovered Communication Interface and Devices
Communication Devices
- Serial heartbeat
# Node Device Device Path
tlg1a0 tty1 /dev/tty1
tlg1d0 tty1 /dev/tty1
- Disk heartbeat
# Node Device Device Path Pvid
tlg1a0 hdisk5 /dev/hdisk5 00c1eedfbf8 x
tlg1d0 hdisk5 /dev/hdisk5 00c1eedfbf8 x
Select each entry with F7 and press Enter to add the heartbeat networks. The topology is now configured.
3. Review and confirm the topology
Extended Configuration
Extended Topology Configuration
Show HACMP Topology
Show Cluster Topology
Cluster Name: lg_cluster
Cluster Connection Authentication Mode: Standard
Cluster Message Authentication Mode: None
Cluster Message Encryption: None
Use Persistent Labels for Communication: No
NODE tlg1a0:
Network net_diskhb_01
tlg1a0_hdisk5_01 /dev/hdisk5
Network net_ether_02
tlg1a0_boot 192.1.1.1
tlg1a0_isvc 182.1.1.1
Network net_ether_03
tlg1a0_boot1 192.168.1.1
tlg1a0_boot2 192.168.2.1
NODE tlg1d0:
Network net_diskhb_01
tlg1d0_hdisk5_01 /dev/hdisk5
Network net_ether_02
tlg1d0_isvc 182.1.1.2
tlg1d0_boot 192.1.1.2
Network net_ether_03
tlg1d0_boot2 192.168.2.2
tlg1d0_boot1 192.168.1.2
The topology now matches the plan, so we can continue.
4. Add the highly available resources
(service IP, application server, VG and JFS)
1) Add the application servers
Extended Configuration
Extended Resource Configuration
HACMP Extended Resources Configuration
Configure HACMP Applications
Configure HACMP Application Servers
Add an Application Server
* Server Name [g1a0_app]
* Start Script [/usr/sbin/cluster/app/start_g1a0]
* Stop Script [/usr/sbin/cluster/app/stop_g1a0]
Application Monitor Name(s)
Add g1d0_app in the same way:
* Server Name [g1d0_app]
* Start Script [/usr/sbin/cluster/app/start_g1d0]
* Stop Script [/usr/sbin/cluster/app/stop_g1d0]
2) Add the service IPs
Extended Configuration
Extended Resource Configuration
HACMP Extended Resources Configuration
Configure HACMP Service IP Labels/Addresses
Add a Service IP Label/Address
Configurable on Multiple Nodes
net_ether_02 (192.1.1.0/24 182.1.1.0/24)
* IP Label/Address tlg1d0_svc
* Network Name net_ether_02
Alternate HW Address to accompany IP Label/Address []
Add tlg1a0_svc in the same way.
net_ether_03 (192.168.1.0/24 192.168.2.0/24)
Likewise add tlg1a0_1580_svc, tlg1d0_1580_svc, tlg1a0_l1_svc, tlg1d0_l1_svc, tlg1a0_l2_svc, tlg1d0_l2_svc
3) Create the resource groups
Extended Configuration
Extended Resource Configuration
HACMP Extended Resource Group Configuration
Add a Resource Group
Add a Resource Group (extended)
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* Resource Group Name [tlg1a0_RG]
* Participating Nodes (Default Node Priority) [tlg1a0 tlg1d0]
Startup Policy Online On Home Node Only
Fallover Policy Fallover To Next Priority Node In The List
Fallback Policy Fallback To Higher Priority Node In The List
Create tlg1d0_RG in the same way:
….
Resource Group Name [tlg1d0_RG]
* Participating Nodes (Default Node Priority) [tlg1d0 tlg1a0]
…
4) Configure the resource groups
Extended Configuration
Extended Resource Configuration
HACMP Extended Resource Group Configuration
Change/Show Resources and Attributes for a Resource Group
tlg1a0_RG
Change/Show All Resources and Attributes for a Resource Group
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
Resource Group Name tlg1a0_RG
Participating Nodes (Default Node Priority) tlg1a0 tlg1d0
Startup Policy Online On Home Node Only
Fallover Policy Fallover To Next Priority Node In The List
Fallback Policy Fallback To Higher Priority Node In The List
Fallback Timer Policy (empty is immediate) []
Service IP Labels/Addresses [tlg1a0_1580_svc
tlg1a0_l1_svc tlg1a0_l2_svc tlg1a0_svc]
Application Servers [g1a0_app]
Volume Groups [g1a0msa2vg]
Use forced varyon of volume groups, if necessary false
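Configure tlg1d0_RG the same way.
5. Verify and synchronize the HACMP configuration
(Note: steps 1-4 were all done on tlg1a0; now force-synchronize the configuration to tlg1d0.)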
Extended Configuration
Extended Verification and Synchronization
HACMP Verification and Synchronization
Type or select values in entry fields.
Press Enter AFTER making all desired changes.
[Entry Fields]
* Verify, Synchronize or Both [Both]
* Automatically correct errors found during [Yes]
verification?
* Force synchronization if verification fails? [Yes]
* Verify changes only? [No]
* Logging [Standard]
Command: OK stdout: yes stderr: no
Before command completion, additional instructions may appear below.
[MORE...5]
Verification will automatically correct verification errors.
Retrieving data from available cluster nodes. This could take a few minutes......
Verifying Cluster Topology...
Verifying Cluster Resources...
processing RG: tlg1a0_RG
processing RG: tlg1d0_RG
ERROR: The HACMP timestamp file for shared volume group: g1a0msa2vg is inconsistent
with the time stamp in the VGDA for the following nodes: tlg1a0 tlg1d0
ERROR: The HACMP timestamp file for shared volume group: g1d0msd2vg is inconsistent
with the time stamp in the VGDA for the following nodes: tlg1a0 tlg1d0
Starting Corrective Action: cl_resource_modify_vg_time_stamp.
Setting HACMP timestamp for volume group: g1a0msa2vg to 44a762222f495eff on node: tlg1a0: PASS
Setting HACMP timestamp for volume group: g1a0msa2vg to 44a762222f495eff on node: tlg1d0: PASS
Setting HACMP timestamp for volume group: g1d0msd2vg to 44a7654527e38a67 on node: tlg1a0: PASS
Setting HACMP timestamp for volume group: g1d0msd2vg to 44a7654527e38a67 on node: tlg1d0: PASS
-----------------------------------------------------------------
1 problem(s) were resolved.
Remember to redo automatic error notification if configuration has changed.
Updating ODM errnotify on node tlg1d0.
Adding any necessary HACMP entries to /etc/inittab and /etc/rc.net for IPAT on node tlg1a0.
Updating ODM errnotify on node tlg1a0.
Verification has completed normally.
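If the verification output above is saved to a file, a quick scan can confirm that it finished normally and show how many ERROR lines were reported before auto-correction. The following is only a minimal sketch; the function names are hypothetical, and the file is assumed to be a plain-text copy of the SMIT output.

```shell
#!/bin/sh
# Hypothetical helpers for a saved copy of the verification output.

# verification_errors FILE: print the number of lines that begin with "ERROR:".
verification_errors() {
    grep -c '^[[:space:]]*ERROR:' "$1"
}

# verification_ok FILE: succeed only if the completion message is present.
verification_ok() {
    grep -q 'Verification has completed normally' "$1"
}
```

For the run shown above, verification_errors would report 2 (one per shared volume group), and verification_ok would succeed because the corrective action fixed both.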
Remember to edit /etc/hosts so that the host names point to the svc (service) addresses.
##tlg1a0
182.1.1.1 tlg1a0_isvc tlg1a0_stb
168.120.1.201 tlg1a0_l1_svc
168.121.1.201 tlg1a0_l2_svc
140.100.100.101 tlg1a0_1580_svc
192.1.1.1 tlg1a0_boot
192.168.1.1 tlg1a0_boot1
192.168.2.1 tlg1a0_boot2
190.2.206.11 tlg1a0_svc tlg1a0
##tlg1d0
182.1.1.2 tlg1d0_isvc tlg1d0_stb
192.1.1.2 tlg1d0_boot
192.168.1.2 tlg1d0_boot1
192.168.2.2 tlg1d0_boot2
190.2.206.12 tlg1d0_svc tlg1d0
168.120.1.202 tlg1d0_l1_svc
168.121.1.202 tlg1d0_l2_svc
140.100.100.102 tlg1d0_1580_svc
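Since a hand-edited /etc/hosts is easy to get wrong on one of the two nodes, a small check for repeated addresses or names can help before starting the cluster. This is a sketch only; check_hosts is a hypothetical helper, not an HACMP tool, and it assumes the usual "address name [alias...]" layout.

```shell
#!/bin/sh
# check_hosts FILE: report addresses or names that appear more than once
# in a hosts-format file. Comment lines and blank lines are skipped.
check_hosts() {
    awk '
        /^[ \t]*#/ || NF == 0 { next }        # skip comments and blank lines
        {
            if (seen_ip[$1]++) print "duplicate address: " $1
            for (i = 2; i <= NF; i++) {
                if ($i ~ /^#/) break          # ignore a trailing comment
                if (seen_name[$i]++) print "duplicate name: " $i
            }
        }
    ' "$1"
}
```

Running check_hosts /etc/hosts on both nodes and comparing the results is one way to use it; no output means no duplicates were found.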
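At this point the basic HACMP configuration is complete.
6. Confirm the HACMP configuration
1) Adjust SNMP (not needed on AIX 5.3)
On AIX 5.2, SNMP needs some adjustment before the real HACMP status can be seen.
Specifically, SNMP on AIX 5.2 defaults to version 3: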
scmb02:/usr/sbin>ls -l |grep snmp
lrwxrwxrwx 1 root system 8 Apr 08 17:55 clsnmp -> clsnmpne
-rwxr-x--- 1 root system 83150 Mar 12 2003 clsnmpne
-rwxr-x--- 1 root system 55110 Mar 12 2003 pppsnmpd
lrwxrwxrwx 1 root system 9 Apr 08 17:55 snmpd -> snmpdv3ne
HACMP, however, supports only SNMP version 1, so we switch the daemon:
stopsrc -s snmpd
/usr/sbin/snmpv3_ssw -1
startsrc -s snmpd
scmb02:/usr/sbin>ls -l |grep snmp
lrwxrwxrwx 1 root system 18 Apr 21 13:40 clsnmp -> /usr/sbin/clsnmpne
-rwxr-x--- 1 root system 83150 Mar 12 2003 clsnmpne
-rwxr-x--- 1 root system 55110 Mar 12 2003 pppsnmpd
lrwxrwxrwx 1 root system 17 Apr 21 13:40 snmpd -> /usr/sbin/snmpdv1
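The before and after listings differ only in what the snmpd symlink points to, so the check can be scripted. The sketch below assumes the target names seen in the listings above (snmpdv3ne, snmpdv1); both function names are hypothetical.

```shell
#!/bin/sh
# snmpd_link_target [PATH]: print the last field of "ls -l" for PATH,
# which for a symlink is its target (default path: /usr/sbin/snmpd).
snmpd_link_target() {
    ls -l "${1:-/usr/sbin/snmpd}" 2>/dev/null | awk '{ print $NF }'
}

# snmpd_version TARGET: classify a symlink target as v1, v3 or unknown.
snmpd_version() {
    case "$1" in
        *snmpdv1)  echo v1 ;;
        *snmpdv3*) echo v3 ;;
        *)         echo unknown ;;
    esac
}
```

For example, snmpd_version "$(snmpd_link_target)" should print v1 after the switch shown above.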
2) When starting HACMP, select:
* Start now, on system restart or both now
Start Cluster Services on these nodes [scmb02]
BROADCAST message at startup? true
Startup Cluster Lock Services? false
Startup Cluster Information Daemon? true
Reacquire resources after forced down ? false
3) Start clstat
cd /usr/sbin/cluster
clstat
clstat - HACMP Cluster Status Monitor
-------------------------------------
Cluster: lg_cluster (1145945461)
Sun Jul 2 16:54:16 BEIST 2006
State: UP Nodes: 2
SubState: STABLE
Node: tlg1a0 State: UP
Interface: tlg1a0_isvc (1) Address: 182.1.1.1
State: UP
Interface: tlg1a0_boot (1) Address: 192.1.1.1
State: UP
Interface: tlg1a0_boot1 (2) Address: 192.168.1.1
State: UP
Interface: tlg1a0_boot2 (2) Address: 192.168.2.1
State: UP
Interface: tlg1a0_heart (0) Address: 0.0.0.0
State: UP
Interface: tlg1a0_1580_svc (2) Address: 140.100.100.101
State: UP
Interface: tlg1a0_l1_svc (2) Address: 168.120.1.201
State: UP
Interface: tlg1a0_l2_svc (2) Address: 168.121.1.201
State: UP
Interface: tlg1a0_svc (1) Address: 190.2.206.11
State: UP
Resource Group: tlg1a0_RG State: On line
Node: tlg1d0 State: UP
Interface: tlg1d0_isvc (1) Address: 182.1.1.2
State: UP
Interface: tlg1d0_boot (1) Address: 192.1.1.2
State: UP
Interface: tlg1d0_boot1 (2) Address: 192.168.1.2
State: UP
Interface: tlg1d0_boot2 (2) Address: 192.168.2.2
State: UP
Interface: tlg1d0_heart (0) Address: 0.0.0.0
State: UP
Interface: tlg1d0_1580_svc (2) Address: 140.100.100.102
State: UP
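With this many interfaces, reading the clstat screen by eye is error-prone. As a rough sketch, a saved text copy of the output can be filtered for interfaces whose state is not UP. The function name is hypothetical, and the parser assumes the layout shown above: an "Interface:" line followed by its own "State:" line.

```shell
#!/bin/sh
# clstat_down_interfaces [FILE]: read saved clstat text (file or stdin) and
# print "name state" for every interface whose state is not UP.
clstat_down_interfaces() {
    awk '
        $1 == "Interface:" { ifname = $2; next }   # remember the interface
        $1 == "State:" && ifname != "" {           # its state line follows
            if ($2 != "UP") print ifname " " $2
            ifname = ""
        }
    ' "$@"
}
```

Cluster-level and node-level State fields are ignored, because no interface name is pending when they are read.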
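Chapter 5 Testing
Too tired to write this up here; see the separate document 《網段合併實現及交換機對HA影響測試報告.doc》 (a test report on merging the network segments and on how the switches affect HA).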
Published Saturday, August 05, 2006 4:49 PM. Author: mxin