這次做了一套核心系統的ogg同步,一下是遇到的report報錯,做個記錄
問題一: OGG-01705 Input checkpoint position.. for input trail file..greater than the size of the file
2019-09-10 09:28:08 ERROR OGG-01705 Input checkpoint position 314748185 for input trail file '/oggfs/dirdat/af000068' is greater than the size of the file (31469181
8). Please consult Oracle Knowledge Management Doc ID 1138409.1. for instructions.
報錯已經給出了mos文檔號,直接照這mos操作就可以了
原因很簡單,因爲seq號超過了隊列文件大小,所以ogg找不到這個seq號
我這裏的隊列文件大小是314691818<314748185我輸入的seq號
[oracle@jcyqdb02:10.176.34.111:/oggfs]$ ls -lrt /oggfs/dirdat/af0*
-rw-r----- 1 oracle oinstall 1023999656 Sep 6 16:21 /oggfs/dirdat/af000056
-rw-r----- 1 oracle oinstall 158146254 Sep 7 01:13 /oggfs/dirdat/af000057
-rw-r----- 1 oracle oinstall 1023999701 Sep 8 05:02 /oggfs/dirdat/af000058
-rw-r----- 1 oracle oinstall 1023999995 Sep 9 02:11 /oggfs/dirdat/af000059
-rw-r----- 1 oracle oinstall 1023999878 Sep 9 03:44 /oggfs/dirdat/af000060
-rw-r----- 1 oracle oinstall 1023999622 Sep 9 05:30 /oggfs/dirdat/af000061
-rw-r----- 1 oracle oinstall 1023999902 Sep 9 06:33 /oggfs/dirdat/af000062
-rw-r----- 1 oracle oinstall 1023999726 Sep 9 07:19 /oggfs/dirdat/af000063
-rw-r----- 1 oracle oinstall 1023999847 Sep 9 08:37 /oggfs/dirdat/af000064
-rw-r----- 1 oracle oinstall 1023999784 Sep 9 10:43 /oggfs/dirdat/af000065
-rw-r----- 1 oracle oinstall 1023999817 Sep 9 15:21 /oggfs/dirdat/af000066
-rw-r----- 1 oracle oinstall 1023999639 Sep 10 05:17 /oggfs/dirdat/af000067
-rw-r----- 1 oracle oinstall 314691818 Sep 10 09:13 /oggfs/dirdat/af000068
-rw-r----- 1 oracle oinstall 135205227 Sep 10 09:27 /oggfs/dirdat/af000069
-rw-r----- 1 oracle oinstall 281407673 Sep 10 10:28 /oggfs/dirdat/af000070
-rw-r----- 1 oracle oinstall 230887576 Sep 10 10:54 /oggfs/dirdat/af000071
-rw-r----- 1 oracle oinstall 516095204 Sep 10 10:59 /oggfs/dirdat/af000072
解決辦法:
start * filterduptransactions
或者是seq輸入有問題
問題二:OGG-01028: Incompatible record (102)
OGG-01028 Incompatible record (102) in /oggfs/dirdat/ac000061, rba 117653457 (getting header).
記錄損壞,也可能是整個隊列文件損壞
使用logdump去定位問題
Logdump 217 >open /oggfs/dirdat/ac000061
Current LogTrail is /oggfs/dirdat/ac000061
Logdump 224 >pos 117653457
Reading forward from RBA 117653457
Logdump 219 >count
** Count begins at RBA 117344506
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | .................. --說明記錄損壞
LogTrail /oggfs/dirdat/ac000061 has 1036 records
Total Data Bytes 207204
Avg Bytes/Record 200
Insert 1036
After Images 1036
Average of 6 Transactions
Bytes/Trans ..... 42822
Records/Trans ... 172
Files/Trans ..... 1
Logdump 238 >n
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | ..................
Logdump 239 >n
Bad record found at (RBA 117653457, format 5.50 Unknown TokenID ffff)
4701 012a 4800 0036 4504 0041 00c8 05ff 02f2 b85a | G..*H..6E..A.......Z
964e 4b79 0000 0000 0bf7 e9dc 0000 44fb 0152 0000 | .NKy..........D..R..
0001 4550 4d5f 53ff ffff ffff ffff ffff ffff ffff | ..EPM_S.............
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff ffff | ....................
ffff ffff ffff ffff ffff ffff ffff ffff ffff | ..................
--多次next不會調到下一條記錄,說明整個隊列文件損壞
alter replicat1 extseqno 62 extrba nextnumber1--跳過這個記錄
alter replicat1 seqno 63 --跳過整個隊列文件
根據隊列文件生成時間,去源端重傳隊列文件,並在rba事務發生時間之前去啓動replicat並加上handcollision參數
(在有主鍵或者有唯一索引的情況下handcollision也可以保證數據一致性,如果沒有數據無法保證一致)
(其實這個問題相對要複雜一點點,隊列文件損壞需要確認投遞過程中損壞還是抽取損壞,如果是投遞損壞可以重新投遞,如果是抽取損壞需要重新抽取並投遞)
問題三:
OGG-01032:There is a problem in network communication
2019-09-10 11:03:03 ERROR OGG-01032 There is a problem in network communication, a remote file problem, encryption keys for target and source do not match (if using
ENCRYPT) or an unknown error. Length is 3 - 000000: 46 45 00 |FE. |.
2019-09-10 11:03:03 ERROR OGG-01668 PROCESS ABENDING.
初看以爲是網絡問題,直接去找網絡工程師,結果非常尷尬
實際上目標端mgr都掛了,端口沒開
問題四:
GGSCI (cdyxdb2) 64> alter DPEYKA etrollover
2019-09-10 11:21:45 INFO OGG-01520 Rollover performed. For each affected output trail of Version 10 or higher format, after starting the source extract, issue ALTER EXTSEQNO for that trail's reader (either pump EXTRACT or REPLICAT) to move the reader's scan to the new trail file; it will not happen automatically.
EXTRACT altered.
GGSCI (cdyxdb2) 94> !
info DPEYKA
EXTRACT DPEYKA Last Started 2019-09-10 11:22 Status RUNNING
Checkpoint Lag 04:21:44 (updated 00:00:20 ago)
Log Read Checkpoint File ./dirdat/ya010454
2019-09-10 07:00:41.000000 RBA 92947305
GGSCI (cdyxdb2) 95> !
info DPEYKA
EXTRACT DPEYKA Last Started 2019-09-10 11:22 Status RUNNING
Checkpoint Lag 04:16:43 (updated 00:00:00 ago)
Log Read Checkpoint File ./dirdat/ya010455
2019-09-10 07:06:25.000000 RBA 481294966