Dec 17 09:33:25 cre1r08n06 kernel: [946277.337433] {5}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 4
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337436] {5}[Hardware Error]: It has been corrected by h/w and requires no further action
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337438] {5}[Hardware Error]: event severity: corrected
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337440] {5}[Hardware Error]: Error 0, type: corrected
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337441] {5}[Hardware Error]: fru_text: A7
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337443] {5}[Hardware Error]: section_type: memory error
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337444] {5}[Hardware Error]: error_status: 0x0000000000000400
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337446] {5}[Hardware Error]: physical_address: 0x0000002553b14040
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337449] {5}[Hardware Error]: node: 0 card: 2 module: 1 rank: 1 bank: 0 row: 19064 column: 768
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337450] {5}[Hardware Error]: error_type: 2, single-bit ECC
Dec 17 09:33:25 cre1r08n06 kernel: [946277.337458] EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
Dec 17 09:33:26 cre1r08n06 kernel: [946277.638033] EDAC MC0: 0 CE memory read error on CPU_SrcID#0_Channel#0_DIMM#0 (channel:0 slot:0 page:0x2553b14 offset:0x40 grain:32 syndrome:0x0 - area:DRAM err_code:0000:009f socket:0 channel_mask:1 rank:0)
Hardware error from APEI Generic Hardware Error Source: %d 的日誌是由 drivers/acpi/apei/ghes.c : __ghes_print_estatus()函數打印的。
Reference
1. ACPI, APEI support
2. Documentation/acpi/apei/einj.txt
3. Documentation/acpi/apei/output_format.txt