mchehab/rasdaemon

`sudo ras-mc-ctl --error-count` not listing `Corrected error` event?

ecclex opened this issue · 0 comments

ecclex commented

I decided to open this issue after casually mentioning it here.

After tightening my RAM timings to simulate a mce: [Hardware Error]: Machine check events logged event/ECC error correction report, I get this result with my computer still running:

sudo ras-mc-ctl --summary
No Memory errors.

No PCIe AER errors.

No ARM processor errors.

No Extlog errors.

No devlink errors.
No disk errors.
MCE records summary:
	1 Corrected error, no action required. errors
sudo ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No ARM processor errors.

No Extlog errors.

No devlink errors.

No disk errors.

MCE events:
1 .. error: Corrected error, no action required., CPU 2, bank Unified Memory Controller (bank=21), mcg mcgstatus=0, mci Error_overflow CECC, memory_channel=0,csrow=2, mcgcap=0x0000011d, status=0xdc2040000400011b, addr=0x7d41b7940, misc=0xd01a000601000000, walltime=0x6506e4a4, cpuid=0x00a60f12, bank=0x00000015

But

sudo ras-mc-ctl --error-count
Label               	CE	UE
mc#0csrow#3channel#0	0	0
mc#0csrow#2channel#1	0	0
mc#0csrow#2channel#0	0	0
mc#0csrow#3channel#1	0	0

isn't displaying anything?

hw: ASUS TUF GAMING B650-PLUS, 7800X3D and 2xKSM48E40BD8KM-32HM.
sw: arch, ras 0.8.0.tar.bz2 though aur.archlinux.org/packages/rasdaemon.