Xeon E5-2630L v2 with 4 x 32GB LRDIMM. There's some kind of incompatibility (buggy BIOS perhaps?) which causes memory errors to start spewing out whenever the 32GB sticks are used. 16GB sticks work fine.
Here's an example of the error:
The odd thing is that the errors begin almost at exactly the same point after boot. I measured from the last log entry at boot, to the first log entry reporting a memory error, and repeated several times. With the exception of the first, all failures begin within a few seconds of each other... all just after the 16 minute mark. The machine is idle.
1. 16:25 (first memory error reported 16 min 25 sec after boot)
2. 16:08
3. 16:12
4. 16:10
5. 16:12
(In hindsight, measuring from the *first* log entry at boot may have been less variable.)
The same thing happens when I boot to single user. Although I don't have a precise timestamp from written logs, I watched the console near the 16 minute point, and shortly after that the MCA errors started.
By comparison, I booted a Linux on USB kernel, and the memory errors start immediately.
So... what happens for the first 16 minutes after FreeBSD boot that "prevents" (or possibly ignores) memory errors?!
FreeBSD 12.1-RELEASE, fresh install.
Here's an example of the error:
Code:
Aug 25 11:25:44 test kernel: MCA: Bank 7, Status 0xcc0009c000010090
Aug 25 11:25:44 test kernel: MCA: Global Cap 0x0000000001000c17, Status 0x0000000000000000
Aug 25 11:25:44 test kernel: MCA: Vendor "GenuineIntel", ID 0x306e4, APIC ID 0
Aug 25 11:25:44 test kernel: MCA: CPU 0 COR (39) OVER RD channel 0 memory error
Aug 25 11:25:44 test kernel: MCA: Address 0xba909040
Aug 25 11:25:44 test kernel: MCA: Misc 0x1400e8e86
The odd thing is that the errors begin almost at exactly the same point after boot. I measured from the last log entry at boot, to the first log entry reporting a memory error, and repeated several times. With the exception of the first, all failures begin within a few seconds of each other... all just after the 16 minute mark. The machine is idle.
1. 16:25 (first memory error reported 16 min 25 sec after boot)
2. 16:08
3. 16:12
4. 16:10
5. 16:12
(In hindsight, measuring from the *first* log entry at boot may have been less variable.)
The same thing happens when I boot to single user. Although I don't have a precise timestamp from written logs, I watched the console near the 16 minute point, and shortly after that the MCA errors started.
By comparison, I booted a Linux on USB kernel, and the memory errors start immediately.
So... what happens for the first 16 minutes after FreeBSD boot that "prevents" (or possibly ignores) memory errors?!
FreeBSD 12.1-RELEASE, fresh install.