Hi there,
I have a FreeNAS system that unfortunately reports a lot of memory errors. More than one module seems to be affected (one was flagged by the Supermicro BIOS, and I have already removed it).
In /var/log/messages I still get a lot of entries like this:
Code:
Sep 7 02:07:20 freenas MCA: CPU 0 COR (1) MS channel 0 memory error
Sep 7 02:07:20 freenas MCA: Address 0x6e0bfb900
Sep 7 02:07:20 freenas MCA: Misc 0x1221000000000086
Sep 7 02:07:20 freenas MCA: Bank 15, Status 0x8c000040000800c0
Sep 7 02:07:20 freenas MCA: Global Cap 0x0000000007000c14, Status 0x0000000000000000
Sep 7 02:07:20 freenas MCA: Vendor "GenuineIntel", ID 0x50657, APIC ID 0
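For reference, the Status value in the log can be unpacked by hand. The following is a minimal Python sketch, assuming the architectural IA32_MCi_STATUS bit layout from the Intel SDM (corrected error count in bits 52:38) and the memory controller compound error code format (1MMM CCCC in the low byte). The function names are my own and not part of any FreeNAS or FreeBSD tool.

```python
# Assumed layout: architectural IA32_MCi_STATUS fields as described in the Intel SDM.
MMM_NAMES = {0: "GEN", 1: "RD", 2: "WR", 3: "AC", 4: "MS"}  # MS = memory scrubbing


def decode_mci_status(status):
    """Split a 64-bit machine check status value into its main fields."""
    return {
        "valid": bool((status >> 63) & 1),
        "overflow": bool((status >> 62) & 1),
        "uncorrected": bool((status >> 61) & 1),
        "enabled": bool((status >> 60) & 1),
        "misc_valid": bool((status >> 59) & 1),
        "addr_valid": bool((status >> 58) & 1),
        "context_corrupt": bool((status >> 57) & 1),
        "corrected_count": (status >> 38) & 0x7FFF,
        "model_specific_code": (status >> 16) & 0xFFFF,
        "mca_code": status & 0xFFFF,
    }


def decode_memory_error(mca_code):
    """Return (transaction type, channel) for a memory controller error code.

    Returns None if the code does not match the 0000 0000 1MMM CCCC pattern.
    A channel value of 15 means the channel is not specified.
    """
    if (mca_code & 0xFF80) != 0x0080:
        return None
    mmm = (mca_code >> 4) & 0x7
    channel = mca_code & 0xF
    return MMM_NAMES.get(mmm, "reserved"), channel


if __name__ == "__main__":
    fields = decode_mci_status(0x8C000040000800C0)  # Status from the log above
    for name, value in fields.items():
        print(name, value)
    print(decode_memory_error(fields["mca_code"]))
```

For the Status in my log this gives a valid, corrected (not uncorrected) error with a count of 1, a valid address, and a memory scrubbing error on channel 0, which matches the "COR (1) MS channel 0 memory error" line.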
As I read in this thread, it should be possible to locate the faulty DIMM from the reported error address: the address should fall inside one of the Memory Device Mapped Address ranges shown in the dmidecode output. Unfortunately, this is ambiguous in my case because two sets of DIMMs have the same address ranges. At an earlier stage, the error addresses even fell completely outside my Memory Device Mapped Address ranges. It therefore seems that some conversion or calculation is involved, maybe in the memory controllers.
Please help!
Here are the memory handles straight from dmidecode. The system originally had 64 GB of RAM in 8 sticks.
Memory Array Mapped Addresses:
Code:
Handle 0x0021, DMI type 19, 31 bytes
Memory Array Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x007FFFFFFFF
Range Size: 32 GB
Physical Array Handle: 0x0020
Partition Width: 4
Handle 0x0029, DMI type 19, 31 bytes
Memory Array Mapped Address
Starting Address: 0x00800000000
Ending Address: 0x00FFFFFFFFF
Range Size: 32 GB
Physical Array Handle: 0x0028
Partition Width: 4
Handle 0x0030, DMI type 19, 31 bytes
Memory Array Mapped Address
Starting Address: 0x01000000000
Ending Address: 0x017FFFFFFFF
Range Size: 32 GB
Physical Array Handle: 0x0020
Partition Width: 0
Handle 0x0035, DMI type 19, 31 bytes
Memory Array Mapped Address
Starting Address: 0x01800000000
Ending Address: 0x01FFFFFFFFF
Range Size: 32 GB
Physical Array Handle: 0x0028
Partition Width: 0
Memory Device Mapped Addresses (which point to the handles of the actual physical memory sticks):
Code:
Handle 0x0031, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x001FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x0022
Memory Array Mapped Address Handle: 0x0030
Partition Row Position: 1
Interleave Position: 1
Interleaved Data Depth: 1
Handle 0x0032, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00200000000
Ending Address: 0x003FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x0023
Memory Array Mapped Address Handle: 0x0030
Partition Row Position: 1
Interleave Position: 1
Interleaved Data Depth: 1
Handle 0x0033, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00400000000
Ending Address: 0x005FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x0024
Memory Array Mapped Address Handle: 0x0030
Partition Row Position: 1
Handle 0x0034, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00600000000
Ending Address: 0x007FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x0026
Memory Array Mapped Address Handle: 0x0030
Partition Row Position: 1
Handle 0x0036, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x001FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x002A
Memory Array Mapped Address Handle: 0x0035
Partition Row Position: 1
Interleave Position: 1
Interleaved Data Depth: 1
Handle 0x0037, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00200000000
Ending Address: 0x003FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x002B
Memory Array Mapped Address Handle: 0x0035
Partition Row Position: 1
Interleave Position: 1
Interleaved Data Depth: 1
Handle 0x0038, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00400000000
Ending Address: 0x005FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x002C
Memory Array Mapped Address Handle: 0x0035
Partition Row Position: 1
Handle 0x0039, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00600000000
Ending Address: 0x007FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0x002E
Memory Array Mapped Address Handle: 0x0035
Partition Row Position: 1
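To illustrate the ambiguity, here is a small Python sketch that looks up an error address in the Memory Device Mapped Address ranges listed above. It treats the ranges as plain system physical addresses, which is an assumption and probably too naive given that the ranges overlap. The table layout and the function name are made up for illustration.

```python
# Ranges copied from the dmidecode type 20 entries in this post:
# (entry handle, start, end, physical device handle, array mapped address handle)
DEVICE_RANGES = [
    (0x0031, 0x00000000000, 0x001FFFFFFFF, 0x0022, 0x0030),
    (0x0032, 0x00200000000, 0x003FFFFFFFF, 0x0023, 0x0030),
    (0x0033, 0x00400000000, 0x005FFFFFFFF, 0x0024, 0x0030),
    (0x0034, 0x00600000000, 0x007FFFFFFFF, 0x0026, 0x0030),
    (0x0036, 0x00000000000, 0x001FFFFFFFF, 0x002A, 0x0035),
    (0x0037, 0x00200000000, 0x003FFFFFFFF, 0x002B, 0x0035),
    (0x0038, 0x00400000000, 0x005FFFFFFFF, 0x002C, 0x0035),
    (0x0039, 0x00600000000, 0x007FFFFFFFF, 0x002E, 0x0035),
]


def candidate_devices(addr):
    """Return every table entry whose [start, end] range contains addr."""
    return [entry for entry in DEVICE_RANGES if entry[1] <= addr <= entry[2]]


if __name__ == "__main__":
    error_addr = 0x6E0BFB900  # Address from the MCA log entry
    for handle, start, end, device, array in candidate_devices(error_addr):
        print(f"entry 0x{handle:04X}: device 0x{device:04X}, "
              f"array mapping 0x{array:04X}, range 0x{start:X}-0x{end:X}")
```

For 0x6e0bfb900 this returns two candidates, physical device handles 0x0026 and 0x002E, so the lookup alone does not single out one DIMM.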