Hello,
Recently one of the SuperMicro node gone crashed. On checking the logs we found ECC memory errors, although these errors should be auto-corrected. However, we tried to find the faulted DIM using mcelog utility but needs instructions/help to identify the physical memory bank location using dmidecode.
Here is the mcelog output: https://pastebin.com/Jwent8YN
Here is the dmidecode output: https://pastebin.com/NjBNc9hG
We need to find the exact memory module to be replaced. There is a method that is used to identify by mapping the memory address range, please let me know how can i map and spot the module.
Recently one of the SuperMicro node gone crashed. On checking the logs we found ECC memory errors, although these errors should be auto-corrected. However, we tried to find the faulted DIM using mcelog utility but needs instructions/help to identify the physical memory bank location using dmidecode.
Here is the mcelog output: https://pastebin.com/Jwent8YN
Here is the dmidecode output: https://pastebin.com/NjBNc9hG
We need to find the exact memory module to be replaced. There is a method that is used to identify by mapping the memory address range, please let me know how can i map and spot the module.