Today in the morning I have discovered that my server running 13.0-RELEASE-p6 wasn't reachable anymore, which left me no other choice then to perform a hardware reset, after which everything was working again. It started during the night with
No idea what caused the problem and no idea how to analyze this any further.
What would be the most elegant solution to prevent such a problem in the future?
So likely a script, that would check connectivity and reboot the server if the link can not be established anymore?
re0
losing connection and toggling link state between DOWN
and UP
till the hardware reset./var/log/messages
only showed the following:
Code:
Jan 24 23:24:43 server kernel: re0: watchdog timeout
Jan 24 23:24:43 server kernel: re0: link state changed to DOWN
Jan 24 23:24:47 server kernel: re0: link state changed to UP
Jan 24 23:24:52 server kernel: re0: watchdog timeout
Jan 24 23:24:52 server kernel: re0: link state changed to DOWN
Jan 24 23:24:56 server kernel: re0: link state changed to UP
Jan 24 23:25:02 server kernel: re0: watchdog timeout
Jan 24 23:25:02 server kernel: re0: link state changed to DOWN
Jan 24 23:25:05 server kernel: re0: link state changed to UP
.
.
.
No idea what caused the problem and no idea how to analyze this any further.
What would be the most elegant solution to prevent such a problem in the future?
So likely a script, that would check connectivity and reboot the server if the link can not be established anymore?