It's more common than not that a 'reboot' of a FreeBSD server results in a hung one requiring manual intervention (via reset or power-cycling). It even happens at kernel panics. Sometimes it works (more often it the server was rebooted more recently, but long uptimes definitely makes it more probable to "hang").
Are there anyone looking into this issue - that has been there for ages?
... after which nothing more happens.
FreeBSD 11.2 & 11.3, hardware Dell PowerEdge R730xd & R740xd, HP ProLiant DL380g9 and others. A lot of disks attached to SAS HBA controllers, 10Ge Intel X710 ethernet controllers but otherwise fairly standard machines...
We've tried mitigating this issue somewhat by enabling the hardware watchdogd(8) feature and that helps - sometimes. But not always (it seem the hardware watchdog also gets lost every now and then).
One possible workaround that has been mentioned before is adding to /boot/loader.conf:
which we have but it doesn't make any difference so it's probably something else preventing the machine to reboot properly.
Are there anyone looking into this issue - that has been there for ages?
Code:
> FreeBSD/amd64 (balrog) (ttyu1)
>
> login: Oct 2 13:14:10 balrog reboot: rebooted by fancypants
> Oct 2 13:14:25 balrog syslogd: exiting on signal 15
> pflog0: promiscuous mode disabled
> Waiting (max 60 seconds) for system process `vnlru' to stop... done
> Waiting (max 60 seconds) for system process `bufdaemon' to stop... done
> Waiting (max 60 seconds) for system process `syncer' to stop...
> Syncing disks, vnodes remaining... 0 0 0 0 0 0 0 0 0 0 0 done
> All buffers synced.
FreeBSD 11.2 & 11.3, hardware Dell PowerEdge R730xd & R740xd, HP ProLiant DL380g9 and others. A lot of disks attached to SAS HBA controllers, 10Ge Intel X710 ethernet controllers but otherwise fairly standard machines...
We've tried mitigating this issue somewhat by enabling the hardware watchdogd(8) feature and that helps - sometimes. But not always (it seem the hardware watchdog also gets lost every now and then).
One possible workaround that has been mentioned before is adding to /boot/loader.conf:
Code:
hw.usb.no_shutdown_wait = "1"