cracauer@
Developer
I have the FreeBSD machines in my PXE cluster down with very weird behavior.
"pxeboot" (the bootloader you get specified from dhcpd and that is retrieved via tftp) hangs. There were significant changes in recent versions, but I tried those from FreeBSD releases 11, 14 and 15-current (and a few others I didn't mark) and they all hang, in a variety of places.
What makes it odd:
- this used to work. I have fresh PXE installs of 2 machines from spring that suddenly stopped working. Keep in mind that I still use the same pxeboot binary, so this broke without me changing any of the software on the FreeBSD side
- I thought that maybe I updated the BIOS (and hence the pxe software in the BIOS) without remembering it, but just this morning I fired up an old mainboard that I used with pxeboot for a long time and that I definitely did not update. Same behavior now
- the Linux machines in this PXE cluster (booted via LILO) work just fine
I inserted some debug print statements into some progressing places in pxeboot, but that didn't tell me anything significant either. How do I debug pxeboot other than in a fully debugged virtual machine?
Does pxeboot work for anybody? I assume yes, because I Didn't Change Anything(tm).
So here are some remaining theories:
- something about my Ethernet changed. What and how would that affect pxeboot in FreeBSD but not Linux?
- it seems unlikely that changes in the dhcp or tftp servers are responsible. Or does it?
- I suppose I could try a diskless boot from the same dhcpd and tftpd in a virtual machine. That would determine whether the Ethernet is responsible
Opinions?
"pxeboot" (the bootloader you get specified from dhcpd and that is retrieved via tftp) hangs. There were significant changes in recent versions, but I tried those from FreeBSD releases 11, 14 and 15-current (and a few others I didn't mark) and they all hang, in a variety of places.
What makes it odd:
- this used to work. I have fresh PXE installs of 2 machines from spring that suddenly stopped working. Keep in mind that I still use the same pxeboot binary, so this broke without me changing any of the software on the FreeBSD side
- I thought that maybe I updated the BIOS (and hence the pxe software in the BIOS) without remembering it, but just this morning I fired up an old mainboard that I used with pxeboot for a long time and that I definitely did not update. Same behavior now
- the Linux machines in this PXE cluster (booted via LILO) work just fine
I inserted some debug print statements into some progressing places in pxeboot, but that didn't tell me anything significant either. How do I debug pxeboot other than in a fully debugged virtual machine?
Does pxeboot work for anybody? I assume yes, because I Didn't Change Anything(tm).
So here are some remaining theories:
- something about my Ethernet changed. What and how would that affect pxeboot in FreeBSD but not Linux?
- it seems unlikely that changes in the dhcp or tftp servers are responsible. Or does it?
- I suppose I could try a diskless boot from the same dhcpd and tftpd in a virtual machine. That would determine whether the Ethernet is responsible
Opinions?