I'm trying to set up a file server (9.1-RELEASE) but I'm having a semi-fatal problem dealing with ZFS pools:
I have a raidz2 made up of six SATA drives connected via my motherboard's Intel southbridge SATA ports. All of the BIOS RAID options are disabled and the drives are in straight AHCI mode (hotswap enabled). The system (accounts, home dir, etc.) is installed on a separate seventh drive formatted as normal UFS, connected to a separate non-Intel motherboard port.
As part of my initial stress testing, I'm simulating failures by popping the SATA cable to various drives in the six-drive pool. If I pop two drives, the pool goes into 'degraded' mode and everything works as expected. I can zero and replace the drives, etc., no problem. However, when I pop a third drive, the machine becomes VERY unstable. I can nose around the boot drive just fine, but anything involving I/O that so much as sneezes in the general direction of the pool hangs the machine. Once this happens I can log in via ssh, but that's pretty much it. I've reinstalled and tested this over a dozen times, and it's perfectly repeatable:
[cmd=]ls[/cmd] the dir where the pool is mounted? Hang.
I'm already in the dir, and try to [cmd=]cd[/cmd] back to my home dir? Hang.
[cmd=]zpool destroy[/cmd]? Hang.
[cmd=]zpool replace[/cmd]? Hang.
[cmd=]zpool history[/cmd]? Hang.
[cmd=]shutdown -r now[/cmd]? Gets halfway through, then hang.
[cmd=]reboot -q[/cmd]? Same as shutdown.
The machine never recovers (at least, not inside 35 minutes, which is the most I'm willing to wait). Reconnecting the drives has no effect. My only option is to hard reset the machine with the front panel button. Googling for info suggested I try changing the pool's "failmode" property from "wait" to "continue", but that doesn't appear to make any difference. For reference, this is a virgin 9.1-RELEASE installed off the DVD image with no ports, packages, or extras of any kind.
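For anyone following along, changing that property looks roughly like the sketch below. The pool name `tank`, the `set_failmode` helper, and the `ZPOOL` override variable are all placeholders made up for illustration. The only real pieces are `zpool set`, `zpool get`, and the three values failmode accepts (wait, continue, panic).

```shell
#!/bin/sh
# Hypothetical helper for illustration only; not the exact commands from my session.
# ZPOOL can be pointed at another binary (assumption: defaults to the real zpool).
ZPOOL="${ZPOOL:-zpool}"

set_failmode() {
    pool="$1"   # placeholder pool name, e.g. tank
    mode="$2"   # one of: wait, continue, panic

    # Reject anything that is not a valid failmode value.
    case "$mode" in
        wait|continue|panic) ;;
        *) echo "invalid failmode: $mode" >&2; return 1 ;;
    esac

    # Set the property, then read it back to confirm it took effect.
    "$ZPOOL" set "failmode=$mode" "$pool" && "$ZPOOL" get failmode "$pool"
}
```

Calling `set_failmode tank continue` on a healthy pool should set the property and print it back. It has to be done before the pool faults, since (as described above) every zpool command hangs afterwards.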
Can someone help me out here? Is this a bug or something? I don't think I'm doing anything wrong procedure-wise. I fully understand and accept that a raidz2 with three dead drives is toast, but I will NOT accept having it take down the rest of the machine with it. I can't even nuke the damn pool and start over without taking the whole machine offline.
Also, apologies if there's already a thread about this; forum search appears to be broken at the moment and I didn't see anything when I searched by hand.