I'm running FreeBSD 8.0-RC1. Two disks in my 12 x 500GB double-parity (raidz2) ZFS array died, leaving the array in degraded mode.
I installed two new 500GB disks and used the zpool replace command twice, supplying the pool name, a failed device name, and a new device name each time.
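For reference, the two invocations took the general form below. This is a sketch reconstructed from the status output further down, so the old/new pairing (ad14 to ad38, ad6 to ad34) is an assumption rather than a copy of my shell history. `replace_cmd` is a hypothetical dry-run helper that only prints the command and does not touch the pool.

```shell
#!/bin/sh
# Hypothetical dry-run helper: prints a zpool replace command, runs nothing.
# General form: zpool replace <pool> <old-device> <new-device>
replace_cmd() {
    pool=$1
    old=$2
    new=$3
    printf 'zpool replace %s %s %s\n' "$pool" "$old" "$new"
}

# Assumed pairing, inferred from the two "replacing" groups in zpool status.
replace_cmd chunk ad14 ad38
replace_cmd chunk ad6 ad34
```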
Resilvering completed after about 15 hours. Even so, zpool status still reported the replace operations as in progress. I subsequently scrubbed the pool successfully and cleared the error counts. The current output of zpool status:
Code:
  pool: chunk
 state: DEGRADED
status: One or more devices has experienced an error resulting in data
        corruption. Applications may be affected.
action: Restore the file in question if possible. Otherwise restore the
        entire pool from backup.
   see: http://www.sun.com/msg/ZFS-8000-8A
 scrub: none requested
config:

        NAME                       STATE     READ WRITE CKSUM
        chunk                      DEGRADED     0     0     0
          raidz2                   DEGRADED     0     0     0
            replacing              DEGRADED     0     0     0
              ad14                 OFFLINE      0     0     0
              ad38                 ONLINE       0     0     0
            replacing              DEGRADED     0     0     0
              7415432913300468315  REMOVED      0     0     0  was /dev/ad6/old
              ad34                 ONLINE       0     0     0
            ad12                   ONLINE       0     0     0
            ad10                   ONLINE       0     0     0
            ad4                    ONLINE       0     0     0
            ad36                   ONLINE       0     0     0
            ad40                   ONLINE       0     0     0
            ad30                   ONLINE       0     0     0
            ad18                   ONLINE       0     0     0
            ad42                   ONLINE       0     0     0
            ad16                   ONLINE       0     0     0
            ad32                   ONLINE       0     0     0

errors: 194225 data errors, use '-v' for a list
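To make it easier to see which replace operations are still stuck, a small filter along these lines could list the members of each "replacing" group. It is only a sketch: `replacing_members` is a name I made up, and it assumes the usual zpool status layout in which a group's members are indented more deeply than the "replacing" line itself.

```shell
#!/bin/sh
# Hypothetical helper: read zpool status text on stdin and print the
# name and state of every device that sits under a "replacing" vdev.
replacing_members() {
    awk '
        {
            # Width of the leading whitespace on this line.
            match($0, /^[ \t]*/)
            indent = RLENGTH
        }
        # Start of a group: remember how far it is indented.
        $1 == "replacing" { in_grp = 1; grp_indent = indent; next }
        # Deeper-indented lines directly below belong to the group.
        in_grp && indent > grp_indent { print $1, $2; next }
        # Anything else ends the current group.
        { in_grp = 0 }
    '
}
```

It would be fed the status output, for example `zpool status chunk | replacing_members`. On the output above that should show ad14 and ad38 in one group and the removed device plus ad34 in the other.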
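Attempting to manually detach one of the failed devices does not succeed:
Code:
sudo zpool detach chunk ad14
cannot detach ad14: no valid replicas
Likewise, manually detaching one of the new devices also fails:
Code:
sudo zpool detach chunk ad38
cannot detach ad38: no valid replicas
However, if I boot the system with ad34 and ad38 disconnected, the array remains accessible in degraded mode.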
Any ideas as to how I might fix this? I'm considering zeroing one of the new drives and attaching it to the array, in the hope that once resilvering completes I would be able to detach one of the stale, failed devices.
Is the problem perhaps that I attempted to replace two drives in a double-parity array simultaneously?