Hi,
We have FreeBSD 9.1, and one of our drives has failed in the RAID-1 array. What is the proper way to replace the failed drive, while the system is still running? If the second drive fails, we lose the OS and won't be able to boot our server.
1.) Figure out which physical slot the failed drive is located in
2.) Command to replace the drive
3.) Command to replicate/rebuild the raid array
The main filesystem / is running on the RAID-1 array. Here is the output, let me know if you need any other information;
The other failed drive is part of a zpool;
We have FreeBSD 9.1, and one of our drives has failed in the RAID-1 array. What is the proper way to replace the failed drive, while the system is still running? If the second drive fails, we lose the OS and won't be able to boot our server.
1.) Figure out which physical slot the failed drive is located in
2.) Command to replace the drive
3.) Command to replicate/rebuild the raid array
The main filesystem / is running on the RAID-1 array. Here is the output, let me know if you need any other information;
Code:
root # mfiutil show volumes
mfi0 Volumes:
Id Size Level Stripe State Cache Name
mfid0 ( 930G) RAID-1 64k DEGRADED Enabled
mfid1 ( 3725G) RAID-0 64k OPTIMAL Enabled
mfid2 ( 3725G) RAID-0 64k OPTIMAL Enabled
mfid3 ( 3725G) RAID-0 64k OPTIMAL Enabled
mfid4 ( 3725G) RAID-0 64k OPTIMAL Enabled
3 ( 3725G) RAID-0 64k OFFLINE Enabled
mfid18 ( 3725G) RAID-0 64k OPTIMAL Writes
mfid19 ( 3725G) RAID-0 64k OPTIMAL Writes
mfid20 ( 3725G) RAID-0 64k OPTIMAL Writes
mfid21 ( 3725G) RAID-0 64k OPTIMAL Writes
root # mfiutil show drives
mfi0 Physical Drives:
0 ( 931G) FAILED <ST31000524AS JC4B serial=5VPDLEQ8> SATA E1:S0
1 ( 931G) ONLINE <ST31000524AS JC4B serial=5VPDLPGZ> SATA E1:S1
2 ( 3726G) ONLINE <WL4000GSA6472E\011 1KX1 serial=WOL240256793\011> SATA E1:S2
3 ( 3726G) ONLINE <WL4000GSA6472E 1KX0 serial=WOL240241285> SATA E1:S3
4 ( 3726G) FAILED <ST4000DM000-1F21 CC52 serial=W300ANQ8> SATA E1:S4
5 ( 3726G) ONLINE <WL4000GSA6472E 1KX0 serial=WOL240256926> SATA E1:S5
6 ( 3726G) ONLINE <ST4000DM000-1F21 CC54 serial=Z3015J64> SATA E1:S6
7 ( 3726G) ONLINE <ST4000DM000-1F21 CC54 serial=Z3015L8E> SATA E1:S7
8 ( 3726G) ONLINE <WL4000GSA6472E 1KX1 serial=WOL240256967> SATA E1:S8
9 ( 3726G) ONLINE <WL4000GSA6472E HP00 serial=WOL240241417> SATA E1:S9
10 ( 3726G) ONLINE <WL4000GSA6472E\011 1KX0 serial=WOL240256966\011> SATA E1:S10
Code:
root # zpool status
pool: sysvol
state: DEGRADED
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Wed Mar 14 08:45:01 2018
2.13T scanned out of 12.8T at 266M/s, 11h42m to go
272G resilvered, 16.62% done
config:
NAME STATE READ WRITE CKSUM
sysvol DEGRADED 0 0 0
raidz2-0 DEGRADED 0 0 0
mfid1 ONLINE 0 0 0
mfid2 ONLINE 0 0 0
mfid3 ONLINE 0 0 0
mfid4 ONLINE 0 0 0
spare-4 REMOVED 0 0 0
3265633713998955857 REMOVED 0 0 0 was /dev/mfid5
mfid21 ONLINE 0 0 0 (resilvering)
mfid18 ONLINE 0 0 0
mfid19 ONLINE 0 0 0
mfid20 ONLINE 0 0 0
logs
ada0 ONLINE 0 0 0
spares
11427004879980126793 INUSE was /dev/mfid21