ZFS Drive going offline spontaneously

I have a new 3 drive raidz1 on FreeBSD 13.0. Twice in the last few days I've seen the same drive be marked as OFFLINE in zpool status. The output suggested it had been been administratively offlined, which is consistent with the zpoolconcepts(8) definition of OFFLINE, but I've not done that.

Is this a sign of a faulty drive?
 
It could be also a flaky cable connection, however if the cable is connected tight there is a good change your drive will become unusable soon.

Anyway it's worth investigation.
 
Can you see anything in /var/log/messages?

I would expect to see the change of status logged, something like this:
Code:
Jan  6 10:42:52 sherman kernel: ada2 at ahcich2 bus 0 scbus2 target 0 lun 0
Jan  6 10:42:52 sherman kernel: ada2: <WDC WD2500HHTZ-04N21V0 04.06A00> s/n WD-WXN1E32KUAUF detached
Jan  6 10:42:52 sherman kernel: (ada2:ahcich2:0:0:0): Periph destroyed

I'd certainly shut down the system and carefully re-seat all the cables. A good quality pencil eraser is good for cleaning contacts.

I would then regularly grep in/var/log/messages to alert of any status changes to the drive.
 
Check your power usage too. Your PSU might just crap out, causing the drive to go offline. Also check the SMART parameters (use smartctl(8)) of the drive, just to make sure the drive itself is still good.
 
Back
Top