ZFS HDD Mirror read error for one of two HDDs

Occasionally running a combined GPU and CPU stress test is a good way to test a PSU. I had a couple of PSUs that could no longer handle that after a few years (they were still fine for lower-powered stuff), and the first one I ever tried it on failed instantly :p (which suggests it might otherwise have shown up as a random problem later).
 
Hi, as your HDD has no reallocated and no unrecoverable sectors after only three months of power-on time, it appears to be healthy. A few DMA errors are not uncommon.
From a population of 20,000 HDDs we used to get half a dozen disks a month with 1000+ UDMA errors. That is when you know there is a connection problem between the HDD and the motherboard.
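To put a number on that, here is a rough sketch that pulls the UDMA CRC counter out of smartctl -A output and compares it with the 1000 figure mentioned above. It assumes the drive reports the counter as attribute ID 199 with a plain integer in the last column (common, but not guaranteed for every vendor); the function names are made up for this example.

```shell
#!/bin/sh
# Reads `smartctl -A` output on stdin and prints the raw value of
# attribute 199 (usually UDMA_CRC_Error_Count); prints nothing if absent.
udma_crc_count() {
    awk '$1 == 199 { print $10 }'
}

# Compares a count with a limit (default 1000, the figure quoted above).
# The limit is a rule of thumb from this thread, not a vendor specification.
crc_verdict() {
    count=$1
    limit=${2:-1000}
    case $count in
        ''|*[!0-9]*) echo "unknown"; return 1 ;;
    esac
    if [ "$count" -ge "$limit" ]; then
        echo "suspect cable or connection"
    else
        echo "within the usual range"
    fi
}

# Usage on a live system (the device name is an assumption):
#   crc_verdict "$(smartctl -A /dev/ada0 | udma_crc_count)"
```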
 
Today, after booting and rebooting, I got 2 errors.
1. Shutting down the system showed me the error "Spin down disk failed" after issuing poweroff.
2. Booting up again showed me the errors I previously posted, and zpool status gives me the following:
Code:
  pool: libraryz
 state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
    Sufficient replicas exist for the pool to continue functioning in a
    degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
    repaired.
  scan: resilvered 1.59M in 00:00:01 with 0 errors on Thu Oct 23 14:31:20 2025
config:

    NAME          STATE     READ WRITE CKSUM
    libraryz      DEGRADED     0     0     0
      mirror-0    DEGRADED     0     0     0
        ada1.eli  ONLINE       0     0     0
        ada0.eli  FAULTED      0     0     0  too many errors
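
For anyone who wants to check for this state from a script, a minimal sketch that lists every entry in the config table above whose state is not ONLINE. It assumes the table layout shown here (a NAME/STATE header, one device per line, a blank line after the table); on pools with hot spares, entries such as AVAIL would be listed too. The function name is made up for this example.

```shell
#!/bin/sh
# Reads `zpool status` output on stdin and prints "name state" for every
# pool, vdev, or disk in the config table whose state is not ONLINE.
unhealthy_devices() {
    awk '
        $1 == "NAME" && $2 == "STATE"         { in_table = 1; next }  # table header
        in_table && NF == 0                   { in_table = 0 }        # blank line ends the table
        in_table && NF >= 2 && $2 != "ONLINE" { print $1, $2 }
    '
}

# Usage on a live system:
#   zpool status libraryz | unhealthy_devices
```

Fed the output above, this should print libraryz, mirror-0 and ada0.eli with their DEGRADED/FAULTED states.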

Trying smartctl -HA /dev/ada0 shows me this:
Code:
smartctl 7.5 2025-04-30 r5714 [FreeBSD 14.3-RELEASE-p3 amd64] (local build)
Copyright (C) 2002-25, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: Input/output error

If this is a USB connected device, look at the various --device=TYPE variants
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

Could this still be a SATA cable error, or is it more likely an HDD error now?
If a disk fails to spin down upon shutdown, isn't that an HDD error?
I'm wondering whether I should get new SATA cables now, or replace the HDD, as I still have around 2 years of warranty.

Another point I can think of is that the HDD was still doing some work, and the OS tried to forcefully close it, without asking the HDD to finish what it is doing.
I read that there are tools for Seagate HDDs to ask them politely to finish their task, before shutting down the system.
Just wondering whether there are similar tools for WD RED Plus HDDs.

Maybe I should try something like # camcontrol standby "device" before issuing poweroff, to make sure the HDD is ready for a shutdown?
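A rough sketch of what that could look like as a small wrapper. Whether an explicit standby actually avoids the "Spin down disk failed" message is untested here; the helper names and the DRY_RUN switch are made up for this example, and only camcontrol standby and poweroff are real commands.

```shell
#!/bin/sh
# Runs a command, or only prints it when DRY_RUN=1 is set.
run() {
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Puts each listed disk into standby, then powers the machine off.
# A failed standby is reported on stderr but does not stop the shutdown.
standby_then_poweroff() {
    for disk in "$@"; do
        run camcontrol standby "$disk" || echo "standby failed for $disk" >&2
    done
    run poweroff
}

# Preview without touching anything (device names are assumptions):
#   DRY_RUN=1 standby_then_poweroff ada0 ada1
```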

EDIT:
The HDD completely died on me.
It is not even recognized anymore by the BIOS.
Issuing zpool status gives me:
Code:
pool: libraryz
 state: DEGRADED
status: One or more devices could not be opened.  Sufficient replicas exist for
    the pool to continue functioning in a degraded state.
action: Attach the missing device and online it using 'zpool online'.
   see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-2Q
  scan: resilvered 1.59M in 00:00:01 with 0 errors on Thu Oct 23 14:31:20 2025
config:

    NAME                      STATE     READ WRITE CKSUM
    libraryz                  DEGRADED     0     0     0
      mirror-0                DEGRADED     0     0     0
        ada1.eli              ONLINE       0     0     0
        13283179534583615310  UNAVAIL      0     0     0  was /dev/ada0.eli

I am pretty sure it is the HDD now.
 
If it's still under warranty, return it for replacement.
If it's out of warranty, you can remove the HDD controller board and check the contacts.
I brought it back today, and next week I will get a new HDD.
I also handed over a smartctl log so the manufacturer can check whether they get the same errors.
I'm pretty sure I will get a new HDD as a replacement.

It wasn't the cable, because I even tried only one HDD at a time, and it failed with both SATA cables.
Once I get the new HDD I will see whether I can create a new pool, as I messed up my old one. All my precious data needs to be gathered again, as my backup device is also broken and out of warranty :/
I bought a new backup device, so I will have fun getting my data back together...
 
Hi, sorry to hear that. Nothing we could see from "smartctl" indicated that a total failure was about to happen.
As the media itself did not show excessive errors, I can only speculate that the onboard HDD controller was marginal somehow and failed.
I'll be interested in the vendor's response to this.
 