ZFS megaraid smartctl command error

larlap

New Member


Messages: 10

uname -a
Code:
FreeBSD bsd 13.0-RC3 FreeBSD 13.0-RC3 #0 releng/13.0-n244696-8f731a397ad: Fri Mar 19 04:00:20 UTC 2021     root@releng1.nyi.freebsd.org:/usr/obj/usr/src/amd64.amd64/sys/GENERIC  amd64

My problem began with these console messages.
Code:
Apr  6 05:41:26 bsd kernel: mfisyspd1: hard error cmd=read 1104-1655
Apr  6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c7530, status=0x3c, scsi_status=0
Apr  6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr  6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 0-551
Apr  6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c57f8, status=0x3c, scsi_status=0
Apr  6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr  6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 552-1103
Apr  6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c5088, status=0x3c, scsi_status=0
Apr  6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr  6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 1104-1655
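
To see at a glance which devices are logging hard errors, a small sh/awk sketch along these lines could tally them. The function name count_hard_errors is made up for illustration, and it assumes the messages keep the "<device>: hard error ..." shape shown above.

```sh
#!/bin/sh
# Count "hard error" kernel messages per device.
# Reads syslog-style lines on stdin, prints "<device> <count>" sorted by device.
# count_hard_errors is a hypothetical helper name, not an existing tool.
count_hard_errors() {
    awk '
        {
            # The device name is the field just before the words "hard error".
            for (i = 2; i < NF; i++) {
                if ($i == "hard" && $(i + 1) == "error") {
                    dev = $(i - 1)
                    sub(/:$/, "", dev)   # strip the trailing colon
                    count[dev]++
                }
            }
        }
        END { for (dev in count) print dev, count[dev] }
    ' | sort
}
```

Possible usage, assuming the kernel messages end up in /var/log/messages: count_hard_errors < /var/log/messages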

The pool is not reported as DEGRADED, so I'm confused. I want to run smartctl on the disks in the pool.
zpool status
Code:
  pool: tank
state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
    attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
    using 'zpool clear' or replace the device with 'zpool replace'.
   see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
  scan: resilvered 506M in 00:01:04 with 0 errors on Thu Apr  1 08:37:28 2021
config:

    NAME           STATE     READ WRITE CKSUM
    tank           ONLINE       0     0     0
      raidz2-0     ONLINE       0 10.8K     0
        mfisyspd0  ONLINE      43 36.7K     7
        mfisyspd1  ONLINE       0   40K     3
        mfisyspd2  ONLINE       0 38.5K     5
        mfisyspd4  ONLINE      22 36.9K     7
        mfisyspd5  ONLINE      17 38.1K     7
        mfisyspd6  ONLINE      17 37.7K     4
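
Since the pool state stays ONLINE, the per-device counters are the part worth watching. A rough sh/awk sketch like the following could pull out every vdev with a non-zero READ, WRITE or CKSUM count. The function name vdevs_with_errors is hypothetical, and it assumes the five-column config layout shown above.

```sh
#!/bin/sh
# List vdevs whose READ/WRITE/CKSUM counters are not all zero.
# Reads "zpool status" output on stdin.
# vdevs_with_errors is a hypothetical helper name.
vdevs_with_errors() {
    awk '
        # Config rows have exactly five fields: NAME STATE READ WRITE CKSUM.
        # Matching on a known state word skips the header and the prose lines.
        NF == 5 && $2 ~ /^(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)$/ {
            if ($3 != "0" || $4 != "0" || $5 != "0")
                print $1, "read=" $3, "write=" $4, "cksum=" $5
        }
    '
}
```

Possible usage: zpool status tank | vdevs_with_errors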


According to Thomas Krenn's article, the command is smartctl -a -d megaraid,N /dev/??, where N is the device ID (DID) of the disk on the HBA. I am having difficulty working out which DID belongs to which device node.

First I loaded the mfip module. kldstat -v | grep mfip shows:
Code:
13    1 0xffffffff82b02000     21b8 mfip.ko (/boot/kernel/mfip.ko)
        510 mfi/mfip

mfiutil show adapter
Code:
mfi0 Adapter:
    Product Name: LSI MegaRAID SAS 9271-8i
   Serial Number: SK84375861
        Firmware: 23.34.0-0019
     RAID Levels: JBOD, RAID0, RAID1, RAID5, RAID6, RAID10, RAID50
  Battery Backup: not present
           NVRAM: 32K
  Onboard Memory: 1024M
  Minimum Stripe: 8K
  Maximum Stripe: 1M

camcontrol devlist -v
Code:
scbus7 on mfi0 bus 0:
<ATA WDC WD20EFRX-68E 0A80>        at scbus7 target 8 lun 0 (pass3)
<ATA WDC WD20EFRX-68E 0A82>        at scbus7 target 9 lun 0 (pass4)
<ATA WDC WD20EFRX-68E 0A82>        at scbus7 target 10 lun 0 (pass5)
<ATA ST3000DM001-9YN1 CC9E>        at scbus7 target 12 lun 0 (pass6)
<ATA WDC WD20EFRX-68E 0A82>        at scbus7 target 13 lun 0 (pass7)
<ATA WDC WD20EFRX-68E 0A82>        at scbus7 target 14 lun 0 (pass8)
<ATA WDC WD20EFRX-68A 0A80>        at scbus7 target 15 lun 0 (pass9)

storcli /c0 /eall /sall show
Code:
CLI Version = 007.1211.0000.0000 Nov 07, 2019
Operating system = FreeBSD 13.0-RC3
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.


Drive Information :
=================

--------------------------------------------------------------------------------
EID:Slt DID State DG     Size Intf Med SED PI SeSz Model                Sp Type
--------------------------------------------------------------------------------
252:0    13 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68EUZN0 U  -   
252:1    10 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68EUZN0 U  -   
252:2    14 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68EUZN0 U  -   
252:3     9 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68EUZN0 U  -   
252:5    12 JBOD  -  2.728 TB SATA HDD N   N  512B ST3000DM001-9YN166   U  -   
252:6     8 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68EUZN0 U  -   
252:7    15 JBOD  -  1.819 TB SATA HDD N   N  512B WDC WD20EFRX-68AX9N0 U  -

mfiutil show drives
Code:
mfi0 Physical Drives:
8 ( 1863G) JBOD <WDC WD20EFRX-68E 0A80 serial=WD-WMC4M3291997> SATA E1:S6
9 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M1SRNXN0> SATA E1:S3
10 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M3XAHPP6> SATA E1:S1
12 ( 2795G) JBOD <ST3000DM001-9YN1 CC9E serial=S1F0C6EK> SATA E1:S5
13 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M4PTHTTT> SATA E1:S0
14 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M1SRN3HS> SATA E1:S2
15 ( 1863G) JBOD <WDC WD20EFRX-68A 0A80 serial=WD-WMC301253288> SATA E1:S7
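
In the listings above, the target numbers from camcontrol (8, 9, 10, 12, 13, 14, 15) line up with the device IDs from mfiutil and storcli, and the model strings agree. Assuming that correspondence holds, a sketch like this could join the two outputs into a DID / slot / serial / pass-device table. The function name map_did_to_pass is made up, and the target-equals-DID rule is an assumption drawn only from this output, not a documented guarantee.

```sh
#!/bin/sh
# Join "mfiutil show drives" and "camcontrol devlist" output on the ID number.
# ASSUMPTION: the CAM target number equals the mfiutil device ID, as it
# appears to in this thread. map_did_to_pass is a hypothetical helper name.
#   $1: file holding "mfiutil show drives" output
#   $2: file holding "camcontrol devlist" output
map_did_to_pass() {
    awk '
        # First file: remember serial and enclosure:slot for each device ID.
        FNR == NR {
            if ($1 ~ /^[0-9]+$/ && match($0, /serial=[^>]*/)) {
                serial[$1] = substr($0, RSTART + 7, RLENGTH - 7)
                slot[$1] = $NF
            }
            next
        }
        # Second file: pick out "target N" and the passN peripheral name.
        {
            target = ""; pass = ""
            for (i = 1; i < NF; i++)
                if ($i == "target") target = $(i + 1)
            if (match($0, /pass[0-9]+/))
                pass = substr($0, RSTART, RLENGTH)
            if (target in serial && pass != "")
                print target, slot[target], serial[target], "/dev/" pass
        }
    ' "$1" "$2"
}
```

Possible usage: save both outputs to files (mfiutil show drives > drives.txt; camcontrol devlist > cam.txt) and run map_did_to_pass drives.txt cam.txt. This does not tell you which mfisyspdN node a disk is; comparing serial numbers is still needed for that.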

Finally, smartctl -a -d megaraid,13 /dev/mfisyspd4
Code:
smartctl 7.2 2020-12-30 r5155 [FreeBSD 13.0-RC3 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

/dev/mfisyspd4: Unknown device type 'megaraid,13'
=======> VALID ARGUMENTS ARE: ata, scsi[+TYPE], nvme[,NSID], sat[,auto][,N][+TYPE], usbcypress[,X], usbjmicron[,p][,x][,N], usbprolific, usbsunplus, sntjmicron[,NSID], sntrealtek, intelliprop,N[+TYPE], jmb39x[-q],N[,sLBA][,force][+TYPE], jms56x,N[,sLBA][,force][+TYPE], 3ware,N, hpt,L/M/N, cciss,N, areca,N/E, atacam, auto, test <=======

No matter what number I use for the DID (N), the error is the same.
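
The error text itself lists the -d types this smartctl build accepts, and megaraid is not in that list, which would explain why changing N makes no difference. As a quick check, a sketch like the one below could test whether a given type appears in that "VALID ARGUMENTS ARE" line. The function name supports_device_type is hypothetical, and it assumes the list keeps the comma-plus-space format shown above.

```sh
#!/bin/sh
# Exit 0 if the device type named in $1 appears in smartctl's
# "VALID ARGUMENTS ARE:" line read from stdin, otherwise exit 1.
# supports_device_type is a hypothetical helper name.
supports_device_type() {
    awk -v want="$1" '
        /VALID ARGUMENTS ARE:/ {
            sub(/.*VALID ARGUMENTS ARE: */, "")   # drop everything before the list
            sub(/ *<=+.*/, "")                    # drop the trailing "<=======" marker
            n = split($0, types, ", ")            # entries are separated by ", "
            for (i = 1; i <= n; i++) {
                name = types[i]
                # Strip option suffixes such as ",N", "[,auto]" or "+TYPE".
                sub(/,.*/, "", name)
                sub(/\[.*/, "", name)
                sub(/\+.*/, "", name)
                if (name == want) found = 1
            }
        }
        END { exit(found ? 0 : 1) }
    '
}
```

Possible usage: smartctl -a -d megaraid,13 /dev/mfisyspd4 2>&1 | supports_device_type megaraid || echo "megaraid not supported by this build"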

Sorry for the long post. Any suggestions?
 
larlap

smartctl -a -T permissive -d atacam /dev/pass7
Code:
smartctl 7.2 2020-12-30 r5155 [FreeBSD 13.0-RC3 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: Input/output error

=== START OF INFORMATION SECTION ===
Device Model:     [No Information Found]
Serial Number:    [No Information Found]
Firmware Version: [No Information Found]
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   [No Information Found]
Local Time is:    Thu Apr  8 03:38:06 2021 PDT
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

smartctl -a -T permissive -d ata /dev/pass7
Code:
smartctl 7.2 2020-12-30 r5155 [FreeBSD 13.0-RC3 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

Smartctl open device: /dev/pass7 failed: Operation not permitted
 