Other Sesutil error

One of the Disks disappeared three times already. Last time they pulled it out and pushed it back in and it came online. So I don't know what to think as I'm not there.

What does Status: Critical (0x02 0x00 0x00 0x40) mean. I found some error codes for 0x02 and 0x40 for an LSI RAID but I don't know if those are relevant nor what they actually mean even if they were relevant.
https://techdocs.broadcom.com/us/en...ri-mode-software/1-0/v11685216/v11685227.html


I'm trying to figure out if it's the Disk or the slot that's broken. I'll tell them to move it to a different slot, but is there a way I can confirm?

This is a Dell R640

Sesutil shows

sh:
ses2:
        Enclosure Name: DP BP14G+EXP 2.52
        Enclosure ID: 500056b326deabff
        Element 0, Type: Array Device Slot
                Status: Unsupported (0x00 0x00 0x00 0x00)
                Device Names:
        Element 1, Type: Array Device Slot
                Status: Unknown (0x06 0x00 0x00 0x00)
                Description: Drive Slot 0
                Device Names: da0,pass2
        Element 2, Type: Array Device Slot
                Status: Unknown (0x06 0x00 0x00 0x00)
                Description: Drive Slot 1
                Device Names: da1,pass3
        Element 3, Type: Array Device Slot
                Status: Unknown (0x06 0x00 0x00 0x00)
                Description: Drive Slot 2
                Device Names: da2,pass4
        Element 4, Type: Array Device Slot
                Status: Critical (0x02 0x00 0x00 0x40)
                Description: Drive Slot 3
                Device Names:

camcontrol doesn't see the disk either. Neither does iDRAC anymore.

sh:
scbus14 on ahcich13 bus 0:
<>                                 at scbus14 target -1 lun ffffffff ()
scbus15 on ahciem1 bus 0:
<AHCI SGPIO Enclosure 2.00 0001>   at scbus15 target 0 lun 0 (ses1,pass1)
<>                                 at scbus15 target -1 lun ffffffff ()
scbus16 on mpr0 bus 0:
<TOSHIBA AL13SXL600N DT01>         at scbus16 target 8 lun 0 (pass2,da0)
<TOSHIBA AL13SXL600N DT01>         at scbus16 target 9 lun 0 (pass3,da1)
<ATA WD Blue SA510 2. 00WD>        at scbus16 target 10 lun 0 (pass4,da2)
<ATA WD Blue SA510 2. 00WD>        at scbus16 target 12 lun 0 (pass5,da3)
<ATA WD Blue SA510 2. 00WD>        at scbus16 target 13 lun 0 (pass6,da4)
<DP BP14G+EXP 2.52>                at scbus16 target 18 lun 0 (ses2,pass7)
<>                                 at scbus16 target -1 lun ffffffff ()
scbus-1 on xpt0 bus 0:
<>                                 at scbus-1 target -1 lun ffffffff (xpt0)

sh:
➜  ~ sudo sysinfo storage
Generated by SysInfo v1.0.1 by Daniel Gerzo

Storage information

Available hard drives:
da4: <ATA WD Blue SA510 2. 00WD> Fixed Direct Access SPC-4 SCSI device
da4: Serial Number 24061C4A0U13
da4: 1200.000MB/s transfers
da4: Command Queueing enabled
da4: 1907729MB (3907029168 512 byte sectors)
da3: <ATA WD Blue SA510 2. 00WD> Fixed Direct Access SPC-4 SCSI device
da3: Serial Number 24100F4A2Q15
da3: 1200.000MB/s transfers
da3: Command Queueing enabled
da3: 1907729MB (3907029168 512 byte sectors)
da2: <ATA WD Blue SA510 2. 00WD> Fixed Direct Access SPC-4 SCSI device
da2: Serial Number 24100F4A1802
da2: 1200.000MB/s transfers                                                             da2: Command Queueing enabled
da2: 1907729MB (3907029168 512 byte sectors)
da1: <TOSHIBA AL13SXL600N DT01> Fixed Direct Access SPC-4 SCSI device
da1: Serial Number 6740A0BDF5YE
da1: 600.000MB/s transfers
da1: Command Queueing enabled
da1: 572325MB (1172123568 512 byte sectors)
da0: <TOSHIBA AL13SXL600N DT01> Fixed Direct Access SPC-4 SCSI device
da0: Serial Number 1730A00EF5YE
da0: 600.000MB/s transfers
da0: Command Queueing enabled
da0: 572325MB (1172123568 512 byte sectors)

Raid controllers:
mpr0:
mpr0@pci0:26:0:0: class=0x010700 rev=0x02 hdr=0x00 vendor=0x1000 device=0x0097 subvendor=0x1028 subdevice=0x1f53
mpr0@pci0:26:0:0: class=0x010700 rev=0x02 hdr=0x00 vendor=0x1000 device=0x0097 subvendor=0x1028 subdevice=0x1f53
vendor='Broadcom / LSI'
device='SAS3008 PCI-Express Fusion-MPT SAS-3'
 
I think sesutil has an option for more verbose output, which will show you extra status. Otherwise, get the documentation from the disk enclosure vendor to see what they report in the status. I don't know whether the SES standard describes this in detail; last time I worked with SES (10 years ago), the problem was that the standard describes mostly the syntax of commands and responses, but not much about the semantics.

Another option: Ask the HBA what it sees. Either by booting into its BIOS-based UI, or with some utility like megaraid or stor-cli or whatever it is called today. Do you have any vendor-provided UI for the enclosure?
 
Yeah there is iDRAC on this server it sees everything BIOS sees. Log says Disk is pulled. Log says it was pulled 3 times, so there's something iffy about it. But I didn't check the error code while disk is actually removed, that way I would know if "critical" pertains to the Disk or the actuall Disk Controller or single slot.

I wouldn't even know how to look for documentation apart from a single number which I'll paste later. I'll see what Google comes up with after I post this.

I check the manpage and ses doesn't seem to have verbose, I did try the other commands it has though. Here it is below.

sh:
➜  ~ sudo sesutil show 
ses0: <AHCI SGPIO Enclosure 2.00>; ID: 3061686369656d30
Desc            Dev     Model                     Ident                Size/Status
Slot 00         -       -                         -                    Not Installed
Slot 01         -       -                         -                    Not Installed
Slot 02         -       -                         -                    Not Installed
Slot 03         -       -                         -                    Not Installed
Slot 04         -       -                         -                    Not Installed
Slot 05         -       -                         -                    Not Installed

ses1: <AHCI SGPIO Enclosure 2.00>; ID: 3061686369656d31
Desc            Dev     Model                     Ident                Size/Status
Slot 00         -       -                         -                    Not Installed
Slot 01         -       -                         -                    Not Installed
Slot 02         -       -                         -                    Not Installed
Slot 03         -       -                         -                    Not Installed
Slot 04         -       -                         -                    Not Installed
Slot 05         -       -                         -                    Not Installed
Slot 06         -       -                         -                    Not Installed
Slot 07         -       -                         -                    Not Installed

ses2: <DP BP14G+EXP 2.52>; ID: 500056b326deabff
Desc            Dev     Model                     Ident                Size/Status
Drive Slot 0    da0     TOSHIBA AL13SXL600N       1730A00EF5YE         Unknown
Drive Slot 1    da1     TOSHIBA AL13SXL600N       6740A0BDF5YE         Unknown
Drive Slot 2    da2     ATA WD Blue SA510 2.      24100F4A1802         Unknown
Drive Slot 3    -       -                         -                    Critical
Drive Slot 4    da3     ATA WD Blue SA510 2.      24100F4A2Q15         Unknown
Drive Slot 5    da4     ATA WD Blue SA510 2.      24061C4A0U13         Unknown
Drive Slot 6    -       -                         -                    Not Installed
Drive Slot 7    -       -                         -                    Not Installed
Drive Slot 8    -       -                         -                    Not Installed
Drive Slot 9    -       -                         -                    Not Installed
➜  ~ sudo sesutil status
ses0: OK
ses1: OK
ses2: CRITICAL
➜  ~ sudo sesutil       
sesutil: Missing command
Usage: sesutil [-u /dev/ses<N>] <command> [options]
Commands supported:
    fault       (<disk>|<sesid>|all) (on|off)
        Change the state of the fault LED associated with a disk

    locate      (<disk>|<sesid>|all) (on|off)
        Change the state of the locate LED associated with a disk

    map         
        Print a map of the devices managed by the enclosure

    show       
        Print a human-friendly summary of the enclosure

    status     
        Print the status of the enclosure
 
One out of 4 results for "DP BP14G+EXP 2.52" was this post :) from two days ago.

I'm thinking since this is a Dell computer and the component isn't Dell, they don't really make components the manual should be found at some other companies site. I'll also go check the Dell Docs and see.
 
Found some info on Storage Controllers

sh:
Raid controllers:
ahcich5:
mps0:
mps0@pci0:1:0:0: class=0x010700 rev=0x05 hdr=0x00 vendor=0x1000 device=0x0087 subvendor=0x1028 subdevice=0x05a1
mps0@pci0:1:0:0: class=0x010700 rev=0x05 hdr=0x00 vendor=0x1000 device=0x0087 subvendor=0x1028 subdevice=0x05a1
vendor='Broadcom / LSI'
device='SAS2308 PCI-Express Fusion-MPT SAS-2'
ahcich4:

Dell just says it could be one of these "PowerEdge RAID Controller (PERC) H330, PERC H730P, PERC H740P, HBA330, S140, and Boot Optimized Server Storage (BOSS-S1)."

Or one of these

sh:
Internal Controllers: PERC H330, HBA330, HBA350i (adapter only), H730P, H740P, H750 (adapter only)
External Controllers: H840, 12 Gbps SAS HBA, HBA355e (adapter only, non-RAID)
Software RAID: S140
Boot Optimized Storage Subsystem (BOSS): HWRAID 2 x M.2 SSDs 240 GB, 480 GB
Internal Dual SD Module1

I just read the Manual for "HBA355e Adapter, HBA355i Front, HBA355i Adapter, HBA350i MX, and HBA350i Adapter" it didn't have anything with error codes. It just has how to use things etc.
 
Well idk. I rebooted couple of days ago, nothing happened. I rebooted now again and the disk is back online, it's resilvering, but it will go away again. No errors reported. I'll still have them put it in a different slot.

sh:
Drive Slot 2    da2     ATA WD Blue SA510 2.      24100F4A1802         Unknown
Drive Slot 3    da3     ATA CT2000MX500SSD1       2406E895DFF7         Unknown
Drive Slot 4    da4     ATA WD Blue SA510 2.      24100F4A2Q15         Unknown
Drive Slot 5    da5     ATA WD Blue SA510 2.      24061C4A0U13         Unknown
 
Back
Top