Today after several hours of write operation I had 3 disks removed around same time:
Am I correct assuming, that because I lost 3 disks in succession, this cannot be disk problem or cable problem or hot swap bay problem, but a SATA controller hardware problem instead?
From startup log:
It is LSI 9200-8i controller. Disks are 8TB Samsung 870 QVO.
I currently have no spare controllers to test. I have 9400-16i on order, but it will be month before it arrives.
Code:
Aug 13 12:05:05 ****** mps0: (da3:mps0:0:8:0): CAM status: CCB request completed with an error
Aug 13 12:05:05 ****** Controller reported scsi ioc terminated tgt 7 SMID 1498 loginfo 31110d00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Retrying command, 3 more tries remain
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 cb d0 00 00 00 08 00 00
Aug 13 12:05:05 ****** mps0: Controller reported scsi ioc terminated tgt 7 SMID 1799 loginfo 31110d00
Aug 13 12:05:05 ****** mps0: Controller reported scsi ioc terminated tgt 7 SMID 1181 loginfo 31110d00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request completed with an error
Aug 13 12:05:05 ****** mps0: Controller reported scsi ioc terminated tgt 7 SMID 2040 loginfo 31110d00
Aug 13 12:05:05 ****** mps0: Controller reported scsi ioc terminated tgt 7 SMID 1825 loginfo 31110d00
/---/
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 b0 58 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 ca d8 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request completed with an error
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request completed with an error
Aug 13 12:05:05 ****** (da2:mps0:00 00 03 89 47 d6 90 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Retrying command, 2 more tries remain
/---/
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 ca 70 00 00 00 08 00 00
Aug 13 12:05:05 ****** mps0: (da3:mps0:0:8:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** mpssas_prepare_remove: Sending reset for target ID 7
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 ca 68 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** mps0: (da3:mps0:0:8:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** Unfreezing devq for target ID 8
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 ca 60 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request aborted by the host
/---/
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** da3 at mps0 bus 0 scbus0 target 8 lun 0
Aug 13 12:05:05 ****** da3: <ATA Samsung SSD 870 2B6Q> s/n *************** detached
Aug 13 12:05:05 ****** GEOM_MIRROR: Request failed (error=6). da3[WRITE(offset=7776301056000, length=4096)]
Aug 13 12:05:05 ****** GEOM_MIRROR: Device k5: provider da3 disconnected.
/---/
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 b0 f0 00 00 00 08 00 00
Aug 13 12:05:05 ****** mps0: (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** No pending commands: starting remove_device
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 b0 e8 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 af a8 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** mps0: (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** Unfreezing devq for target ID 7
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 af a0 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
/---/
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): WRITE(16). CDB: 8a 00 00 00 00 03 89 47 9f c0 00 00 00 08 00 00
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): CAM status: CCB request aborted by the host
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Retrying command, 2 more tries remain
Aug 13 12:05:05 ****** da2 at mps0 bus 0 scbus0 target 7 lun 0
Aug 13 12:05:05 ****** da2: <ATA Samsung SSD 870 2B6Q> s/n *************** detached
Aug 13 12:05:05 ****** GEOM_MIRROR: Request failed (error=6). da2[WRITE(offset=7776297885696, length=4096)]
Aug 13 12:05:05 ****** GEOM_MIRROR: Device k4: provider da2 disconnected.
Aug 13 12:05:05 ****** (da3:mps0:0:8:0): Periph destroyed
Aug 13 12:05:05 ****** (da2:mps0:0:7:0): Periph destroyed
Am I correct assuming, that because I lost 3 disks in succession, this cannot be disk problem or cable problem or hot swap bay problem, but a SATA controller hardware problem instead?
From startup log:
Code:
Aug 12 15:08:02 ****** mps0: <Avago Technologies (LSI) SAS2008> port 0xe000-0xe0ff mem 0xf76c0000-0xf76c3fff,0xf7680000-0xf76bffff irq 16 at device 0.0 on pci1
Aug 12 15:08:02 ****** mps0: Firmware: 20.00.07.00, Driver: 21.02.00.00-fbsd
Aug 12 15:08:02 ****** mps0: IOCCapabilities: 1285c<ScsiTaskFull,DiagTrace,SnapBuf,EEDP,TransRetry,EventReplay,HostDisc>
It is LSI 9200-8i controller. Disks are 8TB Samsung 870 QVO.
I currently have no spare controllers to test. I have 9400-16i on order, but it will be month before it arrives.