Hello there!
We are running FreeBSD 8.1 on a Dell PowerEdge 860 with a SAS5ira RAID controller and two 1.5 TB drives attached. We've started to notice messages from the mpt driver in the logs, usually when writing large amounts of data to the server. I've been using mptutil to try to diagnose the issue, but I haven't been able to get any useful information out of it.
Code:
server# mptutil show adapter
mpt0 Adapter:
Board Name: SAS5ira
Board Assembly:
Chip Name: C1068
Chip Revision: UNUSED
RAID Levels: RAID0, RAID1, RAID1E
RAID0 Stripes: 64K
RAID1E Stripes: 64K
RAID0 Drives/Vol: 2-8
RAID1 Drives/Vol: 2
RAID1E Drives/Vol: 3-8
Code:
server# mptutil volume status 0
Volume 0 status:
state: OPTIMAL
flags: ENABLED
Code:
server# mptutil show drives
mpt0 Physical Drives:
0 ( 1397G) ONLINE <WDC WD15EADS-00P 0A01> SATA bus 0 id 1
1 ( 1397G) ONLINE <WDC WD15EADS-00P 0A01> SATA bus 0 id 32
Code:
server# grep mpt /var/log/messages
Apr 28 10:22:09 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:22:09 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:24:17 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:24:17 server kernel: mpt0: mpt_cam_event: 0x12
Apr 28 10:24:17 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:24:25 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:24:25 server kernel: mpt0: mpt_cam_event: 0x12
Apr 28 10:24:25 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:28:08 server kernel: mpt0: request 0xc56c9b90:57649 timed out for ccb 0xc5d29800 (req->ccb 0xc5d29800)
Apr 28 10:28:08 server kernel: mpt0: attempting to abort req 0xc56c9b90:57649 function 0
Apr 28 10:28:08 server kernel: mpt0: request 0xc56ce140:57650 timed out for ccb 0xc6611000 (req->ccb 0xc6611000)
Apr 28 10:28:08 server kernel: mpt0: request 0xc56ceaf0:57651 timed out for ccb 0xc5d1a000 (req->ccb 0xc5d1a000)
Apr 28 10:28:08 server kernel: mpt0: mpt_recover_commands: IOC Status 0x4a. Resetting controller.
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x80
Apr 28 10:28:08 server kernel: mpt0: completing timedout/aborted req 0xc56c9b90:57649
Apr 28 10:28:08 server kernel: mpt0: completing timedout/aborted req 0xc56ce140:57650
Apr 28 10:28:08 server kernel: mpt0: completing timedout/aborted req 0xc56ceaf0:57651
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x12
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x12
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x21
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x21
Apr 28 10:28:08 server kernel: mpt0:vol0(mpt0:0:0): Volume Status Changed
Apr 28 10:28:08 server kernel: mpt0: mpt_wait_req(4) timed out
Apr 28 10:28:08 server kernel: mpt0: read_cfg_page(1) timed out
Apr 28 10:28:08 server kernel: mpt0: mpt_refresh_raid_data: Failed to read IOC Page 2
Apr 28 10:28:08 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:28:47 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:34:38 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:34:38 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:39:53 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:42:16 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:42:16 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:43:24 server kernel: mpt0:vol0(mpt0:0:0): RAID-1 - Optimal
Apr 28 10:43:24 server kernel: mpt0:vol0(mpt0:0:0): Status ( Enabled )
Apr 28 10:43:24 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:43:54 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:48:12 server kernel: mpt0: mpt_cam_event: 0x16
Apr 28 10:48:12 server kernel: mpt0: mpt_cam_event: 0x16
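A rough sketch for tallying these messages per minute, so that bursts of timeouts stand out from the routine mpt_cam_event lines. The function name summarize_mpt is made up for illustration, and the script assumes the syslog line layout shown in the excerpt above (month, day, time, host, then "kernel: mpt0"); it only counts lines and does not say anything about which drive is at fault.

```shell
# Tally mpt0 kernel messages per minute, split into timeouts and everything else.
# Assumes lines look like: "Apr 28 10:28:08 server kernel: mpt0: ..."
# summarize_mpt is a hypothetical helper name, not part of FreeBSD or mptutil.
summarize_mpt() {
    awk '/kernel: mpt0/ {
        # Key on "Mon DD HH:MM" (drop the seconds from the time field).
        minute = $1 " " $2 " " substr($3, 1, 5)
        # Remember first-seen order so output follows the log order.
        if (!(minute in seen)) { seen[minute] = 1; order[++n] = minute }
        if ($0 ~ /timed out/) timeouts[minute]++
        else others[minute]++
    }
    END {
        for (i = 1; i <= n; i++)
            printf "%s timeouts=%d other=%d\n", order[i], timeouts[order[i]], others[order[i]]
    }' "$@"
}

# Usage: summarize_mpt /var/log/messages
```

With no file argument the function reads standard input, so it can also be fed from grep or zcat output on rotated logs.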
If one of the drives is failing, is there a way to determine which one? Any help in tracking down the cause would be much appreciated.
Thanks in advance!
Cheers,
Phil