Ladies/Gents, just recently I have noticed the following errors in two of my FreeBSD 8.3 VMs (a name server and an MX server - both running as ESXi 4.1 guests). I'll post output from the MX server as it appears more often (and obviously does more I/O).
Example:
Any ideas what may be causing this?
I am running GENERIC kernel, and have VMware tools installed. System is up to date using freebsd-update.
Physical hardware is a Cisco B200 blade in a UCS 8 slot chassis, the physical storage is a Netapp FAS2240 connected via NFS over 10 gig fibre through a Cisco 4507.
The virtual storage is just VMware virtual machine provided virtual disks - LSI logic parallel emulation.
The Netapp is not running anywhere near flat out in terms of IO, so I'm pretty sure it shouldn't be timing out due to IO throttling - and all our user ports on the 4507 are running at 100Mb POE (plugged into old phones which are 100Mb limited) with only 8 10Gb ports in use and say 36 ports running at 1Gb. It has dual Sup 7s with SSO, so should be no problem there either.
I'm not seeing storage errors on anything else.
Any idea where to start looking to track this down? The machine had 188 days of uptime at that point and stupidly, I rebooted it.
However my name server (also exhibiting the issue to a lesser extent due to less IO) has not yet been rebooted (also has 188 days of uptime) - if there are any diagnostics I should perform prior to reboot I can perform them on that.
Cheers
Example:
Code:
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: request 0xffffff80002378b0:9223 timed out for ccb 0xffffff000198b000 (req->ccb 0xffffff000198b000)
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: attempting to abort req 0xffffff80002378b0:9223 function 0
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: completing timedout/aborted req 0xffffff80002378b0:9223
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: abort of req 0xffffff80002378b0:0 completed
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: request 0xffffff80002324e0:9224 timed out for ccb 0xffffff003eaca000 (req->ccb 0xffffff003eaca000)
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: attempting to abort req 0xffffff80002324e0:9224 function 0
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: completing timedout/aborted req 0xffffff80002324e0:9224
Jan 16 01:25:42 <kern.crit> mx2 kernel: mpt0: abort of req 0xffffff80002324e0:0 completed
Any ideas what may be causing this?
I am running GENERIC kernel, and have VMware tools installed. System is up to date using freebsd-update.
Physical hardware is a Cisco B200 blade in a UCS 8 slot chassis, the physical storage is a Netapp FAS2240 connected via NFS over 10 gig fibre through a Cisco 4507.
The virtual storage is just VMware virtual machine provided virtual disks - LSI logic parallel emulation.
The Netapp is not running anywhere near flat out in terms of IO, so I'm pretty sure it shouldn't be timing out due to IO throttling - and all our user ports on the 4507 are running at 100Mb POE (plugged into old phones which are 100Mb limited) with only 8 10Gb ports in use and say 36 ports running at 1Gb. It has dual Sup 7s with SSO, so should be no problem there either.
I'm not seeing storage errors on anything else.
Any idea where to start looking to track this down? The machine had 188 days of uptime at that point and stupidly, I rebooted it.
However my name server (also exhibiting the issue to a lesser extent due to less IO) has not yet been rebooted (also has 188 days of uptime) - if there are any diagnostics I should perform prior to reboot I can perform them on that.
Cheers