cryptidfan said:
My guarded optimism has been dashed. FreeBSD 10.0 RC4 kernel panic around 27 hours uptime and while running the previously mentioned dd command against all 8 SATA drives. Contents of
/var/crash/core.txt.0 are here:
http://codepad.org/svtRfRT1
Can anyone glean helpful clues, assuming they exist, from the crash file? Is there a more appropriate forum or mailing list for such pleas?
In general, if the fault always happens inside the same function (which may or may not have the same physical address, nor the same line number, depending on how the kernel was compiled and any modules loaded), you've found a kernel bug. If the fault happens in random functions without any defined pattern, you have flakey hardware.
Note that not all of the "this should never happen" things in the kernel have nice panic strings, so a piece of broken hardware can give you this type of fault if the kernel developer that handles that piece of code has never seen that particular type of hardware fault. The developer may add a new panic string for the event if they think that panic could happen to other users.
As far as diagnosing the bug that is causing your particular fault, others are a lot better at it than I am. However, something looks odd. Refer to
the source file in question. That's the relevant version for 10.0-RC4. Your actual fault was:
Code:
#7 0xffffffff80b21847 in vm_pageout () at /usr/src/sys/vm/vm_pageout.c:1359
But according to the source file I linked to above, the
vm_pageout() function occupies source file lines 1629-1697, far away from the line 1359 reported in your fault. 1359 is inside the [much more complicated]
vm_pageout_scan() function (source lines 887-1424). This could be for a number of reasons:
- The Clang compiler doesn't use the same line numbers as SVN.
- There's something inconsistent about the kernel vs. symbol tables on your system.
- I have no idea what I'm talking about.
In any event, as
@wblock@ suggested, you can probably get more advanced help on the -CURRENT mailing list, as that's where the developers responsible for this code hang out.