… this is a known bug closed for convenience. :/ …
Yes I do. What's the problem? That report reads "Closed", and it describes quite precisely what we have seen here too.I hope that you don't mean openzfs/zfs issue 5346.
You're not the only one. In my experience the probability of getting a problem fixed is often related to:There is not much focus and drive behind identifying and weeding out all the "ordinary" races and corner-cases. And I don't like that.
I understand your point of view, the only issue I have with that methodology is that someone 5 years from now is going to run into the same issue, post the issue here, everyone is going to look over the issue, get confused, run through the same steps to diagnose, find out it's the same issue, and then ignore it. Secondary to that, while my issue (and weirdly, the original reporter in the github report) had to do with very unimportant cache directories, nothing really says it couldn't be something more important. But again, agree that it seems to be a very fringe circumstance.Yes I do. What's the problem? That report reads "Closed", and it describes quite precisely what we have seen here too.
It was closed after the creator had figured he had probably defective memory, so there was a possible (but unprovable) explanation (if they run the machine four years before detecting the bad memory, it may well have gone defect at a later time). Then, there is another person in that report who observed the same issue happen. Adding the case of here, the likelyhood of an actual bug in ZFS does increase.
As I have mentioned occasionally, I recognize two levels of software quality (interplanetary and interstellar), and in this scenario such reports would not be closed. There would be a third path besides "actively working on a fix" and "no feasible way of action, so just close it for convenience", and that would be "under surveillance".
What I observe is, such critical surveillance does happen only for security-related issues. There is not much focus and drive behind identifying and weeding out all the "ordinary" races and corner-cases. And I don't like that.
So, that's my viewpoint. You may have a different one, then please explain.
If they are not reproducible, and there is no debugging information (such as core dumps when the problem first occurred, and traces of how it happened), then it can not be diagnosed nor fixed. Keeping a bug open that will not be addressed is pointless.There is not much focus and drive behind identifying and weeding out all the "ordinary" races and corner-cases. And I don't like that.
This is where determined / stubborn / obsessed people are sometimes of the greatest help:You're not the only one. In my experience the probability of getting a problem fixed is often related to:
It's obvious
It's reproducible
I have a coredump
I have detailed, relevant logs
And in my experience races and corner-cases are the hardest to fix because it may be reproducible, but difficult to do and often no coredump or detailed logs. "I reboot and it went away".
Sure it can. If there might be bug, then it is confined into the code, it can't escape. You just need to read the code and understand what it does (in any possible case).If they are not reproducible, and there is no debugging information (such as core dumps when the problem first occurred, and traces of how it happened), then it can not be diagnosed nor fixed.
…
Upon restarting from single user mode at this point, it seemed that there was a kernel panic (image attached)
…
init died (signal 0, exit 0)
…