deb3 Crashing every hour (almost to the second) - The FreeBSD Forums
The FreeBSD Forums  

Go Back   The FreeBSD Forums > Base System > General

General General questions about the FreeBSD operating system. Ask here if your question does not fit elsewhere.

Reply
 
Thread Tools Display Modes
  #1  
Old August 9th, 2012, 23:20
BobWalter BobWalter is offline
Junior Member
 
Join Date: Apr 2012
Posts: 4
Thanks: 0
Thanked 0 Times in 0 Posts
Default Crashing every hour (almost to the second)

I have 8.2 amd 64 installed. All was stable until I decided to reload my perl port and picked the old version by mistake. After a couple of frustrating hours, perl was reinstalled and using the various port tools, plus some hard work to pick up modules that the ports system missed, my software all worked correctly.

However, since then the system crashes every hour (there are no cron jobs involved, and I have waited before restarting to see if time of day affects the issue: it doesn't). The error is an ad5 DMA (unable to write error 5). No dump is made,the server says it is unable to do it.

This sounds like it should be a disk problem. Ad5 is not a disk on my machine. There is no ad5 entry in /dev. When the problem first occurred, I thought it might be my disk. I was using two SSDs with gmirror(8) in a RAID 1 config. I tried removing a disk, same problem. I physically substituted another disk. No change. fsck(8) is clean.

I realize this isn't necessarily the OS; though it might be. The problem is that I can't come up with a way to understand the problems. No logs offer anything helpful. The system dies due to the DMA timeouts. That much I am pretty sure is true. One of the port maintainers went through my system and repeated the perl installs. The problem continues and no one has any idea why. I searched on every term I can think of with no success.

I would appreciate any help I can get here.

Thanks,
Bob

Last edited by SirDice; August 10th, 2012 at 08:15. Reason: Read about formatting tags: http://forums.freebsd.org/misc.php?do=bbcode
Reply With Quote
  #2  
Old August 10th, 2012, 05:57
graudeejs's Avatar
graudeejs graudeejs is offline
Style(9) Addict
 
Join Date: Nov 2008
Location: Riga, Latvia
Posts: 4,525
Thanks: 422
Thanked 607 Times in 475 Posts
Default

Quote:
Originally Posted by BobWalter View Post
I have 8.2 amd 64 installed. All was stable until I decided to reload my perl port and picked the old version by mistake. After a couple of frustrating hours, perl was reinstalled and using the various port tools, plus some hard work to pick up modules that the ports system missed, my software all worked correctly.

However, since then the system crashes every hour (there are no cron jobs involved, and I have waited before restarting to see if time of day affects the issue: it doesn't). The error is an ad5 dma (unable to write error 5). No dump is made,the server says it is unable to do it.

This sounds like it should be a disk problem. Ad5 is not a disk on my machine. There is no ad5 entry in /dev. When the problem first occured, I thought it might be my disk. I was using two SSHD's with gmirror in a RAID 1 config. I tried removing a disk, same problem. I physically substituted another disk. No change. fsck is clean.

I realize this isn't necessarily the OS; though it might be. The problem is that I can't come up with a way to understand the problems. No logs offer anything helpful. The system dies due to the dma timeouts. That much I am pretty sure is true. One of the port maintainers went through my system and repeated the perl installs. The problem continues and no one has any idea why. I searched on every term I can think of with no success.

I would appreciate any help I can get here.

Thanks,
Bob
Have you verified ram?
I once hand random crashes (one every few hours) when I had one bad bit in RAM
Reply With Quote
  #3  
Old August 10th, 2012, 08:16
SirDice's Avatar
SirDice SirDice is offline
Moderator
 
Join Date: Nov 2008
Location: Rotterdam, Netherlands
Posts: 13,725
Thanks: 47
Thanked 2,023 Times in 1,862 Posts
Default

It might be the controller, the cable or the disk itself that's on the fritz. Try installing sysutils/smartmontools and see if the disk is OK. Replacing the cable should be easy to try too.
__________________
Senior UNIX Engineer at Unix Support Nederland
Experience is something you don't get until just after you need it.
Reply With Quote
  #4  
Old August 11th, 2012, 03:31
BobWalter BobWalter is offline
Junior Member
 
Join Date: Apr 2012
Posts: 4
Thanks: 0
Thanked 0 Times in 0 Posts
Default

I did check the disk. In fact, since I had a raid 1, I used both disks and tried both controllers. Disks and controllers are fine. The exact time between crashes makes me think this is software.
Reply With Quote
  #5  
Old August 11th, 2012, 05:36
chrcol chrcol is offline
Member
 
Join Date: Nov 2008
Posts: 325
Thanks: 23
Thanked 9 Times in 8 Posts
Default

I have never heard of software making DMA errors get reported. Other than maybe a driver issue, but perl wouldnt cause it. So I think its a coincidence.
Reply With Quote
  #6  
Old August 11th, 2012, 05:58
freethread freethread is offline
Member
 
Join Date: Aug 2009
Location: italy
Posts: 168
Thanks: 25
Thanked 18 Times in 18 Posts
Default

It happens to me after installation of some ports. While installing java/eclipse at some point I was not able to install it and its dependencies, the system crash at random. Nothing in logs, only about 10-15 lines on screen when not in X but not able to read, system instantly reboot.

I thought it was OpenJDK 6, in post installation message there are instructons to mount fdesc and proc, I was not add them to fstab. Before that I was installed emulators/fuse as gvfs dependency and an USB disk (NTFS) is always plugged in but used only in Windows (multiboot system).

After adding fdesc and proc in /etc/fstab and unplugging USB disk, the system stop rebooting. I need more investigation but hope this helps you in some way.
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
[Solved] cron is one hour behind Pushrod General 9 February 4th, 2011 19:31
[Solved] Move clock forward by one hour? caesius General 4 September 27th, 2010 15:17
[Solved] X crashing z662 X.Org 4 March 3rd, 2010 04:13
svn crashing Alt Installation and Maintenance of FreeBSD Ports or Packages 2 August 31st, 2009 18:34
my computer is resetting each hour hirohitosan General 22 March 24th, 2009 06:18


All times are GMT +1. The time now is 07:32.


Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2013, vBulletin Solutions, Inc.
The mark FreeBSD is a registered trademark of The FreeBSD Foundation and is used by The FreeBSD Project with the permission of The FreeBSD Foundation.
Web protection and acceleration provided by CloudFlare
0