Complete system freeze up; suspect graphics card

Complete system freeze up, hard reset necessary. Has only happened when running cad/linux-eagle5.

After rebooting, this appears to be the offending line in /var/log/messages:

Code:
Jun 16 16:02:02 blackbox kernel: NVRM: GPU at 0000:01:00.0 has fallen off the bus.

I have reproduced this problem (involuntarily) three times now. Can someone help me home in on the issue and suggest which other logs to start looking at?

Code:
> uname -a
FreeBSD blackbox.hansons 8.2-STABLE FreeBSD 8.2-STABLE #0: Sat Mar 26 19:15:18 NZDT 2011     benjamin@blackbox.hansons:/usr/obj/usr/src/sys/BLACKBOX  i386

Code:
> pkg_info -xI nvidia
nvidia-driver-295.49 NVidia graphics card binary drivers for hardware OpenGL ren
nvidia-settings-295.40 Display Control Panel for X NVidia driver
nvidia-xconfig-295.40 Tool to manipulate X configuration files for the NVidia dri
 
Do you build x11/xorg from ports or did you install packages?

I recommend building from ports and setting WITHOUT_NOUVEAU in /etc/make.conf. This will give you a slightly newer DRI version. I've noticed the NVidia driver is more stable that way.
 
Thanks SirDice,

I've tried what you suggested, but I just had another lock-up and lost three hours of work. For some reason eagle reverted to the state it started in, even though I had saved many times.

Again, /var/log/messages came up with:
Code:
Jun 25 23:18:49 blackbox kernel: NVRM: GPU at 0000:01:00.0 has fallen off the bus.
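
To see how often this message recurs, it can help to pull every NVRM line out of the log in one go. Below is a minimal sketch, assuming a syslog-style file such as /var/log/messages; the function name nvrm_events is made up for illustration and is not part of any driver tooling.

```shell
#!/bin/sh
# Hypothetical helper: list all NVRM kernel messages in a syslog-style file
# and count the "fallen off the bus" events among them.
# The default path is the log file mentioned in this thread.
nvrm_events() {
    log="${1:-/var/log/messages}"

    # Show every line the NVIDIA kernel module logged.
    grep 'NVRM:' "$log"

    # grep -c prints the number of matching lines (0 if there are none).
    printf 'fallen-off-the-bus events: %s\n' \
        "$(grep -c 'has fallen off the bus' "$log")"
}
```

Call it as `nvrm_events /var/log/messages`. Rotated logs may be compressed, in which case they would need to be decompressed to a temporary file first.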

Am I fighting a losing battle here? I can't afford to lose more time like this, although I see this is happening on Linux as well. Could an upgrade to 9-STABLE help?
 
I've found this problem reported under Linux too. Here's the "fix".

This issue is reported all over the place, and the most commonly recommended solutions are as follows:

Install Latest Kernel Version and NVIDIA Driver

You need to update your kernel and install the latest NVIDIA Unix driver.

Put NVIDIA Driver In Persistence Mode

You need to set your GPU in persistence mode. From the man page:

A flag that indicates whether persistence mode is enabled for the GPU. Value is either "Enabled" or "Disabled". When persistence mode is enabled the NVIDIA driver remains loaded even when no active clients, such as X11 or nvidia-smi, exist. This minimizes the driver load latency associated with running dependent apps, such as CUDA programs. For all CUDA-capable products. Linux only.

Edit /etc/rc.local file and add the following line before exit 0 statement:

/usr/bin/nvidia-smi -pm 1


Save and close the file. The above line ensures that your GPU is set to persistence mode as soon as the system boots.

How Do I Set Persistence Mode From Command Line?

Type the following command as root user:
# /usr/bin/nvidia-smi -pm 1

How Do I Verify That Persistence Mode Is Set From My Device?

Type the following command as root user:
# /usr/bin/nvidia-smi -q | grep -i Persistence

Sample outputs:
Persistence Mode : Enabled

How Do I View All Settings?

Type the following command to display GPU or unit info:
# nvidia-smi -q | less

Is there any way this can be done under FreeBSD?
 
Bump. Problem still exists for me, although I've just seen it happen without the message about something falling off the bus.

Very frustrating. Complete system lock-up, no mouse.
 
Sorry for necrobumping this.

I am getting the same error with a GT610. Usually it is a system crash, but sometimes network/ssh are still functional.

Just reported it to NVidia with nvidia-bug-report.log.gz attached.

Amazing this hasn't been fixed in so many years.
 
Are you sure it's the GPU's fault? My system was hanging, and a hard reset was necessary to use it again. I suspected the hard drive, the GPU, the CPU, and the RAM, but the problem turned out to be the PSU; replacing it solved all my problems. Hope this helps.
 