[mythtv-users] Advice on debugging crashing system needed
Gert van der Knokke
gertk at xs4all.nl
Thu Sep 20 19:32:29 UTC 2007
Craig Huff wrote:
> On 9/17/07, Craig Huff <huffcslists at gmail.com> wrote:
>
> I would agree, but this system is < 2 months old. I'm thinking more
> along the lines of a defective or infant mortality part failure. Scans
> through /var/log/* haven't revealed anything clearly an issue, but I
> did find this interesting:
>
> In /var/log/Xorg.0.log.old (yesterday's log) and in
> /var/log/gdm/:0.log.1 these lines appear at the end:
>
> AUDIT: Sun Sep 16 22:12:40 2007: 4351 Xorg: client 2 rejected from local host
> AUDIT: Sun Sep 16 22:12:40 2007: 4351 Xorg: client 3 rejected from local host
> AUDIT: Sun Sep 16 22:12:41 2007: 4351 Xorg: client 3 rejected from local host
> AUDIT: Sun Sep 16 22:12:41 2007: 4351 Xorg: client 3 rejected from local host
>
> Backtrace:
> 0: /usr/bin/Xorg(xf86SigHandler+0x81) [0x80bc151]
> 1: [0x110420]
> 2: /usr/lib/xorg/modules/drivers/nvidia_drv.so(_nv001038X+0x44) [0x2d6110]
>
> Fatal server error:
> Caught signal 11. Server aborting
>
> (WW) NVIDIA(0): Failed to free video decoder object
> (EE) NVIDIA(0): Failed to tear down DAC
>
>
> These lines don't appear in every Xorg.0.log.* nor in every
> gdm/:0.log.*, so I wonder...
>
> No smoking guns with lm_sensors output, either. Core0 Temp: +75F,
> Core1 Temp: +75F, temp1: +75F, temp2: +88F, temp3: +77F,
> fan1: 2884 RPM.
>
> What else might I delve into?
>
> Craig.
>
>
> Typical. The old saw about a watched pot...
>
> With sensors output in a loop piped to a log file and tee'd to a window
> on the backend and top running in another window on the backend, I ran
> it all evening without a hiccup. Enjoyed the movie, too. ;-) I guess I
> could try backing the cooling fans down from tropical storm speed (as
> opposed to hurricane force or balmy breeze) and see if I can exacerbate
> the problem because at this point I have no breadcrumb trail, unless
> the details above are more significant to a trained eye than they are
> to me.
>
> Any advice welcome. I won't post on this again unless I find something
> that looks significant or someone suggests something to look at and
> report back on.
>
I guess you have already tried re-seating the CPU, the memory modules,
the ATX power connector(s) and such. Quite often a bad contact on one of
these will react to temperature changes. Try running memtest for some
time; a single bad bit can cause all kinds of havoc.
I am assuming you use the onboard video and not a card in the PCIe slot;
otherwise re-seating the video card might help too.
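
Since the crash lines you quoted do not show up in every Xorg.0.log.*,
it may help to list which rotated logs contain them, so the crash times
can be compared against your sensors log. Below is a rough Python sketch
of that idea, not a tested tool. It assumes the logs live under
/var/log/Xorg.0.log* and that every crash is reported with the same
"Caught signal N" wording as in your excerpt; the function names are
made up for illustration.

```python
import glob
import re

# Crash signature as it appears in the quoted log excerpt:
# "Caught signal 11. Server aborting"
SIGNAL_RE = re.compile(r"Caught signal (\d+)")


def find_crash_signals(text):
    """Return the signal numbers reported as fatal in one log's text."""
    return [int(m.group(1)) for m in SIGNAL_RE.finditer(text)]


def scan_logs(pattern="/var/log/Xorg.0.log*"):
    """Map each log file matching `pattern` to the fatal signals found in it.

    The default pattern is an assumption based on the paths in this thread.
    """
    results = {}
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path, errors="replace") as fh:
                signals = find_crash_signals(fh.read())
        except OSError:
            continue  # unreadable log (permissions, rotated away); skip it
        if signals:
            results[path] = signals
    return results


if __name__ == "__main__":
    for path, signals in scan_logs().items():
        print(path, signals)
```

Run it as a user who can read /var/log; passing a different pattern to
scan_logs() (for example the gdm logs) works the same way.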
Good luck,
Gert
> Craig.
>