[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

20041011: rpc.ldmd signal 11s



Art,

[Please don't send me a 28 megabyte email.  My system can't handle it.]

Date: Mon, 11 Oct 2004 12:01:55 -0400 (EDT) (10:01 MDT)
From: "Arthur A. Person" <address@hidden>
To: Steve Emmerson <address@hidden>
Subject: Re: 20040921 rpc.ldmd signal 11s

The above message contained the following:

> Looks like we have a core dump for the rpc.ldmd signal 11 problem I 
> reported previously.  The system info is:
> 
>           RedHat Enterprise Linux 2.4.21-15.0.4.ELsmp
>           LDM V6.0.15
>           System:  ls2.meteo.psu.edu Dell 4600 dual 3.0 Ghz
> 
> The ldmd.log file shows some abnormal behaviour before the failure (which 
> I dont' recall seeing before):
> ...
> 
> Oct 11 07:18:52 ls2 rpc.ldmd[22635]: child 22648 terminated by signal 11
> Oct 11 07:18:52 ls2 rpc.ldmd[22635]: Killing (SIGINT) process group

What was child process 22648?  Was it an rpc.ldmd process?

> Finally, I will also mention that I've had two (I think) hangs of this 
> system for reasons unknown in the past couple of weeks.  I put some 
> resource monitors in for the last one but couldn't find anything that I 
> thought was related to the cause.  The hangs both occurred near 03Z 
> (similar to the time the rpc.ldmd quit last night) and would seem to be, 
> perhaps, a time of peak system activity for data reception and file system 
> activity (scour is running at that time and we have lots of little radar 
> files).

I'll have to think about this.

> I'm attaching the core dump.  Let me know if you need anything else or 
> want to get on the system...

Would you please send me a stack trace of the core-file.

Regards,
Steve Emmerson