[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
LDM Dies
- Subject: LDM Dies
- Date: Tue, 05 Feb 2002 23:34:06 +0000
My ldm (running on aeolus.ucsd.edu) keeps shutting down with the following
message:
Feb 05 22:59:41 aeolus rpc.ldmd[4244]: Terminating process group
This is a complete shutdown. There are no ldm owned process left in the
mix. I bring it back on-line and it will work for a while. Then it shuts
down again with the same message.
I haven't a clue as to what is causing this. All sites feeding from aeolus
should consider failing over to their alternate until this stops. I have a
meeting to go to this evening (and six or seven hours worth of sleep)
during which I won't be able to monitor the ldm.
UPC: Anybody there able to help me?
Larry
---===---=-=-=-=-=-=-=-=-=-=-=====[\/]=====-=-=-=-=-=-=-=-=-=-=---===---
-----===(* Climate's what we expect, but weather's what we get. *)===-----
Larry Riddle : Climate Research Division : Scripps Institution of
Oceanography
University of California, San Diego : La Jolla, California 92093-0224
Phone: (858) 534-1869 : Fax: (858) 534-8561 : E-Mail: address@hidden
From address@hidden Tue Feb 5 17:32:34 2002
To: address@hidden, address@hidden, address@hidden
Subject: Re: LDM Dies
Cc: address@hidden
Larry...
In the last week, I've seen LDM mysteriously die on one Linux box. I checked
system messages, and saw a logged "segmentation violation" from rpc.ldmd.
The same process was running on another Linux box with no problems, though
it wasn't processing exactly the same data. Since "rpc.ldmd" is setuid
ROOT, I didn't get a core dump.
You might check your messages file (probably /var/adm/messages or
/var/log/messages) to see if something similar is logged.
Kevin W. Thomas
Center for Analysis and Prediction of Storms
University of Oklahoma
Norman, Oklahoma
Email: address@hidden
From address@hidden Tue Feb 5 17:39:59 2002
CC: address@hidden, address@hidden,
address@hidden
Subject: Re: LDM Dies - aeolus downstream sites should fail over
Hi Larry,
I'm on aeolus. I see the problem - an assertion about the state of the
product queue is regularly failing. But, I don't yet see why this is
happening.
As per Larry's suggestion, sites feeding from aeolus should fail over
until further notice.
Anne
--
***************************************************
Anne Wilson UCAR Unidata Program
address@hidden P.O. Box 3000
Boulder, CO 80307
----------------------------------------------------
Unidata WWW server http://www.unidata.ucar.edu/
****************************************************