[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: issues with LDM
- Subject: Re: issues with LDM
- Date: Fri, 15 Jun 2007 12:37:10 -0600
Justin,
That didn't change the behavior. Still seeing latency.
perhaps turning off the other feed. Is there any load
other than LDM on the system?
Steve
On Fri, 2007-06-15 at 12:56 -0400, Justin Cooke wrote:
> Steve,
>
> I've recreated the queue, let me know if you are still seeing issues.
>
> If so I'll turn off the feed to ldm2 to see if that corrects things.
>
> Justin
>
> Steve Chiswell wrote:
> > Justin,
> >
> > I don't know if they saw a disk space problem with
> > log files not being rotated, but it might just be
> > best today to build a new queue:
> >
> > ldmadmin stop
> > ldmadmin delqueue
> > ldmadmin mkqueue
> > ldmadmin start
> >
> > That will mean some queued data would be lost, but if users aren't
> > getting it
> > anyway, then its best to ensure that the queue isn't corrupt for the
> > weekend.
> >
> > Happy Friday....
> >
> > Thanks,
> >
> > Steve
> >
> >
> > On Fri, 2007-06-15 at 12:13 -0400, Justin Cooke wrote:
> >
> >> Steve,
> >>
> >> Our logs on the primary ldm system "ldm1" had not rotated for nearly a
> >> week. I sent email to the WOC support and this was the response:
> >>
> >> Looks like the seed file was missing after we brought the system backup
> >> from the last outage. should be good now.
> >>
> >> Justin Cooke wrote:
> >>
> >>
> >>> WOC,
> >>>
> >>> I noticed that our logs for LDM have not been rotated on machine ldm1
> >>> since 06/05/2007. We have a cron entry that runs "ldmadmin newlog" at
> >>> 00Z every day.
> >>>
> >>> I attempted to run the command by hand and got the following back:
> >>>
> >>> ldm@ldm1:~$ bin/ldmadmin newlog
> >>> hupsyslog: couldn't open /var/run/syslogd.pid
> >>>
> >>> I checked but /var/run/syslogd.pid is not there but it is on ldm2.
> >>>
> >>> Could there be a problem with syslogd on ldm1?
> >>>
> >>> Justin
> >>>
> >> Also around that time I turned on our backup feed to the ldm2 system
> >> which had been off since that system had issues a few weeks ago (we were
> >> asked by WOC to turn it back on). I have sent email to their support
> >> group asking if both ldm1 and ldm2 are responding to the
> >> ncepldm.woc.noaa.gov address or if something else is going on.
> >>
> >> Justin
> >>
> >> Steve Chiswell wrote:
> >>
> >>> Justin,
> >>>
> >>> Yesterday just after 18Z, the data flow from ncepldm.woc.noaa.gov
> >>> to top level sites at NSF and Unidata both began showing high latency:
> >>>
> >>> http://www.unidata.ucar.edu/cgi-bin/rtstats/iddstats_nc?CONDUIT
> >>> +atm.cise-nsf.gov
> >>>
> >>> and
> >>>
> >>> http://www.unidata.ucar.edu/cgi-bin/rtstats/iddstats_nc?CONDUIT
> >>> +daffy.unidata.ucar.edu
> >>>
> >>> Data volume out has dropped as a result:
> >>>
> >>> http://www.unidata.ucar.edu/cgi-bin/rtstats/iddstats_vol_nc?CONDUIT
> >>> +atm.cise-nsf.gov
> >>>
> >>> Since the behavior is similar at both sites at separate locations, the
> >>> problem would appear to be near your end. Since that coincides with your
> >>> restart of the LDM, could you fill me in on the issues you were
> >>> experiencing?
> >>>
> >>> Thanks
> >>>
> >>> Steve Chiswell
> >>> Unidata User Support
> >>>
> >>>
> >>>
> >>> On Fri, 2007-06-15 at 11:38 -0400, Justin Cooke wrote:
> >>>
> >>>
> >>>> Doug,
> >>>>
> >>>> I had to restart our LDM yesterday right before the 18Z cycle, we had an
> >>>> issue with out logging but none of the configuration files changed.
> >>>> Could one of your feeds have lost the connection to our LDM during that
> >>>> restart?
> >>>>
> >>>> Justin
> >>>>
> >>>> Douglas Schuster wrote:
> >>>>
> >>>>
> >>>>> Yes, we've received partial cycles. More than half of the expected
> >>>>> fields have been missing
> >>>>> in each cycle from June 14 18Z, to June 15, 06Z. The number of
> >>>>> missing fields varies between
> >>>>> each cycle.
> >>>>>
> >>>>> Doug
> >>>>>
> >>>>> On Jun 15, 2007, at 9:11 AM, Justin Cooke wrote:
> >>>>>
> >>>>>
> >>>>>
> >>>>>> Doug,
> >>>>>>
> >>>>>> Have you received any GEFS data from us today? Or is it just certain
> >>>>>> fields you are missing?
> >>>>>>
> >>>>>> Justin
> >>>>>>
> >>>>>>
--
Steve Chiswell <address@hidden>
Unidata