[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Support #SBB-325304]: Re: 20110712: CONDUIT request -- fire weather grids
- Subject: [Support #SBB-325304]: Re: 20110712: CONDUIT request -- fire weather grids
- Date: Thu, 18 Aug 2011 09:10:32 -0600
Hi Justin,
Preliminary comments on gribinsert test running on 140.90.33.32:
A comparison of the volume stats for the test CONDUIT datastream
received on gale.unidata.ucar.edu and the "operational" CONDUIT
datastream as seen by our toplevel IDD relay idd.unidata.ucar.edu
shows what might be a problem.
Compare:
CONDUIT volume on gale.unidata.ucar.edu
http://www.unidata.ucar.edu/cgi-bin/rtstats/iddstats_vol_nc?CONDUIT+gale.unidata.ucar.edu
CONDUIT volume on idd.unidata.ucar.edu
http://www.unidata.ucar.edu/cgi-bin/rtstats/iddstats_vol_nc?CONDUIT+idd.unidata.ucar.edu
The volume for the 22+ hours (as I write this note) on gale.unidata.ucar.edu
is significantly less than for the same 22+ hour time period on
idd.unidata.ucar.edu.
I expected that the volume on gale.unidata.ucar.edu would be measurably
_greater_ than
on idd.unidata.ucar.edu because you indicated that the fireweather products
were being
inserted in addition to the products being sent in the "operational" CONDUIT
datastream.
If this is not the case, my surmise is completely invalid.
If there are no other problems, this may mean that the insertion of the
fireweather
products into the LDM queue on 140.90.33.32 is interfering with the other
products
being inserted into the queue. The first thing that jumps to mind as being a
potential problem is that the LDM queue on 140.90.33.32 is not large enough for
both sets of products to be queue-resident long enough to be delivered to the
downstream requester (gale.unidata.ucar.edu). Since gale.unidata.ucar.edu is
only ingesting the test CONDUIT datastream, and since it is a high-end
workstation
(dual quad core CPUs w/24 GB RAM) that is essentially idling, and since the
network connection on gale.unidata.ucar.edu is 1 Gps through Internet2, I am not
hopeful that the problem could be on it.
Please check the age of the oldest product in the LDM queue on 140.90.33.32
to see if it ever becomes exceedingly small. This check should be routinely
performed (like every minute via a cron job) so that we can see if the addition
of the fireweather products has exceeded the 6 GB queue size.
Comments:
- way back when I was working with Patrick O'Reilly on CONDUIT issues, I
strongly
recommended that routine monitoring of things like the age of the oldest
product in the LDM queue be made. I don't recall if Patrick ever setup this
monitoring, so I can't say if you are already doing what needs to be done
(which is running the 'ldmadmin addmetrics' action from a cron job running
in the account where the LDM is running on 140.90.33.32). If you are
already running the metrics gathering actions from cron, you can plot
the time history of things like age of the oldest product in the queue
by running 'ldmadmin plotmetrics' (the ability to plot the metrics is
based on the availability of gnuplot).
- I will be merging additions from the GRIB tables Becky provided yesterday
to the GEMPAK tables being used in my gribinsert v1.4 setup here at Unidata
today. After the merge (which looks a little messier than I expected), I
will see if all of the fields available in the firewx.hires products I FTPed
from the NCEP FTP server can be identified/cracked/harvested for metadata.
More later on this...
Cheers,
Tom
--
****************************************************************************
Unidata User Support UCAR Unidata Program
(303) 497-8642 P.O. Box 3000
address@hidden Boulder, CO 80307
----------------------------------------------------------------------------
Unidata HomePage http://www.unidata.ucar.edu
****************************************************************************
Ticket Details
===================
Ticket ID: SBB-325304
Department: Support CONDUIT
Priority: Normal
Status: Closed