[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: 19990113: idd: delayed metars & NOAAPORT
- Subject: Re: 19990113: idd: delayed metars & NOAAPORT
- Date: Wed, 13 Jan 1999 14:10:52 -0700 (MST)
On Wed, 13 Jan 1999, Unidata Support wrote:
> >To: address@hidden
> >From: Tom McDermott <address@hidden>
> >Subject: Re: 19981223: idd: delayed metars & NOAAPORT
> >Organization: .
> >Keywords: 199901131548.IAA14251
>
>
> On Wed, 23 Dec 1998, Robb Kambic wrote:
> > On Wed, 23 Dec 1998, Unidata Support wrote:
> >
> > >
> > > ------- Forwarded Message
> > >
> > > >From: Tom McDermott <address@hidden>
> > > >Subject: idd: delayed metars & NOAAPORT
> > > >Organization: SUNY Brockport
> > > >Keywords: 199812231947.MAA02806 IDD NOAAPORT
> > >
> > > Dear Unidata Support,
> > >
> > > We are having a chronic problem here at SUNY Brockport with delayed
> > > METARS since the switch to NOAAPORT took place. For obs made at 50
> > > minutes past the hour, we had formerly been receiving before the hour.
> > > Now they are coming in anywhere from 1 to 3 hours late.
> > >
> > > This first occurred when the switch to NOAAPORT took place, and has been
> > > consistently happening ever since. Network connectivity to our feed at
> > > Cornell seems to be OK, no dropped packets, normal transmission times.
> > > However, the problem does seem to lie somewhere between us, since SUNY
> > > Albany, which is also fed from Cornell, is getting them on time.
> > >
> > > The only thing we have to go on is the number of disconnects from Cornell
> > > in our logs, which usually averages around 270 to 280 per 24hr period, or
> > > around 12/hr. The messages in the logs at Cornell (accessible from
> > > Unidata's web page) are a little more informative than ours are. There's
> > > a lot of lines like this (I've edited out line not relevant to our server
> > > vortex):
> >
> > Tom,
> >
> > That's a lot of disconnects. It's also the reason for the late data,
> > surprizing that your site is not loosing more data. Here's what I think
> > what is happening. Before, NOAAport Brockport was almost running close to
> > it's network capacity, now it's saturated. That's the reason for the
> > disconnect( time outs) and the LDM RECLASS statements. I would contact
> > your network people to see if they can arrange a solution and further
> > restrict the incoming data. You also could check the status of your
> > machine to see if it cpu loaded, disk loaded, etc.
>
>
> Robb,
>
> Here's what I've found regarding this problem:
>
> 1. CPU and disk are not loaded, so I ruled this out immediately.
>
> 2. Network saturation seemed a plausible diagnosis. Failing over to
> SUNY Albany resulted in no improvement whatsover, which would be
> consistent with network saturation.
>
> 3. However, applying the Unidata filter for NOAAPORT made no difference
> either, in terms of the number of disconnects and delayed products, which
> was a bit surprising if it was a network capacity problem.
>
> 4. Last week I spoke with our network people about this problem. They
> were skeptical it was a network capacity problem since semester had ended
> and very few people were at work. They gave me a figure of 23% of
> capacity for outbound packets and 3% for inbound packets on 12/23/98. As
> a test, on last Tues, 1/5/99, I changed back our REQUEST for UNIDATA to be
> ".*" and we ran statistics for our campus internet connection (maximum
> utilization was 30%) and on the subnet our ldm server is on (maximum
> utilization was 9%). We will have to see what happens when classes
> resume, but it seems unlikely that network saturation was the cause of the
> problem during the time period we are talking about. (The semester had
> ended when NOAAPORT started.
>
> 5. Looking at Cornell's ldm logs, I noticed that in almost every case,
> the RPC timeout messages that preceded the disconnects referred to
> products with headers beginning with "SDXX" (mainly), or else "SDUS".
> I then did a watch on the pq, and you could see that at certain times the
> only things being received were these 2 products, nothing else was getting
> through. So I added a pattern to the Unidata filter REQUEST statement
> which excluded products beginning with "SD". The results were immediate:
> the METARs were received on time and all the other delayed products, such
> as models, were received on time as well. To confirm that the "SD"
> products were the culprit, I changed our REQUEST statement for all UNIDATA
> streams except HDS to be ".*" and for HDS everything except products
> beginning with "SD". We ran with this for several days last week. We
> received all products on time. The number of disconnects from Cornell
> went down to about 2/hr on average. Since then I've further refined the
> HDS REQUEST to exclude other products that we aren't currently using such
> as ^P, ^AG, ^IS, ^VE, etc. Since none of our active downstreams sites is
> requesting HDS, I didn't see a problem with this. We're now averaging
> about 1 disconnect per hour.
>
Tom,
That's great research and interesting results, I'll remember it if any
other site have similar problems. It still implies network
saturation at some point in your connection or a router problem. I did a
notifyme:
wcfields.unidata.ucar.edu.rkambic> notifyme -vl - -f HDS -p "^SD" \
-h zero.unidata.ucar.edu
The results didn't show much. Average product size is 1-2 k and the
number of products/min was low, ranging from 1-20. There is no way that
data rate would saturate a network. The only guess and it's a far one,
would be coincidence of receiving a SD(XX|US) product an a saturation
point. No other sites have problems receiving these products
> So I guess my question is: do you have any ideas why we would be having
> such a problem receiving these "SD.." products? I'm sort of surprised
> that we seem to be the only site experiencing this. Not being able to
> receive them is not a problem currently, but if in the future these Radar
> Coded Messages were to be supported by gempak, we might want to get them.
I wouldn't worry about that now. It seems that John Steer(sp) and Dewain
Wallace are try to get Brockport on the vBNS. I would talk to them, they
seemed a little confused about the data rates needed for feeds. Keep me
inform of any other events that you solve or notice.
Thanks,
Robb...
>
> Info about our server: SparcStation 10 with dual 75mhz processors, 256MB,
> running Solaris 2.6 and ldm 5.0.6.
>
> Tom
> ------------------------------------------------------------------------------
> Tom McDermott Email: address@hidden
> System Administrator Phone: (716) 395-5718
> Earth Sciences Dept. Fax: (716) 395-2416
> SUNY College at Brockport
>
===============================================================================
Robb Kambic Unidata Program Center
Software Engineer III Univ. Corp for Atmospheric Research
address@hidden WWW: http://www.unidata.ucar.edu/
===============================================================================