[IDD #JLJ-308670]: NEXRAD Level II outage
- Subject: [IDD #JLJ-308670]: NEXRAD Level II outage
- Date: Tue, 26 Feb 2013 15:26:49 -0700
James,
> We just experienced a full outage of all our NEXRAD Level II data that we pull
> from Unidata via LDM. We're now trying to determine whether the problem was at
> our end or the Unidata end.
>
> We lost data at 20:43:17Z and it returned at 21:30:05Z. Our logs contained
> many messages like the following during the outage:
>
> Feb 26 21:16:25 llwxldm1 idd.unidata.ucar.edu[12796] NOTE: LDM-6 desired product-class: 20130226210125.139 TS_ENDT {{NEXRAD2, "(.*)"},{NONE, "SIG=e8cdcd0c6992e8d6e3a46eda90eb93f4"}}
> Feb 26 21:16:25 llwxldm1 idd.unidata.ucar.edu[12796] INFO: Resolving idd.unidata.ucar.edu to 128.117.140.3 took 0.011059 seconds
> Feb 26 21:16:25 llwxldm1 idd.unidata.ucar.edu[12796] INFO: Connected to upstream LDM-6 on host idd.unidata.ucar.edu using port 388
> Feb 26 21:16:25 llwxldm1 idd.unidata.ucar.edu[12796] ERROR: Disconnecting due to LDM failure; Upstream LDM says we're not allowed to receive requested products: 20130226210125.139 TS_ENDT {{NEXRAD2, "(.*)"},{NONE, "SIG=e8cdcd0c6992e8d6e3a46eda90eb93f4"}}
> Feb 26 21:16:25 llwxldm1 idd.unidata.ucar.edu[12796] INFO: Sleeping 13 seconds before retrying...
>
> Can you tell us if there were any problems at Unidata during this time? If
> not, were there problems upstream at NWS, do you think?
It looks like the downstream LDM process responsible for receiving NEXRAD-2
data at your end decided that it needed to reconnect and, consequently, closed
the connection. The closure, however, doesn't appear to have been propagated to
the matching upstream LDM at our site. Consequently, all subsequent
re-connection attempts were rebuffed until the matching upstream LDM finally
received a broken connection signal.
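
To bound an episode like this from the downstream side, a short script along
these lines might help. It is only a sketch, not part of the LDM package: it
assumes each log entry sits on a single line in the syslog-style format quoted
above, that the timestamp carries no year (so one is supplied by the caller),
and that month names are in English. The function name rejection_window and
its parameters are made up for illustration.

```python
import re
from datetime import datetime

# Syslog-style prefix as seen in the quoted log, e.g. "Feb 26 21:16:25 llwxldm1 ..."
STAMP = re.compile(r"^(\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) ")

# Text of the rejection message quoted in the ticket.
REJECTED = "not allowed to receive requested products"


def rejection_window(lines, year=2013):
    """Return (first, last, count) for rejection errors, or None if none are found.

    `year` is an assumption: the log timestamps above do not include one.
    """
    times = []
    for line in lines:
        if "ERROR" not in line or REJECTED not in line:
            continue
        match = STAMP.match(line)
        if match:
            stamp = f"{year} {match.group(1)}"
            times.append(datetime.strptime(stamp, "%Y %b %d %H:%M:%S"))
    if not times:
        return None
    return min(times), max(times), len(times)
```

Passing an open log file (or any iterable of lines) to rejection_window()
gives the first and last time the upstream LDM refused the request, which can
then be compared with the times at which data stopped and resumed.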
This might indicate a problem with our Linux Virtual Server (LVS)
implementation (idd.unidata.ucar.edu is actually a cluster of computers served
by LVS).
I'm investigating further and will let you know if I find anything.
Please keep me apprised of any more problems at your end.
> Thanks.
>
> ---------------------------+---------------------------
> James M. Pelagatti (Jamie) | MIT Lincoln Laboratory
> Software Engineer | Group 43 (Weather Sensing)
> (781) 981-1886 | 244 Wood St., Room S1-611
> FAX: (781) 981-0632 | Lexington, MA 02420-9108
> mailto:address@hidden | http://www.ll.mit.edu
Regards,
Steve Emmerson
Ticket Details
===================
Ticket ID: JLJ-308670
Department: Support LDM
Priority: Normal
Status: Closed