[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[LDM #AET-991057]: LDM 6.4.4----two problems
- Subject: [LDM #AET-991057]: LDM 6.4.4----two problems
- Date: Fri, 10 Mar 2006 09:08:07 -0700
Robert,
> Here are the errors from the upstream machine:
>
> Mar 09 03:34:41 wxmcidas psnldm(feed)[5085]: up6.c:287: nullproc_6() failure
> to psnldm.balloonfacility.org: RPC: Unable to receive; errno = Connection
> reset by peer
> Mar 09 03:34:41 wxmcidas rpc.ldmd[29393]: child 5085 exited with status 5
> Mar 09 03:34:41 wxmcidas psnldm[19030]: ldm6_server.c:136: Restricting
> request: 20060309032443.783 TS_ENDT {{CONDUIT, ".*"},{NONE,
> "SIG=42a66de4294053b81ab7dbe73f7bcabd"}} -> 20060309032443.783 TS_ENDT
> {{CONDUIT, ".*"}}
> Mar 09 03:34:43 wxmcidas psnldm(feed)[19030]: up6.c:334: Starting
> Up(6.3.0/6): 20060309032443.783 TS_ENDT {{CONDUIT, ".*"}}
> Mar 09 03:34:43 wxmcidas psnldm(feed)[19030]: topo:
> psnldm.balloonfacility.org CONDUIT
> Mar 09 03:56:18 wxmcidas psnldm(feed)[19030]: up6.c:287: nullproc_6() failure
> to psnldm.balloonfacility.org: RPC: Unable to receive; errno = Connection
> reset by peer
> Mar 09 03:56:18 wxmcidas rpc.ldmd[29393]: child 19030 exited with status 5
> Mar 09 03:56:18 wxmcidas psnldm[19064]: ldm6_server.c:136: Restricting
> request: 20060309033043.867 TS_ENDT {{CONDUIT, ".*"},{NONE,
> "SIG=ab3d73a9c923cae62de4dc92f66904b2"}} -> 20060309033043.867 TS_ENDT
> {{CONDUIT, ".*"}}
> Mar 09 03:56:20 wxmcidas psnldm(feed)[19064]: up6.c:334: Starting
> Up(6.3.0/6): 20060309033043.867 TS_ENDT {{CONDUIT, ".*"}}
> Mar 09 03:56:20 wxmcidas psnldm(feed)[19064]: topo:
> psnldm.balloonfacility.org CONDUIT
> Mar 09 03:58:18 wxmcidas psnldm(feed)[19064]: up6.c:167: HEREIS: RPC: Unable
> to send; errno = Broken pipe
> Mar 09 03:58:18 wxmcidas psnldm(feed)[19064]: up6.c:430: Product send
> failure: Input/output error
> Mar 09 03:58:18 wxmcidas rpc.ldmd[29393]: child 19064 exited with status 6
The above indicates that the upstream LDM is being told by the operating-system
that the connection was broken: in the first two cases because the TCP layer on
the downstream host sent a "reset" control-message. In the last case no good
reason for the disconnection was given. Given the extensive use of the LDM, it
seems unlikely that these disconnections are due to it, rather, it seems likely
that they are due to either the network or the networking layers of the hosts
in question. You *might* be able to verify this by switching to a previous
version of the LDM -- although the autoshifting feature in 6.4 might confuse
the issue.
Incidentally, the timestamps on the two sets of log messages don't correlate.
Do you have a network administrator to whom you can take this problem?
> I haven't seen these types of errors before.
>
> Thanks,
> Robert Mullenax
> NMSU/CSBF
Regards,
Steve Emmerson
Ticket Details
===================
Ticket ID: AET-991057
Department: Support LDM
Priority: Normal
Status: On Hold