Re: No WSI feed from wsihcsn.unidata.ucar.edu to iita at 99082610 (fwd)
- Subject: Re: No WSI feed from wsihcsn.unidata.ucar.edu to iita at 99082610 (fwd)
- Date: Thu, 2 Sep 1999 16:03:49 -0600 (MDT)
===============================================================================
Robb Kambic Unidata Program Center
Software Engineer III Univ. Corp for Atmospheric Research
address@hidden WWW: http://www.unidata.ucar.edu/
===============================================================================
---------- Forwarded message ----------
Date: Thu, 26 Aug 1999 17:49:40 -0600 (MDT)
From: Celia Chen <address@hidden>
To: Robb Kambic <address@hidden>
Peter Neilley <address@hidden>
Subject: Re: No WSI feed from wsihcsn.unidata.ucar.edu to iita at 99082610
Robb,
I have to tell you that iita didn't take the pq_size
of 850MB too well. Please remember that iita is a Linux
box with 1GB of memory. It ran out of space quickly, and
such a big pq_size can cause files to disappear. I
have reduced the pq_size back to 650MB now.
Celia
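
For a rough sense of scale, the two queue sizes can be compared against
the machine's memory. This is only a sketch: it assumes "1GB" means
1024 MB and that the share of physical memory is the figure of interest.
The function name is made up for illustration and is not part of the LDM.

```python
def queue_share_of_ram(queue_mb: float, ram_mb: float) -> float:
    """Return the product queue size as a fraction of physical memory."""
    return queue_mb / ram_mb


RAM_MB = 1024  # assumption: the "1GB" quoted for iita is 1024 MB

# Compare the queue size that caused trouble with the one restored afterwards.
for pq_size_mb in (850, 650):
    share = queue_share_of_ram(pq_size_mb, RAM_MB)
    print(f"pq_size {pq_size_mb}MB uses {share:.0%} of memory")
```

On those assumptions the 850MB queue takes a little over four fifths of
memory, while 650MB leaves roughly a third free for everything else.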
>
> Hiya,
>
> I logged into iita, made some configuration changes, and changed it
> back to run version 5.0.8. The configuration changes are: made the queue
> size 850 megabytes, and changed pqexpire to run every 20 minutes instead
> of every 5 minutes. When pqexpire was running the machine was 0% idle.
>
> When I stopped the ldm, there were many rogue ldm
> processes running on the machine, not affiliated with the running
> ldm. When stopping the ldm, care needs to be taken that all ldm
> processes are gone before restarting. Before the ldm was started, the
> queue was deleted and remade.
>
> When trying to contact iita with ldmping from wsihcsn:
>
> ldmping -i 5 -h iita.rap.ucar.edu
> Aug 26 22:09:19 State Elapsed Port Remote_Host rpc_stat
> Aug 26 22:09:19 ADDRESSED 0.100148 0 iita.rap.ucar.edu RPC: Unable to receive; errno = Connection reset by peer
> Aug 26 22:09:24 SVC_UNAVAIL 0.050834 0 iita.rap.ucar.edu RPC: Unable to receive; errno = Connection reset by peer
> Aug 26 22:09:29 SVC_UNAVAIL 0.032411 0 iita.rap.ucar.edu RPC: Unable to receive; errno = Connection reset by peer
>
> rpcinfo on iita also had the same problem:
>
> iita:~/logs> rpcinfo -n 388 -u iita 300029 4
> rpcinfo: RPC: Unable to receive; errno = Connection refused
> program 300029 version 4 is not available
> iita:~/logs> ^-u^-t
> rpcinfo -n 388 -t iita 300029 4
> rpcinfo: RPC: Unable to receive; errno = Connection reset by peer
> program 300029 version 4 is not available
>
>
>
> wsihcsn is feeding 4 machines:
>
> 2 machines are having latency problems: iita and torrent, both inside
> the security perimeter.
>
> 2 machines are current: shemp and ldm.comet, both outside the security
> perimeter.
>
> It's 4:30 and the WSI feed is about 31 minutes behind on iita.
>
> Aug 26 22:29:36 pqutil: 2268 19990826215757.317 WSI 646 NEX/EWX/LREF1/199908262145
>
> Robb...
>
>
>
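
The latency figure quoted above can be checked against the pqutil line
itself, which was logged at "Aug 26 22:29:36" for a product stamped
19990826215757.317. The sketch below assumes that both times are in the
same time zone and that the year, which the log line omits, is 1999. The
function name is hypothetical and not part of the LDM.

```python
from datetime import datetime


def feed_latency_seconds(log_stamp: str, product_stamp: str, year: int) -> float:
    """Return the seconds between a log time and a product's timestamp."""
    # Log times such as "Aug 26 22:29:36" carry no year, so the caller supplies it.
    logged = datetime.strptime(f"{year} {log_stamp}", "%Y %b %d %H:%M:%S")
    # Product timestamps look like "19990826215757.317" (YYYYMMDDhhmmss.fff).
    created = datetime.strptime(product_stamp, "%Y%m%d%H%M%S.%f")
    return (logged - created).total_seconds()


# Values copied from the pqutil line in the quoted message.
latency = feed_latency_seconds("Aug 26 22:29:36", "19990826215757.317", 1999)
minutes, seconds = divmod(latency, 60)
print(f"{int(minutes)} min {seconds:.1f} s behind")
```

That comes to a little under 32 minutes, in line with the "about 31"
reported for iita.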