[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[AWIPS #RDH-492687]: EDEX Outage
- Subject: [AWIPS #RDH-492687]: EDEX Outage
- Date: Fri, 08 Mar 2024 14:35:49 -0700
Hi there,
> Thanks for your reply. I had subscribed to the Allison House EDEX server for
> the past few years, but have unfortunately
> discontinued this service a week ago. This leaves the Unidata server as the
> only feed for this critical data.
>
> Would there be any avenues that the Unidata AWIPS team could muster to
> provided a backup EDEX server or marshal other
> local entities to provide this only as a back up ? Sadly, we do not have to
> IT expertise to spin up a local EDEX server.
>
> Thanks for any ideas you may have
Sorry for the delayed response.
That is unfortunate to hear about AllisonHouse.
We do have a backup server, but it is internal only and we use it for
development as well. That is how we were able to
get services back up and running so quickly on Monday -- our main production
server is actually still down from another
outage on Wednesday. The main problem with this particular outage was the cloud
provided did hardware migrations on a
Friday, and we weren't aware of the outage until Monday. Why they decided to do
this kind of potentially disruptive
work on a Friday is beyond me. We did not have any staff available over the
weekend to correct the problem, and we
technically don't actually work on weekends. If we are available and able, we
try to address issues as quickly as possible.
We need our backup server for development and testing, so we can troubleshoot
and fix problems without disrupting our live
server. Our EDEX servers consist of 3 machines and are using over a combined
100 CPUs, and 100s of GB of RAM and TBs of
storage. We don't have the resources to be running multiple production, public
EDEX servers for our users.
We do our best to provide near 100% uptime, but unfortunately since our cloud
provider transitioned to version 2 of their
cloud platform almost two years ago, we have had continuous issues with
reliability. We are doing our best to provide a
reliable service, but we only have so many resources (human power, money,
hardware, etc) available to us.
Again, your only other option that I can think of, is to run your own
production server. This is something some of our
users are doing successfully, and we have offered support and troubleshooting
to help them establish their systems.
--Shay Carter
She/Her/Hers
AWIPS Software Engineer
NSF Unidata
If you're interested, please feel free to fill out a survey about the support
you receive:
https://docs.google.com/forms/d/e/1FAIpQLSeDIkdk8qUMgq8ZdM4jhP-ubJPUOr-mJMQgxInwoAWoV5QcOw/viewform
Ticket Details
===================
Ticket ID: RDH-492687
Department: Support AWIPS
Priority: Low
Status: Open
===================
NOTE: All email exchanges with NSF Unidata User Support are recorded in the
Unidata inquiry tracking system and then made publicly available through the
web. If you do not want to have your interactions made available in this way,
you must let us know in each email you send to us.