Re: THREDDS performance [was Re: THREDDS and grib]

Subject: Re: THREDDS performance [was Re: THREDDS and grib]
Date: Mon, 29 Dec 2008 09:31:59 -0800

Hi John -

I realize this email thread goes back a ways, but I was wondering if there'd been any recent work done regarding the performance of Union aggregations in TDS. We (LAS group) are still working with the GFDL data collection and finding that it is still really slow to initialize the aggregations (sometimes as long as 7 minutes!). I've actually moved all of the data to a public TDS at

   http://data1.gfdl.noaa.gov:8380/thredds/ipcc/all_ipcc.html

so that it can be accessed by anyone.

Like I said, I realize this discussion was started a while ago, but just thought I'd touch base and see if there were any new thoughts that I could try in order to speed up performance. I've been working a lot with this data and installing our new LAS on top of the data, which is of course why it's on my mind...


Thanks -
Kevin

John Caron wrote:

Kevin O'Brien wrote:
Hi John

John Caron wrote:
- can you send me the cache file for the test dataset?
Would I find this in the $TOMCAT_HOME/content/thredds/cacheAged directory? Because what I see in there are the cache files for the JoinExisting aggregations, which then get put into a Union aggregation, and it is the Union aggregation URL that is being accessed and is slow. There don't seem to be any cache entries, in that directory anyway, representing the Union aggregation. Are they somewhere else?
only the joinExisting aggs get cached.
- can you give me approximate times of what you see vs what you would expect?
One example took 3 minutes and 40 seconds to open up the dataset html page through the OPeNDAP html interface. That particular example was a Union aggregation of 47 JoinExisting aggregations. Maybe that's just the time it should take? And if that is the case, then I think we can live with it on the first access, but then the question is why not use the cached info after a tomcat restart?
that seems ok for the first time, but then it should use the cache and be much faster.
i'm guessing there's some other problem i don't see yet.
can you send me the ncml for the union, and also one or more of the aggExisting cached files.
kev
Kevin O'Brien wrote:
Hi John -
I installed version 3.16.37 of the server and unfortunately, it doesn't seem to make the problem go away. It did kind of seem like things were a bit quicker at times, but it was hard to accurately assess. Of course, the initial access of these large aggregations is still pretty slow, and then subsequent accesses are faster. However, it does seem like a restart of the tomcat server somehow erases the cache information and so every initial access after a tomcat reboot is slow.
Is there anything else I can do to help further debug the problem?

thanks -
Kevin

John Caron wrote:
Hi Kevin: I made a small fix that looks like it would affect your case, but i'm not convinced it really would cause a huge slowdown. anyway, i wonder if you would give it a try and let me know?
It's TDS release 3.16.37.

thanks for your patience

Kevin O'Brien wrote:
Hi John -
Not to be a pest - but I was wondering if you'd had a chance to look at these performance issues, or even been able to recreate them?
Thanks -
kevin

John Caron wrote:
these are all good questions - there have been similar reports of the agg cache not working like it should. i will have to reproduce to see what's happening.
Kevin O'Brien wrote:
Hi John -
I tried what you suggested and it didn't seem to have a significant effect in making the initial access of the aggregated dataset quicker. It still took over a minute and a half to open the dataset. I've pasted the xml config that I used to define the new aggregation below. To be honest, I'm actually kind of glad because I wasn't looking forward to modifying the guts of the application which generates the xml config automatically.... :-)
I guess I can understand and probably even accept the fact that for the first time the dataset is accessed, things will be a little slow. After that, I presume the dataset is available in the cache, and of course subsequent accesses prove that it is because the response is quite quick. However, if the tomcat server is restarted, it seems like whatever is in the cache is ignored and the cache entries have to be rebuilt. I have my aggregation cache set like so:
 <AggregationCache>
   <dir>/home/pmel/DataPortal/apache-tomcat-5.5.25/content/thredds/cacheAged/</dir>
   <scour>24 hours</scour>
   <maxAge>90 days</maxAge>
 </AggregationCache>

Does that seem correct? Also, as an aside, you mention that you thought this would be quicker because it avoids the OPeNDAP URLs. Shouldn't there be some client-side caching done with the OPeNDAP datasets? For example, if I access a remote dataset with ncdump (or Ferret), and my OPeNDAP caching is turned on in my ~/.dodsrc file, it will cache the response in the ~/.dods_cache directory. Does any of that happen when OPeNDAP URLs are accessed through TDS?
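
One way to narrow down the restart behaviour described above is to check whether the files in the cacheAged directory themselves change across a Tomcat restart. The following is only a sketch under assumptions: the helper names `snapshot` and `diff_snapshots` are made up for illustration, and it inspects only what is on disk, not whether TDS actually reads those files.

```python
import os


def snapshot(cache_dir):
    """Map each regular file name in cache_dir to (size, mtime)."""
    result = {}
    with os.scandir(cache_dir) as entries:
        for entry in entries:
            if entry.is_file():
                info = entry.stat()
                result[entry.name] = (info.st_size, info.st_mtime)
    return result


def diff_snapshots(before, after):
    """Return (removed, added, rewritten) lists of file names."""
    removed = sorted(set(before) - set(after))
    added = sorted(set(after) - set(before))
    rewritten = sorted(
        name for name in before.keys() & after.keys()
        if before[name] != after[name]
    )
    return removed, added, rewritten


# Usage idea (path taken from the <dir> element above):
#   before = snapshot("/home/pmel/DataPortal/apache-tomcat-5.5.25/content/thredds/cacheAged/")
#   ... restart Tomcat, access the slow dataset once ...
#   after = snapshot(same_path)
#   print(diff_snapshots(before, after))
```

If the files are untouched but the first access is still slow, that would suggest the cache entries exist but are not being used, rather than being deleted by the restart.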
Anyway - here's the xml config I used as per your suggestion:
<dataset ID="CM2.1U-D4_1PctTo2X_I1 atmos daily all vars 00010101-02201231_2"
         name="CM2.1U-D4_1PctTo2X_I1 atmos daily all vars 00010101-02201231_2"
         urlPath="ipcc_ar4_CM2.1_R1_1to2x-1_daily_atmos_00010101-02201231_2">
  <serviceName>thisDODS3</serviceName>
  <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
    <aggregation type="union">
      <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
        <aggregation dimName="time" type="joinExisting">
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.00010101-01001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.01010101-02001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.02010101-02201231.nc" ncoords="7300" />
        </aggregation>
      </netcdf>
      <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
        <aggregation dimName="time" type="joinExisting">
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.00010101-01001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.01010101-02001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.02010101-02201231.nc" ncoords="7300" />
        </aggregation>
      </netcdf>
      <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
        <aggregation dimName="time" type="joinExisting">
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.00010101-01001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.01010101-02001231.nc" ncoords="36500" />
          <netcdf location="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.02010101-02201231.nc" ncoords="7300" />
        </aggregation>
      </netcdf>
    </aggregation>
  </netcdf>
</dataset>
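
Since the config above is generated automatically, the nesting (a union whose members are each a joinExisting over time, with explicit ncoords) can be sketched in a few lines of Python. This is not the generator actually used here; the function names `join_existing` and `union_of` are hypothetical, and only the standard-library `xml.etree.ElementTree` module is assumed.

```python
import xml.etree.ElementTree as ET

NCML_NS = "http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2"


def join_existing(files, dim_name="time"):
    """Build one <netcdf> element wrapping a joinExisting aggregation.

    files is a list of (location, ncoords) pairs, in time order.
    """
    outer = ET.Element("netcdf", {"xmlns": NCML_NS})
    agg = ET.SubElement(
        outer, "aggregation", {"dimName": dim_name, "type": "joinExisting"}
    )
    for location, ncoords in files:
        ET.SubElement(
            agg, "netcdf", {"location": location, "ncoords": str(ncoords)}
        )
    return outer


def union_of(joins):
    """Wrap several joinExisting <netcdf> elements in a union aggregation."""
    outer = ET.Element("netcdf", {"xmlns": NCML_NS})
    agg = ET.SubElement(outer, "aggregation", {"type": "union"})
    agg.extend(joins)
    return outer


if __name__ == "__main__":
    # Placeholder file names; real paths would come from the data directory.
    pr = join_existing([("file:/data/pr_part1.nc", 36500),
                        ("file:/data/pr_part2.nc", 7300)])
    print(ET.tostring(union_of([pr]), encoding="unicode"))
```

Supplying ncoords for every file, as the config above does, means the coordinate count per file is stated up front rather than discovered by opening each file.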


I'm open to any suggestions or ideas!

thanks -
kevin


John Caron wrote:
Hi Kevin:
I haven't had time to reproduce this yet, but i'm guessing one source of the slowdown is using opendap URLs in the compound aggregation. It would be interesting to time 1) the single aggregations, 2) the compound agg as it exists, and 3) the compound agg, but with the opendap URLs replaced by direct netcdf files.
see attached file
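
The three-way timing comparison suggested above could be scripted roughly as follows. This is a minimal sketch, not part of the original exchange: the function names and the placeholder URLs in the comments are assumptions, and each timing simply measures one full HTTP fetch of whatever URL is supplied (for example the OPeNDAP html page of each dataset).

```python
import time
import urllib.request


def time_open(url, opener=urllib.request.urlopen):
    """Return the seconds taken to fetch and fully read url."""
    start = time.perf_counter()
    with opener(url) as response:
        response.read()
    return time.perf_counter() - start


def time_cases(cases, opener=urllib.request.urlopen):
    """Time each labelled URL once; returns {label: seconds}."""
    return {label: time_open(url, opener) for label, url in cases.items()}


# Usage idea (the URLs are placeholders to be replaced with real ones):
#   cases = {
#       "single joinExisting": "http://<server>/thredds/dodsC/<single>.html",
#       "union via OPeNDAP URLs": "http://<server>/thredds/dodsC/<union>.html",
#       "union via direct files": "http://<server>/thredds/dodsC/<union2>.html",
#   }
#   for label, seconds in time_cases(cases).items():
#       print(f"{label}: {seconds:.1f} s")
```

Running it once right after a Tomcat restart and once again afterwards would separate the first-access cost from the cached cost.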


--
Kevin O'Brien                   UW/JISAO        
Research Scientist              NOAA/PMEL/TMAP
206-526-6751                    http://www.pmel.noaa.gov

"The contents of this message are mine personally and do not necessarily reflect any position of the Government or the National Oceanic and Atmospheric Administration."
