[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: THREDDS performance [was Re: THREDDS and grib]

This archive contains answers to questions sent to Unidata support through mid-2025. Note that the archive is no longer being updated. We provide the archive for reference; many of the answers presented here remain technically correct, even if somewhat outdated. For the most up-to-date information on the use of NSF Unidata software and data services, please consult the Software Documentation first.

Subject: Re: THREDDS performance [was Re: THREDDS and grib]
Date: Tue, 27 May 2008 11:43:55 -0600

Hi Kevin:

no, but im hoping to switch my attention to TDS next week and start to resolve 
a number of performance issues. thanks for your patience.

Kevin O'Brien wrote:

Hi John -
Not to be a pest - but I was wondering if you'd had a chance to look atthese performance issues, or even been able to recreate them?
Thanks -
kevin

John Caron wrote:
these are all good questions - there have been similar reports of theagg cache not working like it should. i will have to reproduce to seewhats happening.
Kevin O'Brien wrote:
Hi John -
I tried what you suggested and it didn't seem to have a significanteffect in making the initial access of the aggregated datasetquicker. It still took over a minute and a half to open thedataset. I've pasted the xml config that I used to define the newaggregation below. To be honest, I'm actually kind of glad because Iwasn't looking forward to modifying the guts of the application whichgenerates the xml config automatically.... :-)
I guess I can understand and probably even accept the fact that forthe first time the dataset is accessed, things will be a littleslow. After that, I presume the dataset is available in the cache,and of course subsequent accesses prove that it is because theresponse is quite quick. However, if the tomcat server isrestarted, it seems like whatever is in the cache is ignored and thecache entries have to be rebuilt. I have my aggregation cache setlike so:
 <AggregationCache>
<dir>/home/pmel/DataPortal/apache-tomcat-5.5.25/content/thredds/cacheAged/</dir>
   <scour>24 hours</scour>
   <maxAge>90 days</maxAge>
</AggregationCache> Does that seem correct? Also, as an aside, youmention that you thought this would be quicker because it avoids theOPeNDAP URL's....Shouldn't there be some client side caching done w/the OPeNDAP datasets? For example, if I access a remote dataset withncdump (or Ferret), and my OPeNDAP caching is turned on my ~/.dodsrcfile, it will cache the response in the ~/.dods_cache directory.Does any of that happen when OPeNDAP URL's are accessed through TDS???
Anyway - here's the xml config I used as per your suggestion:
<dataset ID="CM2.1U-D4_1PctTo2X_I1 atmos daily all vars00010101-02201231_2" name="CM2.1U-D4_1PctTo2X_I1 atmos daily all vars00010101-02201231_2"urlPath="ipcc_ar4_CM2.1_R1_1to2x-1_daily_atmos_00010101-02201231_2">
       <serviceName>thisDODS3</serviceName>
<netcdfxmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
         <aggregation type="union">
<netcdfxmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
              <aggregation dimName="time" type="joinExisting">
<netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.00010101-01001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.01010101-02001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/pr_A2.02010101-02201231.nc"ncoords="7300" />
              </aggregation>
            </netcdf>
<netcdfxmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
              <aggregation dimName="time" type="joinExisting">
<netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.00010101-01001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.01010101-02001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmax_A2.02010101-02201231.nc"ncoords="7300" />
              </aggregation>
            </netcdf>
<netcdfxmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2";>
              <aggregation dimName="time" type="joinExisting">
<netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.00010101-01001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.01010101-02001231.nc"ncoords="36500" /><netcdflocation="file:/data/gfdl_cm2_1/CM2.1U-D4_1PctTo2X_I1/pp/atmos/ts/daily/tasmin_A2.02010101-02201231.nc"ncoords="7300" />
              </aggregation>
            </netcdf>
         </aggregation>
       </netcdf>
   </dataset>


I'm open to any suggestions or ideas!

thanks -
kevin


John Caron wrote:
Hi Kevin:
I havent had time to reproduce this yet, but im guessing one sourceof the slowdown is using opendap URLS in the compound aggregation.It would be interesting to time 1) the single aggregations, 2) thecompound agg as it exists, and 3) the compound agg, but replace theopendap URLs with direct netcdf files,
see attached file

References:
- Re: THREDDS and grib
  - From: John Caron
- Re: THREDDS and grib
  - From: John Caron

Prev by Date: Re: THREDDS and grib
Next by Date: Re: CF support in netcdf-java
Previous by thread: Re: THREDDS and grib
Next by thread: Re: THREDDS performance [was Re: THREDDS and grib]
Index(es):
- Date
- Thread