Roy Mendelssohn wrote:
Hi John: Some questions on THREDDS aggregation. 1. Is there a limit to the number of files that can be aggregated over?
Nope, no limit.
2. Can aggregation occur over sub-directories of a directory structure?
Using "scan", I assume? Supposedly you can have multiple scan directives within the aggregation, but it's not well tested. I would try this if you need the feature.
The scan directive is still pretty primitive; we will continue to improve it, and adding a "recurse" tag might be one way. Remember that the aggregated files need to be pretty much homogeneous.
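For what it's worth, a joinExisting aggregation with two scan elements might look like the following NcML. This is just a sketch, not tested; the directory paths and suffix are placeholders, and the element names follow the NcML-2.2 aggregation schema:

    <netcdf xmlns="http://www.unidata.ucar.edu/namespaces/netcdf/ncml-2.2">
      <aggregation dimName="time" type="joinExisting">
        <!-- one scan element per directory; each picks up every .nc file there -->
        <scan location="/data/run1/" suffix=".nc" />
        <scan location="/data/run2/" suffix=".nc" />
      </aggregation>
    </netcdf>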
3. For a lot of time periods, when aggregating fields over time, do you have any feel for the trade-off in aggregation speed between the size of the netCDF files and the number of files aggregated over? (I.e., if we have 6-hourly data, should we produce 6-hourly files, daily files, weekly files, or monthly files, and what would be the likely speed tradeoff if we want to extract a time series of a relatively small region?)
My intuition is that you want to create fewer large files, not lots of little files. It costs the same to open a big file or a little one. My current rule of thumb is to try to write files of 50-200 Mbytes. In the future we may add a feature that opens all the needed files in different threads; that might argue for smaller files, but it's theoretical at this point.
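To put rough numbers on that rule of thumb (a hypothetical example, not from any particular dataset): a single-precision global 0.25-degree field is 1440 x 720 points, about 4 Mbytes per time step. With 6-hourly data, a daily file would hold 4 steps (~17 Mbytes), a weekly file 28 steps (~116 Mbytes), and a monthly file about 120 steps (~500 Mbytes), so weekly files would land squarely in the 50-200 Mbyte range.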
TIA, -Roy