Hi Ethan,
I'm not sure I fully grokked what you said to me, so I've just included my
catalog generator file without further modification.
I have data living on disk in /data/pymars/2004/netcdf_anal, and
/data/pymars/2004/netcdf_fore. I would like to set up the catalog generator to
crawl the /data/pymars directory and publish what it finds there -- no
requirement for very intelligent structuring at this stage.
The dods server is running on localhost:8010.
I'm not entirely certain what version is running, but it is whatever is current
on the web page as of about 2 months ago. I look forward to the new version,
and the simpler configuration!
I wasn't sure what I had to do with all that pattern matching stuff, so I
decided to just leave it unchanged from the example, and just see what
happened. I imagine I have to replace the datasetFilter to accept *.nc, or some
other pattern of my choosing. I couldn't work out if the dataset namer was
mandatory or not. I'd really just like to capture everything, and am happy with
the title being the filename at this stage.
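A minimal sketch of the filter change described above, written in Python rather than CatalogGen itself. It assumes the datasetFilter matchPattern is searched against the full file path, which is a guess based on the example patterns. The file names and the `accepted` helper are hypothetical, since the actual names under /data/pymars are not listed here.

```python
import re

# Filter pattern copied from the example config; it only accepts names such as
# 2004060312_eta_211.nc sitting directly after a "/".
EXAMPLE_FILTER = re.compile(r"/[0-9][^/]*_eta_211\.nc$")

# Assumed replacement pattern: accept any path that ends in ".nc".
ANY_NETCDF = re.compile(r"\.nc$")

# Hypothetical paths, for illustration only.
paths = [
    "/data/pymars/2004/netcdf_anal/sample_anal.nc",
    "/data/pymars/2004/netcdf_fore/sample_fore.nc",
    "/data/pymars/2004/netcdf_fore/README.txt",
]


def accepted(pattern, candidates):
    """Return the candidates in which the pattern is found (assumed search semantics)."""
    return [p for p in candidates if pattern.search(p)]


print(accepted(EXAMPLE_FILTER, paths))  # the unchanged example filter accepts none of these
print(accepted(ANY_NETCDF, paths))      # the two .nc paths are accepted
```

If the filter does behave this way, leaving the example pattern unchanged would produce an empty catalog for this directory tree even once the other settings are right.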
Cheers,
-Tennessee
<?xml version="1.0" encoding="UTF-8"?>
<!-- $Id: catGenConf.exampleLocal.xml,v 1.2 2004/06/03 20:38:07 edavis Exp $ -->
<!--
- Simple example CatalogGenConfig file.
-->
<!DOCTYPE catalog SYSTEM
"http://www.unidata.ucar.edu/projects/THREDDS/xml/CatalogGenConfig.0.5.dtd">
<catalog name="THREDDS CatalogGen test config file" version="0.6">
<dataset name="THREDDS CatalogGen test config file">
<dataset name="NCEP Eta 80km CONUS model data">
<metadata metadataType="CatalogGenConfig">
<catalogGenConfig type="Catalog">
<datasetSource name="Local Disk Data Sets" type="Local"
structure="DirTree"
accessPoint="/data/pymars">
<resultService name="linuxdev" serviceType="DODS"
base="http://localhost:8010/thredds/cataloggen/"
accessPointHeader="/home/tjl/jakarta-5.0.28/content/thredds/cataloggen/"/>
<datasetFilter name="Accept netCDF files only" type="RegExp"
matchPattern="/[0-9][^/]*_eta_211\.nc$"/>
<datasetNamer name="NCEP Eta 80km CONUS model data"
type="RegExp" addLevel="false"
matchPattern="([0-9][0-9][0-9][0-9])([0-9][0-9])([0-9][0-9])([0-9][0-9])_eta_211.nc$"
substitutePattern="NCEP Eta 80km CONUS $1-$2-$3 $4:00:00 GMT"/>
</datasetSource>
</catalogGenConfig>
</metadata>
</dataset>
<dataset name="NCEP GFS 80km CONUS model data">
<metadata metadataType="CatalogGenConfig">
<catalogGenConfig type="Catalog">
<datasetSource name="model data source" type="Local"
structure="Flat"
accessPoint="./content/thredds/cataloggen/testData/model">
<resultService name="mlode" serviceType="DODS"
base="http://localhost:8080/thredds/cataloggen/"
accessPointHeader="./content/thredds/cataloggen/"/>
<datasetFilter name="Accept netCDF files only" type="RegExp"
matchPattern="/[0-9][^/]*_gfs_211\.nc$"/>
<datasetNamer name="NCEP GFS 80km CONUS model data"
type="RegExp" addLevel="false"
matchPattern="([0-9][0-9][0-9][0-9])([0-9][0-9])([0-9][0-9])([0-9][0-9])_gfs_211.nc$"
substitutePattern="NCEP GFS 80km CONUS $1-$2-$3 $4:00:00 GMT"/>
</datasetSource>
</catalogGenConfig>
</metadata>
</dataset>
</dataset>
</catalog>
Ethan Davis wrote:
Tennessee Leeuwenburg wrote:
Ethan Davis wrote:
Hi Tennessee,
Did you edit the config.xml file (which sets up the tasks) as well as
the cat gen config file? I guess you must have if it is showing up in
the interface. Make sure the period value is not set to zero; if it
is, the task won't be run. Are you getting any messages in the log
files? What version of the server are you running? Is this a publicly
available server? If so, send me the URL and I'll take a look at the
config files.
Sorry these config file formats are so ugly. We're working on
simplifying and cleaning up the configuration throughout the server.
But for now ...
Well, as long as you're willing to help me, ugly is fine :)
More than willing to help. But I want simpler because it would make
it easier for me to remember what is going on :)
After making that change, the server started to process the various
files. The example DODS catalog was generated fine, but the example
filesystem catalog and my own filesystem catalog both failed with
similar messages. I've appended the results.
I think I'm failing to understand what exactly the serviceName, base
and accessPointHeader are actually used for.
As with regular catalogs, I assume one is used for reconstructing the
URL to the file to be resourced, and the other is used for constructing
the URL to be used in an OpenDAP request, but it's not clear to me
exactly what is happening. I read the documentation, but it was a bit
hand-wavy about the specifics.
The accessPoint is the directory that is to be scanned for data
files. The accessPointHeader is a parent directory of the accessPoint
directory and is used to remove the part of the data file path that
is not to appear in the resulting dataset access URL. The base value
is the URL for the OPeNDAP server that is serving your data. For
instance, if you want to crawl the /my/data/radar/level3/FTG
directory and a resulting dataset access URL is something like
http://.../nph-dods/radar/level3/FTG/file.nc, you would want
something like
<datasetSource name="model data source" type="Local" structure="Flat"
accessPoint="/my/data/radar/level3/FTG">
<resultService name="mlode" serviceType="DODS"
base="http://.../nph-dods/"
accessPointHeader="/my/data/"/>
<datasetFilter ... />
<datasetNamer ... />
</datasetSource>
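The mapping described above can be sketched in a few lines of Python. This is only an illustration of the path arithmetic, not the actual CatalogGen code; the function name `access_url` and its arguments are made up for the example, and the elided host in the base URL is left as it appears above.

```python
def access_url(base, access_point_header, file_path):
    """Build a dataset access URL by stripping the header from the file path.

    Illustrative only: strip the accessPointHeader prefix from the data file
    path, then append what is left to the service base URL.
    """
    header = access_point_header.rstrip("/") + "/"
    if not file_path.startswith(header):
        # The header has to be a parent directory of the scanned files.
        raise ValueError(f"{access_point_header!r} is not a parent of {file_path!r}")
    relative = file_path[len(header):]
    return base.rstrip("/") + "/" + relative


print(access_url(
    "http://.../nph-dods/",               # base
    "/my/data/",                          # accessPointHeader
    "/my/data/radar/level3/FTG/file.nc",  # a file found under accessPoint
))
# prints http://.../nph-dods/radar/level3/FTG/file.nc
```

Seen this way, an accessPointHeader that is not a parent directory of the accessPoint leaves nothing sensible to strip from the file path.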
Does that clear things up at all? If not, feel free to send me your
config file to look at.
Sorry about the documentation. It isn't all that clear and I haven't
put much effort into it since we decided to move to a simpler config
file format. Not sure what's up below with the example file system
dataset. I must have broken something at some point.
What version of the cat gen servlet (or THREDDS server) are you running?
Ethan
PS In the new TDS, catalogs for the data it is serving are
automatically generated and the config files are much simpler than
these.
Thanks for your help,
-T
<catalog name="THREDDS CatalogGen test config file" version="0.6">
<dataset name="THREDDS CatalogGen test config file">
<service name="linuxdev" serviceType="DODS"
base="http://localhost:8010/thredds/cataloggen/"/>
<service name="mlode" serviceType="DODS"
base="http://localhost:8080/thredds/cataloggen/"/>
<dataset name="NCEP Eta 80km CONUS model data">
<dataset name="The DatasetSource &quot;Local Disk Data Sets&quot; could not be
expanded. The accessPointHeader
(/home/tjl/jakarta-5.0.28/content/thredds/cataloggen/) is not a
directory." serviceName="linuxdev"/>
</dataset>
<dataset name="NCEP GFS 80km CONUS model data">
<dataset name="The DatasetSource &quot;model data source&quot; could not be
expanded. The accessPointHeader (./content/thredds/cataloggen/) is
not a directory." serviceName="mlode"/>
</dataset>
</dataset>
</catalog>