ANDS Logo

Data collections can be seen on:

http://services.ands.org.au/home/orca/rda/search#!/group=Tasmanian%20Partnership%20for%20Advanced%20Computing/tab=collection;

Software is available at:

http://tpac-dlp.svn.sourceforge.net/viewvc/tpac-dlp/GeoNetwork

Categories:

Metadata Store Solutions

Metadata Feed/Harvest/Publish

Project Members:

Peter Blain (peter.blain@utas.edu.au,peter.walsh@utas.edu.au)

Peter Walsh (peter.walsh@utas.edu.au)

Paola Petrelli (Paola.Petrelli@utas.edu.au)

Natalia Atkins (Natalia.Atkins@utas.edu.au)

ANDS Contact:

Mingfang Wu (Mingfang.Wu@ands.org.au)

Project Status:

Completed

Publication of collections into the ARDC by TPAC and IMAS of UTAS

University of Tasmania

Project Description

Tasmanian Partnership for Advanced Computing (TPAC) hosts and publishes an extensive range of data sets from around 150 separate collections. The collections hosted and/or published by TPAC comprise data from approximately one million NetCDF data sets, and total approximately 50 terabytes (at TPAC) and a further 30 Terrabytes as part of this federation of OpeNDAP servers at CSIRO and ANU http://dl.tpac.org.au/ . The collections are produced by research organisations both internal and external to UTAS. These include the following:

IPCC Model Output Results
NCEP-DOE Reanalysis
Ocean Modelling Results
WOCE Global Data V3.0
Oceanographic Datasets [TPAC/CSIRO]
Australian Antarctic Division Datasets
Climate Futures for Tasmania
Bluelink Reanalysis (BRAN) output
ACECRC Sea Ice Forecasting Project
World Ocean Database 2005 (WOD05)
International Datasets
Bathymetry - Topography - Coastlines
BMRC - Bureau of Meteorology Research Centre
Remote Sensing Data
IMOS

Institution for Marine and Antarctic Studies (IMAS) / Tasmanian Aquaculture and Fisheries Institute (TAFI) within UTAS have also a large quantity of marine research data. Some of these datasets have meta-data stored at the IMOS Geo-Network MEST. One of the project aims was to identify and describe the remainder. 239 collection records (and 87 party records) were described and made available for harvesting by the ARDC.

The aims of the project were:
- To describe the entire contents of the TPAC digital library using RIF-CS. The current descriptions were augmented to comply with RIF-CS and stored on site in a metadata store, and harvested into the ARDC.
- To discover and describe IMAS/TAFI marine research community data using RIF-CS. The descriptions were stored in the IMOS Geo-Network MEST, and harvested into the ARDC.
- To develop and deliver internal training programs and resources for research data management for both TPAC and IMAS/TAFI data collections. This included identifying and documenting the processes required to ensure continued submission of meta-data to the ARDC for all future TPAC and IMAS/TAFI data collections.
- To enhance existing web software tools and services to allow users to create, store and manage research data collections along with their metadata descriptions, persistent identifiers and digital object identifiers.

Data Type:

Please see the project description.

High Level Software Functionality:

TPAC Digital library enhancements:
The TPAC DLP software has been extended to include additional metadata as it relates to the datasets. This metadata includes: additional long title, additional ANDS friendly description, citation, access rights (previously part of description), related individuals and their detail, related groups and their detail, relationships between datasets, and ANZSRC codes.


IMAS MEST:
1) Use the original XSLT file to start customization. Don’t build from scratch, unless the requirements for the translation are very different.
2) Always use a version control system to keep the various versions of modified XSLT files.
3) Use an agile approach to incrementally modify the original XSLT file, when there is a large number of metadata and some metadata might have different translation requirements.
4) If there is a large amount of metadata, set up a local XSLT project to develop and test the XSLT file. By default, GeoNetwork only displays the first 10 metadata records in web browsers. To display the rest, you must use the OAI-PMH token to rotate the rest of the metadata records. Therefore, it is very difficult to manually test the specific metadata records. A local XSLT project enables you to check results for any metadata records. Automated testing is also good to have, since it is difficult to examine metadata one by one.
5) It is advisable to use the ANDS API to validate metadata records before sending to ANDS, since the ANDS sandbox doesn’t expose the metadata record identifier. This makes it difficult to trace metadata records if there are problems.

All the source code is available on Sourceforge.

The TPAC Digital Library:
http://tpac-dlp.svn.sourceforge.net/viewvc/tpac-dlp/

GeoNetwork:
https://geonetwork.svn.sourceforge.net/svnroot/geonetwork/trunk

The GeoNetwork RIF-CS translation component:
http://sourceforge.net/projects/andsstcproject/

ANZSRC-FOR code:

04 EARTH SCIENCES
05 ENVIRONMENTAL SCIENCES
06 BIOLOGICAL SCIENCES
07 AGRICULTURAL AND VETERINARY SCIENCES