Project promotion materials:

Project Homepage:

https://wiki.csiro.au/display/CASDA/CASDA+Project+Wiki

Categories:

Observational Instruments (e.g. from telescope, camera, etc.)

Integration metadata from various systems which are internal to an institution

Metadata Feed/Harvest/Publish

Project Members:

Dan Miller

ANDS Contact:

Cynthia Love (cynthia.love@ands.org.au)

Project Status:

Completed

CSIRO ASKAP Science Data Archive Major Open Data Collection

CSIRO

Project Description

The CSIRO Australian Square Kilometre Array Pathfinder (ASKAP) project requires a Science Data Archive to store and manage the data generated by ASKAP operations at the Murchison Radio Observatory (MRO), and to make that data available for discovery and access. The CSIRO ASKAP Science Data Archive (CASDA) Project has been established to build and implement the archive at the Pawsey Centre and to build the data transfer infrastructure required between the MRO and the Pawsey Centre.

Data arrives at the Pawsey Centre at approximately 2.5 gigabytes (GB) per second, equivalent to about 75 petabytes (PB) per year. This exceeds current archiving capacity, so the majority of the data will be processed in quasi-real time and only a subset will be archived. The volume of archived data is expected to reach 5 PB per year.
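The figures above can be checked with a short back-of-envelope calculation. The sketch below is illustrative only: the text does not say which prefix convention it uses, so both decimal and binary conversions are shown as assumptions. The quoted 75 PB per year matches the binary reading most closely. The archived fraction is derived from the two quoted figures and is not stated in the text.

```python
# Back-of-envelope check of the data volumes quoted in the text.
SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # 31,536,000 s, ignoring leap years

rate_gb_per_s = 2.5          # ingest rate quoted in the text
quoted_pb_per_year = 75.0    # annual volume quoted in the text
archived_pb_per_year = 5.0   # archived volume quoted in the text

gb_per_year = rate_gb_per_s * SECONDS_PER_YEAR

# Assumption: the prefix convention is not stated, so compute both.
pb_decimal = gb_per_year / 1000**2  # 1 PB = 10^6 GB
pb_binary = gb_per_year / 1024**2   # 1 PB = 2^20 GB

# Share of the incoming data that ends up in the archive.
archived_fraction = archived_pb_per_year / quoted_pb_per_year

print(f"decimal prefixes: {pb_decimal:.1f} PB/year")
print(f"binary prefixes:  {pb_binary:.1f} PB/year")
print(f"archived fraction: {archived_fraction:.1%}")
```

On these numbers, roughly one fifteenth of the incoming data is retained, which is why most processing has to happen before archiving.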

This ANDS project will be carried out as a sub-component of the broader CSIRO CASDA project. It will modify the data transfer and archive systems to improve metadata capture and management, and to enable publication and syndication of collection metadata to Research Data Australia (RDA). The funding will also help enable CSIRO DOIs to resolve to collections that are stored outside the CSIRO Data Access Portal (DAP).

This project will:
- define the collections in the ASKAP data
- implement links from the CSIRO Data Access Portal metadata store to the data store (Science Data Archive) at the Pawsey Centre
- implement DOI assignment and resolution on collections as they are generated
- feed records to ANDS RDA
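
As a rough illustration of how these pieces could fit together, the sketch below models a collection record whose DOI resolves to a location outside the metadata store and which can be exported as a simple feed entry. It is not the CASDA or DAP implementation: the class, function, and field names are hypothetical, and the DOI and URL are placeholders rather than real identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CollectionRecord:
    """Hypothetical metadata record for one data collection."""
    doi: str          # placeholder identifier, not a real DOI
    title: str
    landing_url: str  # where the data lives, e.g. an archive outside the portal


def register(resolver: dict, record: CollectionRecord) -> None:
    """Assign a DOI to a collection by adding it to an in-memory resolver."""
    if record.doi in resolver:
        raise ValueError(f"DOI already assigned: {record.doi}")
    resolver[record.doi] = record


def resolve(resolver: dict, doi: str) -> str:
    """Return the location that a DOI currently points to."""
    return resolver[doi].landing_url


def to_feed_record(record: CollectionRecord) -> dict:
    """Build a simple dictionary that a metadata feed could serialise."""
    return {
        "identifier": {"type": "doi", "value": record.doi},
        "name": record.title,
        "location": record.landing_url,
    }


resolver = {}
example = CollectionRecord(
    doi="10.0000/example-collection",  # placeholder
    title="Example ASKAP collection",
    landing_url="https://archive.example.org/collections/1",  # placeholder
)
register(resolver, example)
feed_entry = to_feed_record(example)
```

Keeping the landing location in the record, rather than deriving it from the portal, is what lets a DOI point at data held in a separate archive.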