Project Homepage:

Data collections can be seen on:!/group=BioGrid%20Australia%20Ltd/tab=collection

Software is available at:

Programming language(s):

Java, XSLT, XML. System: IBM InfoSphere Information Server

Software categories:


Project Members:

Owen O'Neill (Project Manager)

King Chiu (Lead Developer)

Naomi Rafael (Data Source Administrator)

ANDS Contact:

Richard Ferrers

Project Status:


Capture of Complex Data to Support Clinical Research in Cardiovascular and Neurological Medicine

University of Melbourne

Collaborator(s): VeRSI, Biogrid Australia

Project Description:

BioGrid Australia Limited provides a secure research platform and infrastructure that enables access to real-time clinical, imaging and biospecimen data across jurisdictions, institutions and diseases. This web-based platform provides ethical access while protecting both privacy and intellectual property. As modern research and planning become more complex, the need for collaboration in research increases significantly. The platform can uniquely identify and ethically integrate multimodal data collected about a consumer across multiple institutions. BioGrid Australia gives researchers authorised access to data on many disease types, drawn from disparate existing databases at many institutions.

The number of databases at BioGrid continues to grow each year. To display current database information consistently, BioGrid uses a tool called the Business Glossary, which is published on the Web. Due to time constraints, the content of this Glossary is still preliminary and limited. A significant challenge for BioGrid is to represent the data it manages consistently and accurately to the research public. The ANDS project has given impetus to increase and standardize the information presented on the web, both in the Business Glossary and, consequently, in Research Data Australia (RDA).

Not all data custodians gave authorization to publish information about their datasets to ANDS. At this stage, approximately 20 databases are published, and this number is expected to grow. Meanwhile, the information for the other databases will be enhanced to comply with the ANDS standards, so that publication will be simple if and when the data custodian chooses to publish.
Publishing information about BioGrid datasets to ANDS will increase the likelihood that researchers with similar interests find out about them. The aim is to encourage greater collaboration in the research areas that BioGrid supports.

--> original description

"Complex physiological data is routinely collected on patients as part of clinical care (echocardiography, intravascular ultrasound, x-ray angiography, optical computerised tomography, patient clinical data, etc.). However, this rich multi-modal data is not usually subjected to subsequent analysis, nor is it made available to researchers from other disciplines for novel analysis. Making this multi-modal data available along with patient outcomes such as morbidities will provide the opportunity for collaborative groups to employ novel strategies to develop assessments and models based on this data. This project will form the necessary base for making multi-modal data collections available, enabling the establishment of new links between biomedical research groups in engineering, physics and bioinformatics. This project will occur in collaboration with BioGrid Australia, where it will use the access, de-identification and privacy protection protocols already established there."
The major activities were:
- Map BioGrid metadata to the ANDS schema,
- Write a service to populate ANDS-compliant metadata,
- Develop a service to harvest ANDS-compliant metadata from multiple BioGrid data sets which form a single study,
- The original intention was to enable the assignment of globally unique identifiers that link to the source of multi-modal datasets.... During the project, persistent identifiers were deemed not to be required. The project team will monitor the need for persistent identifiers and may implement them in future.
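
The mapping and metadata-population activities listed above can be illustrated with a minimal Java sketch. It is not the project's actual service (which is not shown here) and assumes a deliberately small subset of RIF-CS: one collection record with a key, an originating source, a primary name and a brief description. The class RifCsSketch, the DatasetInfo holder, the key "biogrid.example/dataset-1" and the originating source URL are hypothetical names introduced only for illustration; the group name is the one used for BioGrid collections in Research Data Australia.

```java
import java.io.StringWriter;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Illustrative sketch only: maps one dataset description to a small RIF-CS record.
class RifCsSketch {
    // RIF-CS registryObjects namespace.
    static final String RIFCS_NS = "http://ands.org.au/standards/rif-cs/registryObjects";
    static final String GROUP = "BioGrid Australia Ltd";
    // Hypothetical originating source, used only for this example.
    static final String ORIGINATING_SOURCE = "https://biogrid.example";

    // Hypothetical, simplified view of one dataset entry.
    static final class DatasetInfo {
        final String key;
        final String name;
        final String description;

        DatasetInfo(String key, String name, String description) {
            this.key = key;
            this.name = name;
            this.description = description;
        }
    }

    // Appends a namespaced child element, with optional text content.
    static Element child(Document doc, Element parent, String tag, String text) {
        Element e = doc.createElementNS(RIFCS_NS, tag);
        if (text != null) {
            e.setTextContent(text);
        }
        parent.appendChild(e);
        return e;
    }

    // Builds the record as a DOM tree and serialises it to an XML string.
    static String toRifCs(DatasetInfo d) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        Document doc = factory.newDocumentBuilder().newDocument();

        Element root = doc.createElementNS(RIFCS_NS, "registryObjects");
        doc.appendChild(root);

        Element registryObject = child(doc, root, "registryObject", null);
        registryObject.setAttribute("group", GROUP);
        child(doc, registryObject, "key", d.key);
        child(doc, registryObject, "originatingSource", ORIGINATING_SOURCE);

        Element collection = child(doc, registryObject, "collection", null);
        collection.setAttribute("type", "dataset");
        Element name = child(doc, collection, "name", null);
        name.setAttribute("type", "primary");
        child(doc, name, "namePart", d.name);
        Element description = child(doc, collection, "description", d.description);
        description.setAttribute("type", "brief");

        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        transformer.setOutputProperty(OutputKeys.INDENT, "yes");
        StringWriter out = new StringWriter();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        DatasetInfo example = new DatasetInfo(
                "biogrid.example/dataset-1",
                "Example cardiovascular imaging dataset",
                "Placeholder description of a de-identified dataset.");
        System.out.println(toRifCs(example));
    }
}
```

Building the record through the DOM API rather than by string concatenation means that special characters in names and descriptions are escaped by the serialiser. A production service would also need the remaining RIF-CS elements (for example related objects, subjects and access information) and validation against the published schema.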

Data Type:

"echocardiography, intravascular ultrasound, x-ray angiography, optical computerised tomography, patient clinical data, etc."

High Level Software Functionality:

Features: "PD: Deployed, tested, documented system that:
D3 High level design documents:
a. Mapping of BioGrid datasets to RIF-CS.
b. Design of service to generate RIF-CS.
c. Process descriptions, including ethics approvals, ETL processes, de-identification and metadata annotation."
Download link: