
Software is available at:

https://sourceforge.net/projects/mediacrawl

Software categories:

Integration of metadata from various systems that are internal to an institution

Project Members:

Danqing Zhang (Project Manager)

Dr. Joseph Young (Project Manager, j.young@qut.edu.au)

Lance De Vine (Lead Developer, l.devine@qut.edu.au)

Yi Sun (Developer, Y14.sun@qut.edu.au)

Professor Phil Graham (Research Officer)

Stephanie Bradbury (Data Librarian, s.bradbury@qut.edu.au)

Craig Milne (Data Librarian, c2.milne@qut.edu.au)

ANDS Contact:

Andrew White (Andrew.White@ands.org.au)

Project Status:

Completed

B150BigJam

Queensland University of Technology

Project Description:

The Q150 Big Jam featured a roster of 900 musicians improvising across a number of genres, performing and recording non-stop music for 150 hours.
Software was written using Eclipse to facilitate the management of metadata relating to the multimedia data captured.
Management of metadata covers annotation, mapping and querying of multimedia data.

The Q150 Data Capture Project aimed to study the management of metadata relating to multimedia data captured during the Q150 Big Jam Live Music Festival. The multimedia data includes video, images and text recorded during the festival about the music, bands, artists and other background information.

A Java-based desktop application, "Media Crawler", was developed for this purpose through the data capture project. The application provides a view onto the file system containing the multimedia assets and allows the user to annotate, search and group the recorded assets. Metadata is automatically extracted from various file formats, and other metadata can be imported as well, for example from CSV files. The software is meant to facilitate this process so that views, or 'collections', may be defined and exported with the associated metadata. Collection metadata may also be exported as XML.
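The CSV import and XML export described above can be illustrated with a minimal sketch. It is not taken from the Media Crawler source: the class name `CsvToXmlSketch`, the CSV layout (a header row, with the file name in the first column, and no quoted fields) and the XML element names `collection`, `asset` and `field` are all assumptions made for illustration only.

```java
import java.io.StringWriter;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical sketch; names and formats are assumptions, not Media Crawler's own.
final class CsvToXmlSketch {

    // Import: the first line is a header row, the first column identifies the file.
    // Assumes simple comma-separated values without quoting.
    static Map<String, Map<String, String>> importCsv(List<String> lines) {
        Map<String, Map<String, String>> byFile = new LinkedHashMap<>();
        if (lines.isEmpty()) {
            return byFile;
        }
        String[] header = lines.get(0).split(",", -1);
        for (String line : lines.subList(1, lines.size())) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines
            }
            String[] cells = line.split(",", -1);
            Map<String, String> fields = new LinkedHashMap<>();
            for (int i = 1; i < header.length && i < cells.length; i++) {
                fields.put(header[i].trim(), cells[i].trim());
            }
            byFile.put(cells[0].trim(), fields);
        }
        return byFile;
    }

    // Export: write the collection and its per-asset metadata as XML.
    // The stream writer escapes special characters such as '&'.
    static String exportXml(String collectionName,
                            Map<String, Map<String, String>> assets) throws XMLStreamException {
        StringWriter out = new StringWriter();
        XMLStreamWriter xml = XMLOutputFactory.newInstance().createXMLStreamWriter(out);
        xml.writeStartDocument();
        xml.writeStartElement("collection");
        xml.writeAttribute("name", collectionName);
        for (Map.Entry<String, Map<String, String>> asset : assets.entrySet()) {
            xml.writeStartElement("asset");
            xml.writeAttribute("file", asset.getKey());
            for (Map.Entry<String, String> field : asset.getValue().entrySet()) {
                xml.writeStartElement("field");
                xml.writeAttribute("name", field.getKey());
                xml.writeCharacters(field.getValue());
                xml.writeEndElement();
            }
            xml.writeEndElement();
        }
        xml.writeEndElement();
        xml.writeEndDocument();
        xml.close();
        return out.toString();
    }

    public static void main(String[] args) throws XMLStreamException {
        List<String> csv = Arrays.asList(
                "file,artist,genre",
                "set1.wav,Band A & Friends,jazz");
        System.out.println(exportXml("Friday night", importCsv(csv)));
    }
}
```

A real importer would need a proper CSV parser to cope with quoted fields and embedded commas; the split-based version here is only meant to show the shape of the data flow.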

The software can crawl a given file system, extract metadata from a specific set of file formats and then index the metadata fields using a text indexer. It helps the user define a query with multiple search conditions, execute the query and view the results.
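The crawl, extract, index and query steps can be sketched roughly as follows. This is an illustration under stated assumptions, not the Media Crawler implementation: the class `CrawlIndexSketch` and its methods are hypothetical, only file-system metadata (name, extension, size) is extracted, and a plain in-memory map with exact, case-insensitive matching stands in for the text indexer.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical sketch; none of these names come from Media Crawler itself.
final class CrawlIndexSketch {
    // Inverted index: "field:value" -> files whose metadata has that value.
    private final Map<String, Set<Path>> index = new HashMap<>();

    // Crawl: visit every regular file under root and index its metadata fields.
    void crawl(Path root) throws IOException {
        List<Path> files;
        try (Stream<Path> walk = Files.walk(root)) {
            files = walk.filter(Files::isRegularFile).collect(Collectors.toList());
        }
        for (Path file : files) {
            for (Map.Entry<String, String> field : extract(file).entrySet()) {
                index.computeIfAbsent(key(field.getKey(), field.getValue()),
                        k -> new TreeSet<>()).add(file);
            }
        }
    }

    // Extract: file-system metadata only; a format-aware extractor would parse file contents.
    static Map<String, String> extract(Path file) throws IOException {
        Map<String, String> fields = new HashMap<>();
        String name = file.getFileName().toString();
        int dot = name.lastIndexOf('.');
        fields.put("name", name);
        fields.put("extension", dot >= 0 ? name.substring(dot + 1) : "");
        fields.put("sizeBytes", Long.toString(Files.size(file)));
        return fields;
    }

    // Query: a file matches only if it satisfies every condition (logical AND).
    Set<Path> search(Map<String, String> conditions) {
        Set<Path> result = null;
        for (Map.Entry<String, String> c : conditions.entrySet()) {
            Set<Path> hits = index.getOrDefault(key(c.getKey(), c.getValue()),
                    Collections.<Path>emptySet());
            if (result == null) {
                result = new TreeSet<>(hits);
            } else {
                result.retainAll(hits);
            }
        }
        return result == null ? new TreeSet<Path>() : result;
    }

    private static String key(String field, String value) {
        return field.toLowerCase(Locale.ROOT) + ":" + value.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) throws IOException {
        Path root = Files.createTempDirectory("media");
        Files.write(root.resolve("set1.WAV"), new byte[] {1, 2, 3});
        Files.write(root.resolve("notes.txt"), new byte[] {1});

        CrawlIndexSketch sketch = new CrawlIndexSketch();
        sketch.crawl(root);

        Map<String, String> query = new HashMap<>();
        query.put("extension", "wav");
        query.put("sizeBytes", "3");
        System.out.println(sketch.search(query)); // only set1.WAV meets both conditions
    }
}
```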

The software provides the user with tools to organise assets and metadata via the creation of "Smart Folders" (folders defined via a query) and also by dragging and dropping individual items into a collection. It also allows the user to view objects in relation to each other via a graphical timeline view.
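The idea of a folder defined via a query can be illustrated with a short sketch. The types `Asset` and `SmartFolder` and the helper `fieldEquals` are hypothetical names chosen for this example and are not taken from the Media Crawler code. The point is that a smart folder stores a query instead of a fixed member list, so its contents are recomputed whenever it is opened.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

// Hypothetical sketch of a query-defined folder; not Media Crawler's own classes.
final class SmartFolderSketch {

    static final class Asset {
        final String name;
        final Map<String, String> metadata;

        Asset(String name, Map<String, String> metadata) {
            this.name = name;
            this.metadata = metadata;
        }
    }

    // A smart folder keeps the query, not the matching assets.
    static final class SmartFolder {
        final String title;
        final Predicate<Asset> query;

        SmartFolder(String title, Predicate<Asset> query) {
            this.title = title;
            this.query = query;
        }

        // Re-evaluate the query against the current library on every call.
        List<Asset> contents(List<Asset> library) {
            List<Asset> matches = new ArrayList<>();
            for (Asset asset : library) {
                if (query.test(asset)) {
                    matches.add(asset);
                }
            }
            return matches;
        }
    }

    // One search condition: the named field equals the value, ignoring case.
    static Predicate<Asset> fieldEquals(String field, String value) {
        return asset -> value.equalsIgnoreCase(asset.metadata.get(field));
    }

    // Small helper to build metadata for the demo below.
    static Map<String, String> meta(String genre, String type) {
        Map<String, String> m = new HashMap<>();
        m.put("genre", genre);
        m.put("type", type);
        return m;
    }

    public static void main(String[] args) {
        List<Asset> library = new ArrayList<>();
        library.add(new Asset("set1.wav", meta("jazz", "audio")));
        library.add(new Asset("set2.wav", meta("rock", "audio")));

        SmartFolder jazzAudio = new SmartFolder("Jazz audio",
                fieldEquals("genre", "jazz").and(fieldEquals("type", "audio")));
        System.out.println(jazzAudio.contents(library).size()); // 1

        // A newly added asset shows up without editing the folder.
        library.add(new Asset("set3.wav", meta("Jazz", "audio")));
        System.out.println(jazzAudio.contents(library).size()); // 2
    }
}
```

A manually assembled collection, by contrast, would simply be a stored list of assets that changes only when the user drags items in or out.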

The "Media Crawler" software contains built-in schemas for describing document, audio, video and image data, but is extensible so that further schemas, content analysers, mappers and exporters can be added. The software was developed to facilitate the management of media generated by Q150 live music events, but it can be extended to support publishing data captured at other local, state and national music festivals. It also has the potential to be used for managing general multimedia in all disciplines.
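One common way to make content analysers pluggable is a small interface plus a registry, sketched below. This is only an assumption about how such an extension point could look: the interface `ContentAnalyser`, the class `AnalyserRegistrySketch` and the field name `schema` are hypothetical and do not describe Media Crawler's actual extension mechanism.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch of an extension point; not Media Crawler's actual API.
final class AnalyserRegistrySketch {

    // Each analyser declares which file extensions it handles.
    interface ContentAnalyser {
        boolean supports(String extension);

        Map<String, String> analyse(String fileName);
    }

    private final List<ContentAnalyser> analysers = new ArrayList<>();

    // New analysers are added without changing the registry itself.
    void register(ContentAnalyser analyser) {
        analysers.add(analyser);
    }

    // Run every analyser that supports the file and merge the resulting fields.
    Map<String, String> analyse(String fileName) {
        int dot = fileName.lastIndexOf('.');
        String extension = dot >= 0
                ? fileName.substring(dot + 1).toLowerCase(Locale.ROOT)
                : "";
        Map<String, String> merged = new HashMap<>();
        for (ContentAnalyser analyser : analysers) {
            if (analyser.supports(extension)) {
                merged.putAll(analyser.analyse(fileName));
            }
        }
        return merged;
    }

    // Example plug-in: tags common audio extensions with an "audio" schema.
    static ContentAnalyser audioAnalyser() {
        return new ContentAnalyser() {
            @Override
            public boolean supports(String extension) {
                return extension.equals("wav") || extension.equals("mp3");
            }

            @Override
            public Map<String, String> analyse(String fileName) {
                return Collections.singletonMap("schema", "audio");
            }
        };
    }

    public static void main(String[] args) {
        AnalyserRegistrySketch registry = new AnalyserRegistrySketch();
        registry.register(audioAnalyser());
        System.out.println(registry.analyse("set1.WAV"));  // {schema=audio}
        System.out.println(registry.analyse("notes.txt")); // {}
    }
}
```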

The Q150 Data Capture Project has helped the Research Group and the Project Team with:

-improved understanding of the challenges of managing multimedia data in its diverse forms and formats;
-improved understanding of professional music production and the data that is produced from this process;
-improved understanding of the challenges associated with trying to deal with proprietary data formats;
-improved knowledge of methods for extracting metadata from files.

This knowledge is valuable and will help the Research Group manage their multimedia data more effectively.

In addition, the Q150 project and the other two data capture projects provided QUT with an opportunity to establish a system architecture and suitable workflows for publishing research data information to Research Data Australia. The system architecture and workflows are now part of the existing infrastructure that other research groups can use in the future, and they therefore support the broader development of research data management and the publishing of QUT research data.

ANZSRC-FOR code:

410100
410300
420300