Metadata Repository
This is a DRAFT – please send us your comments and suggestions.
The most fundamental component of the ALA infrastructure will be a Metadata Repository. It will document the existence of biodiversity data resources and also provide a flexible, scalable framework for managing information that helps users of all types find relevant information.
The ALA has received additional funding for the period 2008-2010 from the NCRIS Platforms for Collaboration capability’s NeAT programme to assist in the development of its Metadata Repository and Data Annotation Services. See Data Integration and Annotation Services in Biodiversity. The ALA Metadata Repository is intended to provide an environment for developing tools and best practices that may subsequently be applied to other NCRIS capabilities.
The following functional requirements have been defined for the ALA Metadata Repository:
- Storage of metadata documents, including:
  - Support for Dublin Core
  - Support for ISO 11179
  - Support for arbitrary RDF properties
  - Support for tagging metadata with ontology (OWL/OBO) terms
- Metadata documents to describe:
  - Online databases and web services
  - Text documents (including PDF, Word, etc.)
  - Images and other multimedia resources
- Harvesting of metadata from other repositories
  - Harvesting of documents using OAI-PMH
  - Pluggable framework for tagging metadata with terms derived from data content
- Publication of metadata to other tools and repositories
- Data provider interface
  - Register/update/delete data provider
  - Register/update/delete database/service/document/image from data provider
  - Register/update/delete OAI-PMH feed from data provider
  - Accept/reject annotations
- Administrator interface
  - Annotate metadata documents with arbitrary RDF properties and ontology terms
  - Accept/reject annotations
- End-user interface
  - Full-text search
  - Browse by data provider
  - Browse by ontology terms
  - Faceted search via multiple ontologies
  - Propose annotations to metadata documents with arbitrary RDF properties and ontology terms
- Access control
  - Integration with Shibboleth/PKI (to exploit AAF infrastructure and services)
  - AAF authenticated access to data provider, administrator and end-user interfaces
  - AAF-mediated restrictions on visibility for some metadata documents (possible – may not be necessary)
- Wider compatibility (may be ensured by other requirements)
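
To make the storage, annotation and faceted-search requirements above more concrete, the following is a minimal in-memory sketch and not a description of the ALA design. Every class, field and function name (`MetadataRecord`, `Annotation`, `facet_search`, and so on) is hypothetical, the `EX:` ontology term identifiers and the `example.org` property URI are placeholders, and a real repository would use a persistent store and an RDF library. It assumes Python 3.9 or later.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    PROPOSED = "proposed"
    ACCEPTED = "accepted"
    REJECTED = "rejected"


@dataclass
class Annotation:
    """A proposed RDF property/value pair awaiting review (hypothetical model)."""
    predicate: str   # RDF property URI
    value: str       # literal value or term identifier
    proposed_by: str
    status: Status = Status.PROPOSED


@dataclass
class MetadataRecord:
    """Hypothetical record: Dublin Core fields plus open-ended extensions."""
    identifier: str
    provider: str
    dublin_core: dict[str, str] = field(default_factory=dict)     # e.g. title, creator
    rdf_properties: dict[str, str] = field(default_factory=dict)  # arbitrary RDF properties
    ontology_terms: set[str] = field(default_factory=set)         # OWL/OBO term identifiers
    annotations: list[Annotation] = field(default_factory=list)

    def propose(self, predicate: str, value: str, user: str) -> Annotation:
        """End-user step: record a proposal without changing the metadata."""
        annotation = Annotation(predicate, value, user)
        self.annotations.append(annotation)
        return annotation

    def review(self, annotation: Annotation, accept: bool) -> None:
        """Provider/administrator step: accept or reject a proposal."""
        annotation.status = Status.ACCEPTED if accept else Status.REJECTED
        if accept:
            # Only accepted proposals become part of the record.
            self.rdf_properties[annotation.predicate] = annotation.value


def facet_search(records, required_terms):
    """Return the records tagged with every one of the required ontology terms."""
    required = set(required_terms)
    return [r for r in records if required <= r.ontology_terms]


# Usage with placeholder data.
fish = MetadataRecord(
    identifier="r1",
    provider="Example Museum",
    dublin_core={"title": "Fish collection database"},
    ontology_terms={"EX:fish", "EX:marine"},
)
plants = MetadataRecord("r2", "Example Herbarium", ontology_terms={"EX:plant"})

proposal = fish.propose("http://example.org/terms/habitat", "reef", "end-user")
fish.review(proposal, accept=True)
matches = facet_search([fish, plants], ["EX:fish", "EX:marine"])
```

Keeping proposals separate from `rdf_properties` until they are reviewed is one way to model the "propose" and "accept/reject" steps as distinct actions by different roles; other designs are equally possible.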
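
The OAI-PMH harvesting requirement in the list above can be illustrated by showing how a harvester might build a `ListRecords` request. This is a sketch only: the function name `list_records_url` and the base URL are hypothetical, while the `verb`, `metadataPrefix`, `from`, `set` and `resumptionToken` arguments and the `oai_dc` (Dublin Core) prefix come from the OAI-PMH 2.0 protocol.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None,
                     set_spec=None, resumption_token=None):
    """Build an OAI-PMH ListRecords request URL (hypothetical helper)."""
    if resumption_token:
        # resumptionToken is an exclusive argument: it replaces all others
        # when continuing a partially delivered result list.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if from_date:
            params["from"] = from_date   # harvest only records changed since this date
        if set_spec:
            params["set"] = set_spec     # restrict the harvest to one set
    return f"{base_url}?{urlencode(params)}"


# Placeholder endpoint; a registered data provider feed would supply the real one.
first_page = list_records_url("https://provider.example.org/oai",
                              from_date="2008-01-01")
next_page = list_records_url("https://provider.example.org/oai",
                             resumption_token="abc123")
```

A harvester would fetch `first_page`, parse the XML response, and keep requesting pages with the returned resumption token until the provider stops supplying one.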
