Skip to main content

SHARE

  • About SHARE
    • Our Community
  • Projects and Partners
  • News
  • Contact

SHARE News

Rick’s MetaTips | 19 September 2016

Rick’s MetaTips: Digging into the SHARE 2.0 Metadata Schema

Rick Johnson, image courtesy of University of Notre Dame Hesburgh Libraries

At the July 2016 SHARE Community Meeting held in Charlottesville, Virginia, we announced more details about our new metadata schema and related utilities for SHARE, just released in early September. To recap my previous posts on improvements to the SHARE metadata schema, our priorities heading into the redesign of the SHARE schema were:

  1. Increase links among related materials, events, and researchers
  2. Harness partial and additional metadata retrieved from multiple data providers and link across those data providers (i.e., single metadata records in SHARE using data from multiple data providers)
  3. Enable better grouping and filtering of research events
  4. Improve the interoperability of SHARE’s metadata so that it is more consumable by other systems

 

SHARE 2.0 metadata schema on white board

SHARE 2.0 metadata schema on white board. Image courtesy of the Center for Open Science (click to enlarge).

 

These priorities have resulted in the following improvements:

  • Master creative work records supporting multiple data providers
  • Enhanced independent objects in the schema for research contributors, affiliated institutions, awards, funders, venues (e.g., event, publication)
  • Better versioning and provenance tracking across data sources
  • More support for linking elements across entities:
    • People (ORCID IDs)
    • Venues—journals, conference presentations, locations, etc. (geonames, uniform resource identifiers [URIs])
    • Subjects (URIs, subject terms)
    • Institutions (International Standard Name Identifiers [ISNIs], Ringgold IDs)
    • Funders (Funder IDs)
    • Awards (Award IDs)
  • Re-architected ingesting and normalizing pipeline of harvested metadata
  • Redesigned discovery interface for SHARE with enhanced filtering

 

SHARE 2.0 search filters

SHARE 2.0 search interface

 

Working towards this new SHARE release, the team at the Center for Open Science has completed:

  • Implementation of new metadata structure and processing pipeline
  • Ingest and processing of data from our existing 125+ data providers migrated to the new structure
  • Linking of related entities, such as contributor to institution or contributor to work

Within our redesign plan, improvements in progress include:

  • Mapping updated SHARE data elements to common metadata terms (a data dictionary being created by the SHARE curation associates)
  • Updating the Push API (automated push of records to SHARE) and its documentation to exploit improvements to the schema
  • Linking related creative works (such as items by the same author, or supplementary materials)
  • Linking to other versions or derivatives of creative works
  • Developing tools for curation experts (especially SHARE curation associates) to review and enrich the SHARE metadata

While metadata from existing SHARE data providers have all been updated to feed into the new schema, critical steps need to be taken by our data providers to realize our vision for SHARE:

  1. Use more persistent identifiers of objects and related objects (e.g., digital object identifiers, ORCID IDs, ISNIs, Funder IDs, Award IDs)
  2. Use common subject terms (e.g., controlled vocabularies)
  3. Replace text values with URI values (a.k.a. “strings to things”)
  4. Populate more metadata fields or identify other sources for missing metadata

Persistent identifiers and common subject terms (numbers 1 and 2 above) are especially important because, even if all desired metadata is not available from a single data provider, these elements enable harvesting information from other sources and cross-referencing research entities in SHARE. Our curation associates have begun investigating this process at their institutions, and we are currently working through strategies to bring more data providers on board in enhancing their metadata this way. 

We have also heard from data providers that, even though flexibility is good for harvested data, a recommended standard for metadata mappings would be useful, especially via the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). We are currently considering what community standards would be the best fit for SHARE. 

For Further Reference

  • Related presentations on the updated schema and data model for SHARE 
  • Notes from the July 2016 SHARE Community Meeting 
  • For those interested, a more technical entity-relationship diagram showing how links are asserted between people, things, and other things 
  • Additional technical documentation on SHARE models

 

 

 

By Rick Johnson

rick.johnson@nd.edu
Tags Data Curation, metadata, Rick's MetaTips, Curation Associates
  • Related Posts

    • March 1, 2018SHARE v3: Rebooting the Metadata-Harvesting Framework, Metadata-Editing Pipeline

      Jeffrey Spies, SHARE’s co-director and the original architect of both SHARE and the Open Science Framework (OSF), will be ... read more.

    • January 26, 2018Technical Update: January 2018

      The SHARE developers have enhanced SHARE over the past few months, by back-harvesting a variety of metadata providers, and ... read more.

  • Topics

    • Uncategorized (2)
    • Events (37)
    • SHARE News Releases (22)
    • Partners (23)
    • Career Opportunities (5)
    • SHARE Updates (41)
    • What people are saying (16)
    • Presentations (23)
    • Resources (19)
    • Rick’s MetaTips (8)
    • General (11)
  • @SHARE_research

    Tweets by @SHARE_research
  • About SHARE
  • News
  • Contact
Sign up for updates
@SHARE_research

All content is © copyright SHARE and available under a CC-BY 4.0 license.

Association of Research Libraries
21 Dupont Circle NW #800
Washington, DC 20036
202-296-2296
info@www.share-research.org
  • Credits
  • Accessibility
  • Privacy Policy
  • Brand Guidelines
  • Dashboard
This site uses cookies. By clicking 'I understand', you are agreeing to our use of cookies. More Info...
I Understand
Privacy & Cookies Policy
Necessary
Always Enabled