Interim Collection Comparison Report for Big Earth Data Initiative

Metadata Source: CMR Metadata Collections

Metadata Dialect: ISO 19115-2

Evaluation Target: UMM-Collection metadata profile

The Unified Metadata Model Collection (UMM-Collection) profile describes documentation concepts that are considered important for collection-level metadata. The profile includes three documentation levels (Required, Recommended and Optional).

Metadata Collection

 

  • Alaska Satellite Facility (ASF)
  • Crustal Dynamics Data Information System (CDDIS)
  • Global Hydrology Resource Center (GHRC)
  • Goddard Earth Sciences Data and Information Center (GES_DISC)
  • Level 1 and Atmosphere Archive and Distribution System (LAADS)
  • Land, Atmosphere Near real-time Capability for EOS (LANCEMODIS)
  • Land, Atmosphere Near real-time Capability for EOS (LANCEAMSR2)
  • Langley Research Center (LARC)
  • Langley Research Center Atmospheric Science Data Center (LARC_ASDC)
  • Land Processes DAAC - EOS Core System (LPDAAC_ECS)
  • National Snow and Ice Data Center Version 0 (NSIDCV0)
  • National Snow and Ice Data Center EOS Core System (NSIDC_ECS)
  • Ocean Biology Processing Group (OBPG)
  • Oak Ridge National Laboratory (ORNL)
  • Ozone Monitoring Instrument Near Real Time (OMINRT)
  • Physical Oceanography DAAC (PODAAC)
  • Socioeconomic Data and Applications Center (SEDAC)
  • U.S. Geological Survey Earth Resources Observation Systems (USGS_EROS)
  • Australian Antarctic Data Centre (AU_AADC)
  • European Space Agency (ESA)
  • European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT)
  • Indian Space Research Organisation (ISRO)
  • Japan Aerospace Exploration Agency (JAXA)
  • LM_FIRMS
  • NOAA's National Centers for Environmental Information (NCEI)
  • U.S. Geological Survey Long Term Archive (USGS_LTA)

 

Overview

The focus of this page is to characterize the usage of UMM-Collection concepts in CMR metadata collections. These views identify the existence of UMM-Collection concept paths in CMR metadata collections. However, these views do not show UMM-Collection concept paths that are not included in the metadata. The UMM-Collection Metadata Completeness in the CMR page evaluates the completeness of CMR metadata collections with respect to the concept XPaths defined in the UMM-Collection profile.

Scope

We examined over 15,000 metadata records from 18 NASA collections and 8 Other collections extracted from the Common Metadata Repository (CMR) during April 2017. The links below connect to tables in Google Sheets that provide the average number of occurrences per record of UMM-Collection elements in each of these collections. A value of 1 or more typically (although not necessarily) indicates that the element is included one or more times in each record in a collection. A value below 1.0 typically corresponds to the fraction of records in a collection that include the metadata element. Cells with pink backgrounds indicate values of 0, meaning the element is completely missing from the collection.
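As a rough illustration of how such an average could be computed, the sketch below counts matches for an element path in each record and divides by the number of records. It is a minimal sketch under stated assumptions, not the tooling used for this report: the function names, the toy records and the shortened path `.//gmd:keyword` are hypothetical, and real CMR records nest these elements more deeply than the toy records do.

```python
import xml.etree.ElementTree as ET

# Namespace URIs for the prefixes used in the ISO 19115-2 paths on this page.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gmi": "http://www.isotc211.org/2005/gmi",
    "gco": "http://www.isotc211.org/2005/gco",
}

def count_matches(record_xml, path):
    """Number of elements in one record that match the path."""
    return len(ET.fromstring(record_xml).findall(path, NS))

def average_occurrences(records, path):
    """Mean number of matches per record (0.0 for an empty collection)."""
    if not records:
        return 0.0
    return sum(count_matches(r, path) for r in records) / len(records)

def fraction_with_element(records, path):
    """Share of records that contain the element at least once."""
    if not records:
        return 0.0
    return sum(1 for r in records if count_matches(r, path) > 0) / len(records)

def toy_record(n_keywords):
    """Hypothetical, heavily simplified record holding n keyword elements."""
    keywords = "".join(
        f"<gmd:keyword><gco:CharacterString>kw{i}</gco:CharacterString></gmd:keyword>"
        for i in range(n_keywords)
    )
    return (
        f'<gmi:MI_Metadata xmlns:gmi="{NS["gmi"]}" xmlns:gmd="{NS["gmd"]}" '
        f'xmlns:gco="{NS["gco"]}">{keywords}</gmi:MI_Metadata>'
    )

# Four toy records with 2, 1, 0 and 1 keywords.
records = [toy_record(2), toy_record(1), toy_record(0), toy_record(1)]
avg = average_occurrences(records, ".//gmd:keyword")      # 4 matches / 4 records
share = fraction_with_element(records, ".//gmd:keyword")  # 3 of 4 records
```

In this toy collection the average is 1.0 even though one record has no keyword at all, which is why a value of 1 or more does not necessarily mean every record includes the element.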

How are UMM-Collection Required concepts used in the CMR?

Overall, CMR metadata collections are doing a good job of documenting Required UMM-Collection concepts. Table 1 shows the concept and element paths for 13 of the 15 required UMM-Collection concepts. The 13 concepts in this table exist in all CMR metadata collections with the exception of OMINRT. OMINRT is a bit of an outlier because it includes only 1 record, which needs quite a bit of improvement.

 

Table 1 - Required UMM-Collection Concepts included in All CMR metadata collections

| Concept | Element Path | NASA Exists | Other Exists |
| --- | --- | --- | --- |
| Abstract | gmd:identificationInfo/gmd:abstract | 18.00 | 8.00 |
| Data Date | gmd:identificationInfo/gmd:citation/gmd:date/gmd:date/gco:DateTime | 18.00 | 8.00 |
| Instrument Short Name | gmi:acquisitionInformation/gmi:instrument/gmi:identifier/gmd:code | 18.00 | 8.00 |
| Metadata Date | gmd:dateStamp/gco:DateTime | 18.00 | 8.00 |
| Platform Short Name | gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:code | 18.00 | 8.00 |
| Resource Identifier | gmd:identificationInfo/gmd:citation/gmd:identifier/gmd:code | 18.00 | 8.00 |
| Resource Title | gmd:identificationInfo/gmd:citation/gmd:title | 18.00 | 8.00 |
| Responsibility/Party | gmd:contact/gmd:organisationName | 18.00 | 8.00 |
| Responsibility/Party | gmd:identificationInfo/gmd:pointOfContact/gmd:organisationName | 18.00 | 8.00 |
| Science Keyword | gmd:identificationInfo/gmd:descriptiveKeywords/gmd:keyword | 18.00 | 8.00 |
| Temporal Extent | gmd:identificationInfo/gmd:extent/gmd:temporalElement/gmd:extent/gml:TimePeriod/gml:beginPosition | 18.00 | 8.00 |
| Processing Level | gmd:contentInfo/gmd:processingLevelCode/gmd:code | 17.00 | 8.00 |
| Related URL | gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:linkage/gmd:URL | 17.00 | 8.00 |
| Spatial Extent | gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:eastBoundLongitude/gco:Decimal | 17.00 | 8.00 |
| Spatial Extent | gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:northBoundLatitude/gco:Decimal | 17.00 | 8.00 |
| Spatial Extent | gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:southBoundLatitude/gco:Decimal | 17.00 | 8.00 |
| Spatial Extent | gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:westBoundLongitude/gco:Decimal | 17.00 | 8.00 |

Note:  The OMINRT collection is missing Processing Level, Related URL and Spatial Extent.  This collection is an outlier since it includes just 1 record.

 

Table 2 shows the concept and element paths for the 2 required UMM-Collection concepts that are not included in all CMR metadata collections. The Project Name concept exists in 22 of 26 CMR metadata collections and the Resource Version concept exists in 18 of 26 CMR metadata collections. See the UMM-Collection Metadata Completeness in the CMR page to identify the records missing these concepts and for more detailed guidance.

 

Table 2 - Required UMM-Collection Concepts not included in All CMR metadata collections

| Concept | Element Path | NASA Exists | Other Exists |
| --- | --- | --- | --- |
| Project Name | gmi:acquisitionInformation/gmi:operation/gmi:identifier/gmd:code | 14.00 | 8.00 |
| Resource Version | gmd:identificationInfo/gmd:citation/gmd:identifier/gmd:version | 10.00 | 8.00 |
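The "22 of 26" and "18 of 26" figures follow from adding the two Exists columns. The short sketch below works through that arithmetic, assuming the NASA Exists and Other Exists columns give the number of collections in each group (18 NASA, 8 Other) that contain the concept. The variable and function names are illustrative only.

```python
# Group sizes taken from the Scope section of this page.
NASA_TOTAL, OTHER_TOTAL = 18, 8
ALL_COLLECTIONS = NASA_TOTAL + OTHER_TOTAL

# (NASA Exists, Other Exists) counts copied from Table 2.
table2 = {
    "Project Name": (14, 8),
    "Resource Version": (10, 8),
}

def coverage(nasa_exists, other_exists):
    """Return (collections with the concept, collections missing it)."""
    present = nasa_exists + other_exists
    return present, ALL_COLLECTIONS - present

for concept, (nasa, other) in table2.items():
    present, missing = coverage(nasa, other)
    print(f"{concept}: present in {present} of {ALL_COLLECTIONS}, missing from {missing}")
```

Because the Other column shows 8 of 8 for both concepts, every missing collection in this calculation is a NASA collection.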

 

Table 3 provides expanded observations and guidance for improving UMM-Collection metadata.  The focus of this table is the consistency and usefulness of required concept field values.

 

Table 3 - Required UMM-Collection Concepts Metadata Improvement Guidance

Concept: Temporal Extent
Observations:
  • All collections have a beginPosition
  • 19/26 collections have an endPosition
  • 7/26 collections have TimePeriod/gml:endPosition/@indeterminatePosition='now'
Metadata Manager Guidance:
  • If the temporal extent is known, ensure the collection metadata record has a Temporal Element with beginPosition and endPosition elements
  • If the temporal extent is unknown or ongoing, ensure the collection metadata record has a Temporal Element with an endPosition/@indeterminatePosition='now' value

Concept: Processing Level
Observations:
  • Processing Level exists in all collections except OMINRT
  • A Processing Level value of 'Not provided' is used to varying degrees in 6 NASA collections (LANCEMODIS, LAADS, LPDAAC_ECS, LARC, NSIDC_ECS and SEDAC)
Metadata Manager Guidance:
  • In these cases we recommend replacing the 'Not provided' value with @nilReason='unknown'

Concept: Keyword Thesaurus
  (no observations or guidance listed)

Concept: Keyword URL
  (no observations or guidance listed)

Concept: Related Resource
Observations:
  • The AssociationTypeCode element occurs in 7 CMR collections
  • The most common code list value is 'Input Collection'
Metadata Manager Guidance:
  • Recommend changing the AssociationTypeCode value from 'Input Collection' to 'crossReference'
  • See ESD-1139
  • From:  //gmd:MD_AggregateInformation/gmd:associationType/gmd:DS_AssociationTypeCode='Input Collection'
  • To:  //gmd:MD_AggregateInformation/gmd:associationType/gmd:DS_AssociationTypeCode='crossReference'

Concept: Project Name
  (no observations or guidance listed)
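The Related Resource guidance in Table 3 (changing the AssociationTypeCode value from 'Input Collection' to 'crossReference') could be applied to a record along the following lines. This is a hedged sketch, not the procedure described in ESD-1139: the function name and the sample fragment are hypothetical, and updating a `codeListValue` attribute alongside the element text is an assumption about how the records encode the code.

```python
import xml.etree.ElementTree as ET

GMD = "http://www.isotc211.org/2005/gmd"
NS = {"gmd": GMD}

# Same element path as the From/To expressions in Table 3.
CODE_PATH = (
    ".//gmd:MD_AggregateInformation/gmd:associationType/gmd:DS_AssociationTypeCode"
)

def fix_association_type(root, old="Input Collection", new="crossReference"):
    """Rewrite matching association type codes in place; return how many changed."""
    changed = 0
    for code in root.findall(CODE_PATH, NS):
        if (code.text or "").strip() == old:
            code.text = new
            # Assumption: keep codeListValue in step when it carries the old value.
            if code.get("codeListValue") == old:
                code.set("codeListValue", new)
            changed += 1
    return changed

# Hypothetical fragment, trimmed to the elements that matter here.
sample = f"""<gmd:aggregationInfo xmlns:gmd="{GMD}">
  <gmd:MD_AggregateInformation>
    <gmd:associationType>
      <gmd:DS_AssociationTypeCode codeListValue="Input Collection">Input Collection</gmd:DS_AssociationTypeCode>
    </gmd:associationType>
  </gmd:MD_AggregateInformation>
</gmd:aggregationInfo>"""

root = ET.fromstring(sample)
n_changed = fix_association_type(root)
code = root.find(CODE_PATH, NS)
```

Running the function a second time on the same tree changes nothing, so it is safe to apply to records that have already been corrected.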

 

 


The link below shows a Google Sheets comparison view of Required UMM-Collection concept usage across CMR metadata collections. The pink shaded cells near the bottom of the table show required elements that are missing for some of the collections.

Collection Comparison of UMM-Collection Required Elements

Highlights

  • About 87% of required UMM-Collection concepts (13 of 15) are included in all CMR metadata collections with the exception of OMINRT
  • 60% of required UMM-Collection concepts (9 of 15) are complete in all 26 CMR metadata collections.  See UMM-Collection Metadata Completeness in the CMR page.

Improvement Focus Areas

  • The Project Name concept is missing from 4 CMR metadata collections.
  • The Resource Version concept is missing from 8 NASA collections

How are UMM-Collection Recommended concepts used in the CMR?

The link below provides a Google Sheets comparison view of Recommended UMM-Collection concept usage across CMR metadata collections. The pink shaded cells near the bottom of the table indicate recommended elements that are missing for some of the collections.

Collection Comparison of UMM-Collection Recommended Elements

Highlights

  • The Resource Language concept is included in all CMR metadata collections
  • The Spatial Representation concept is included in 96% of CMR metadata collections
  • The Quality Statement concept is included in 80% of CMR metadata collections

Improvement Focus Areas

  • Resource Citation content is missing from the majority of CMR metadata collections
  • Resource Access/Use Constraint content is missing from the majority of CMR metadata collections

How are UMM-Collection Optional concepts used in the CMR?

The link below provides a Google Sheets comparison view of Optional UMM-Collection concept usage across CMR metadata collections. The pink shaded cells near the bottom of the table indicate optional elements that are missing for some of the collections.

Collection Comparison of UMM-Collection Optional Elements

Highlights

  • Sensor Short Name is the most commonly used optional element. It exists in 77% of NASA collections.
  • Additional Attributes are also commonly used optional elements. Additional Attributes for describing Content Information exist in 44% of NASA collections.
  • Additional Attributes also exist in NASA metadata for describing Platform Information, Instrument Information and Quality Information.
