Overview

 

The CMR includes metadata that originate in three dialects: DIF, ECHO, and ISO. The largest portion of CMR collection records are inserted into the CMR in DIF format; these are referred to here as the SciOps collection. The second largest portion, metadata for NASA collections, originate from ECHO and are inserted into the CMR in ECHO format; these are referred to here as the NASA collection. A third group of metadata in the CMR originate from other agencies around the world and are generally in DIF or, possibly, in ISO; these are referred to as the Other collection.

The DIF and ECHO dialects were originally developed to facilitate discovery of collections in the Global Change Master Directory or ECHO. The content of these “discovery” dialects is translated into the ISO dialects that are the eventual target for CMR. This translation is generally done without augmentation, so the content does not change very much.

Metadata providers can choose which metadata dialect(s) they use to submit metadata to the CMR. We compared the SciOps and NASA collections to understand how this choice affects the metadata content.

 

Data Selection

 

The SciOps data collection includes over 13,000 metadata records from nearly 2,000 different sources, most outside of NASA. Over 1,700 of these sources have fewer than ten metadata records. Assuming that metadata creation techniques vary among these providers, this is likely not a homogeneous collection. We identified 25 providers that have collections with one hundred or more records and examined those as separate collections.
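The provider selection described above can be sketched in a few lines of Python. This is only an illustration: the function name and the input shape (one provider name per metadata record) are assumptions, and the small provider in the example is hypothetical. The counts for UCAR and NZ are taken from Table 1.

```python
from collections import Counter


def select_large_providers(record_providers, min_records=100):
    """Return {provider: record_count} for providers with at least min_records records.

    record_providers: iterable with one provider name per metadata record
    (an assumed input shape, not the format of any CMR export).
    """
    counts = Counter(record_providers)
    return {name: n for name, n in counts.items() if n >= min_records}


# Example input: two providers from Table 1 plus one hypothetical small provider.
records = ["UCAR"] * 437 + ["NZ"] * 857 + ["SMALL_PROVIDER"] * 7
large = select_large_providers(records)
print(sorted(large.items()))
```

Providers below the threshold are dropped from the per-provider comparison rather than pooled, which matches the choice to examine only the larger providers as separate collections.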

 

Table 1 below shows the 18 NASA providers and 25 SciOps providers considered in this evaluation, as well as the record count for each provider.

 

Table 1. Collections and record counts

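| Collection | Organization | Count |
| --- | --- | --- |
| ASF | NASA | 161 |
| CDDIS | NASA | 38 |
| GES_DISC | NASA | 1044 |
| GHRC | NASA | 361 |
| LAADS | NASA | 130 |
| LANCEAMSR2 | NASA | 6 |
| LANCEMODIS | NASA | 154 |
| LARC | NASA | 406 |
| LARC_ASDC | NASA | 606 |
| LPDAAC_ECS | NASA | 285 |
| NSIDC_ECS | NASA | 223 |
| NSIDC_V0 | NASA | 784 |
| OB_DAAC | NASA | 132 |
| OMNIRT | NASA | 5 |
| ORNL_DAAC | NASA | 1216 |
| PODAAC | NASA | 603 |
| SEDAC | NASA | 202 |
| USGS_EROS | NSIDC | 11 |
| ACADIS | SCIOPS | 393 |
| AR | SCIOPS | 142 |
| BCO-DMO | SCIOPS | 136 |
| CN | SCIOPS | 134 |
| COLUMBIA | SCIOPS | 214 |
| DOE | SCIOPS | 202 |
| DOIUSGSGD | SCIOPS | 128 |
| DOIUSGSPUBS | SCIOPS | 105 |
| DOIUSGSSESC | SCIOPS | 207 |
| IAI-DIS | SCIOPS | 116 |
| ICSU | SCIOPS | 112 |
| IOBIS | SCIOPS | 295 |
| JP | SCIOPS | 112 |
| KR | SCIOPS | 329 |
| LTER | SCIOPS | 177 |
| NSIDC | SCIOPS | 187 |
| NZ | SCIOPS | 857 |
| UCAR | SCIOPS | 437 |
| UEA | SCIOPS | 104 |
| UK | SCIOPS | 33 |
| UMD | SCIOPS | 169 |
| UNEPDEWA | SCIOPS | 373 |
| UNEPROAP | SCIOPS | 162 |
| USAP | SCIOPS | 190 |
| USC | SCIOPS | 151 |
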

SciOps vs. NASA

This analysis compares item usage (elements and attributes) in the 18 NASA collections with the 25 SciOps collections. The evaluation identifies items that exist in collections as well as items that are complete in collections. An item exists in a collection if it is present in at least one metadata record in the collection. An item is complete in a collection if it is present in every metadata record in the collection.
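The two definitions above map directly onto set union and set intersection. The following Python sketch assumes each metadata record has already been reduced to the set of items (xpaths) it contains; the function name and the item names in the example are hypothetical.

```python
def item_usage(records):
    """Return (exists, complete) for one collection.

    records: list of iterables, each holding the items (xpaths) present
    in one metadata record (an assumed input shape).
    exists   = items present in at least one record (union)
    complete = items present in every record (intersection)
    """
    record_sets = [set(r) for r in records]
    if not record_sets:
        return set(), set()
    exists = set().union(*record_sets)
    complete = set.intersection(*record_sets)
    return exists, complete


# Hypothetical collection with three records.
collection = [
    {"item_a", "item_b", "item_c"},
    {"item_a", "item_b"},
    {"item_a"},
]
exists, complete = item_usage(collection)
print(sorted(exists))    # items found in any record
print(sorted(complete))  # items found in all records
```

By construction every complete item also exists, so the complete set is always a subset of the exists set.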

Item usage for NASA collections and SciOps collections is shown below using bubble charts. The bubble chart interpretation graphic provides a schematic for interpreting the bubble plots.

 

Bubble Chart Interpretation

 

NASA Exist vs SciOps Exist

 

Chart 1 below compares item existence in NASA collections with SciOps collections. The X axis shows the number of NASA collections that include an item. The Y axis shows the number of SciOps collections that include an item. The bubble size shows the number of items that share a given pair of X and Y values.

 

The large red bubble in the upper right corner of the plot shows the items (elements and attributes) that exist in all 18 NASA collections and in all 25 SciOps collections. This bubble includes 43 items, as shown in the legend. The large blue bubble in the lower left corner of the plot shows items that exist in 1 NASA collection and in 0 SciOps collections. This bubble includes 44 items, as shown in the legend. The data for Chart 1 is accessible in Google Sheets. To view the data and bubble chart in Google Sheets, click on the image.
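As a rough sketch of how the bubble sizes could be derived, the Python below groups items by the pair (number of NASA collections, number of SciOps collections) and counts the items in each group. The function name, the input dictionaries, and the item names are assumptions for illustration; the actual chart data lives in the Google Sheets document.

```python
from collections import Counter


def bubble_sizes(nasa_counts, sciops_counts):
    """Count items per (NASA collections, SciOps collections) pair.

    nasa_counts / sciops_counts: {item: number of collections in which the
    item exists}. Items missing from a dictionary count as 0 collections.
    """
    items = set(nasa_counts) | set(sciops_counts)
    return Counter(
        (nasa_counts.get(item, 0), sciops_counts.get(item, 0)) for item in items
    )


# Hypothetical items: two exist everywhere, one exists in a single NASA collection.
nasa = {"item_a": 18, "item_b": 18, "item_c": 1}
sciops = {"item_a": 25, "item_b": 25}
sizes = bubble_sizes(nasa, sciops)
print(sizes[(18, 25)], sizes[(1, 0)])
```

Each key of the result is one bubble position and each value is that bubble's size.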

 

Chart 1: NASA Exist vs SciOps Exist

Accessing the Chart

To access the data for this chart, click on the chart graphic above to view an interactive version in Google Sheets. The interactive version identifies the data behind each bubble and includes a lookup table for identifying the ISO items (elements and attributes) associated with the bubble. To access and use the interactive version:

  1. Click on the chart graphic above to view the chart in Google Sheets
  2. Hover over the bubbles with your mouse to identify item existence in NASA collections and item existence in SciOps collections associated with the bubble.  The hover identification also shows the number of items (Counts) associated with each bubble.
  3. To the right of the chart is a lookup table for identifying the xpaths associated with each of the bubbles. To identify the xpaths associated with a bubble, match the NASA Exists and SciOps Exists value pair (from the bubble hover) with the value pairs in the lookup table.
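The matching in step 3 amounts to filtering the lookup table by a value pair. The Python sketch below assumes the lookup table has been exported as rows of (NASA Exists, SciOps Exists, xpath); that row layout, the function name, and the count values are hypothetical. The xpaths are borrowed from Table 2 purely as sample strings.

```python
def xpaths_for_bubble(lookup_rows, nasa_exists, sciops_exists):
    """Return the xpaths whose value pair matches the hovered bubble.

    lookup_rows: iterable of (nasa_exists, sciops_exists, xpath) tuples,
    an assumed export of the lookup table next to the chart.
    """
    return [
        xpath
        for n, s, xpath in lookup_rows
        if (n, s) == (nasa_exists, sciops_exists)
    ]


# Hypothetical rows; the counts are invented for the example.
rows = [
    (18, 25, "gmd:identificationInfo/gmd:extent/gmd:description"),
    (18, 25, "gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode"),
    (1, 0, "gmi:acquisitionInformation/gmi:instrument/gmi:type"),
]
matches = xpaths_for_bubble(rows, 18, 25)
print(matches)
```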

NASA Complete vs SciOps Complete

 

Chart 2 below compares item (elements and attributes) completeness in NASA collections with SciOps collections. The X axis shows the number of NASA collections that are complete with respect to an item. The Y axis shows the number of SciOps collections that are complete with respect to an item. The bubble size shows the number of items that share a given pair of X and Y values.

 

The large red bubble in the upper right corner of the plot shows items that are complete in all 18 NASA collections and in all 25 SciOps collections. This bubble includes 28 items, as shown in the legend. The large blue bubble in the lower left corner of the plot shows the items that are complete in 1 NASA collection and in 0 SciOps collections. This bubble includes 40 items, as shown in the legend. The data for Chart 2 is accessible in Google Sheets. To view the data and bubble chart in Google Sheets, click on the image.

 

Chart 2: NASA Complete vs SciOps Complete

Accessing the Chart

 

To access the data for this chart, click on the chart graphic above to view an interactive version in Google Sheets. The interactive version identifies the data behind each bubble and includes a lookup table for identifying the ISO items associated with the bubble. To access and use the interactive version:

  1. Click on the chart graphic above to view the chart in Google Sheets
  2. Hover over the bubbles with your mouse to identify the number of complete NASA collections and the number of complete SciOps collections associated with the bubble.  The hover identification also shows the number of items (Counts) associated with each bubble.
  3. To the right of the chart is a lookup table for identifying the xpaths associated with each of the bubbles. To identify the xpaths associated with a bubble, match the NASA Provided Complete and SciOps Provided Complete values (from the bubble hover) with the values in the lookup table.

Elements in all NASA and Other Collections

Twenty-seven items (elements and attributes) are in all NASA and all Other collections. See the All NASA and All Other filtered view to identify these items.

Elements in all Other Collections and some NASA Collections

Nineteen items (elements and attributes) are present in all Other collections and in a smaller number of NASA collections. See the All Other and Some NASA filtered view to identify these items.

Ted, please double-check this filter: the table below includes 52 items, not 19.

Table 2. Metadata Elements that are in all Other and in a smaller number of NASA collections

| xPath (simplified)* | NASA collections |
| --- | --- |
| gmd:contentInfo/gmd:processingLevelCode/gmd:code | 94% |
| gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:description | 94% |
| gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:southBoundLatitude/gco:Decimal | 94% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:linkage/gmd:URL | 94% |
| gmd:identificationInfo/gmd:extent/gmd:description | 94% |
| gmd:identificationInfo/gmd:processingLevel/gmd:code | 94% |
| gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:eastBoundLongitude/gco:Decimal | 94% |
| gmd:referenceSystemInfo/gmd:referenceSystemIdentifier/gmd:code | 94% |
| gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:northBoundLatitude/gco:Decimal | 94% |
| gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:westBoundLongitude/gco:Decimal | 89% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:name | 83% |
| gmi:acquisitionInformation/gmi:instrument/gmi:identifier/gmd:description | 78% |
| gmi:acquisitionInformation/gmi:operation/gmi:identifier/gmd:code | 78% |
| gmi:acquisitionInformation/gmi:operation/gmi:description | 72% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:description | 72% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorFormat/gmd:name | 67% |
| gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode | 67% |
| gmd:dataQualityInfo/gmd:report/gmd:measureIdentification/gmd:code | 56% |
| gmd:identificationInfo/gmd:citation/gmd:identifier/gmd:version | 56% |
| gmi:acquisitionInformation/gmi:instrument/gmi:type | 56% |
| gmd:identificationInfo/gmd:extent/gmd:geographicElement/gmd:extentTypeCode/gco:Boolean | 56% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:protocol | 50% |
| gmd:contact/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL | 50% |
| gmd:contact/gmd:contactInfo/gmd:onlineResource/gmd:protocol | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:individualName | 50% |
| gmd:metadataExtensionInfo/gmd:extendedElementInformation/gmd:source/gmd:role/gmd:CI_RoleCode | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:administrativeArea | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:city | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:country | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:deliveryPoint | 50% |
| gmd:metadataExtensionInfo/gmd:extendedElementInformation/gmd:dataType/gmd:MD_DatatypeCode | 50% |
| gmd:metadataExtensionInfo/gmd:extendedElementInformation/gmd:definition | 50% |
| gmd:metadataExtensionInfo/gmd:extendedElementInformation/gmd:domainValue | 50% |
| gmd:metadataExtensionInfo/gmd:extendedElementInformation/gmd:name | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:postalCode | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:onlineResource/gmd:protocol | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL | 50% |
| gmd:identificationInfo/gmd:resourceConstraints/gmd:useLimitation | 50% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:electronicMailAddress | 44% |
| gmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:phone/gmd:voice | 44% |
| gmd:contentInfo/gmd:dimension/gmd:otherPropertyType/gco:RecordType | 44% |
| gmd:distributionInfo/gmd:distributor/gmd:distributionOrderProcess/gmd:fees | 44% |
| gmd:identificationInfo/gmd:topicCategory/gmd:MD_TopicCategoryCode | 44% |
| gmd:contentInfo/gmd:contentType/gmd:MD_CoverageContentTypeCode | 44% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:type/eos:EOS_AdditionalAttributeTypeCode | 44% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:dataType/eos:EOS_AdditionalAttributeDataTypeCode | 44% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:name | 44% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:description | 39% |
| gmd:distributionInfo/gmd:distributor/gmd:distributorFormat/gmd:specification | 33% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:value | 28% |
| gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:identifier/gmd:code | 28% |
| gmd:dataQualityInfo/gmd:report/gmd:evaluationMethodDescription | 0% |
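The percentages in Table 2 appear consistent with the share of the 18 NASA collections that include each item, rounded to the nearest whole percent (for example, 17 of 18 gives 94%). That reading is an assumption, not something the page states, and the function name below is hypothetical. A small Python check:

```python
def percent_of_collections(n_with_item, n_collections=18):
    """Share of collections that include an item, as a whole-number percent.

    n_collections defaults to 18, the number of NASA collections in Table 1.
    Treating the Table 2 values as this ratio is an assumption.
    """
    return round(100 * n_with_item / n_collections)


# Collection counts that would reproduce the distinct values seen in Table 2.
for n in (17, 16, 15, 14, 13, 12, 10, 9, 8, 7, 6, 5, 0):
    print(n, f"{percent_of_collections(n)}%")
```

Under this reading, the 94% rows correspond to items missing from exactly one NASA collection, and the 0% row to an item that no NASA collection includes.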

Elements in some NASA Collections and no Other Collections

 

One hundred and fifteen items (elements and attributes) are present in no Other collections and in some NASA collections. See the No Other and Some NASA filtered view to identify these items.
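The filtered views named in these sections (All NASA and All Other, All Other and Some NASA, No Other and Some NASA) can be thought of as classifying each item's collection count as "all", "some", or "none" on each side. The Python sketch below illustrates that idea; the function names, input dictionaries, item names, and the number of Other collections (3) are all hypothetical, since the page does not state how many Other collections there are.

```python
def usage_class(n_with_item, n_collections):
    """Classify a collection count as 'none', 'some', or 'all'."""
    if n_with_item == 0:
        return "none"
    if n_with_item == n_collections:
        return "all"
    return "some"


def filter_items(nasa_counts, other_counts, n_nasa, n_other, nasa_class, other_class):
    """Return sorted items whose NASA and Other usage classes match the request.

    nasa_counts / other_counts: {item: number of collections that include it};
    a missing item counts as 0 collections.
    """
    items = set(nasa_counts) | set(other_counts)
    return sorted(
        item
        for item in items
        if usage_class(nasa_counts.get(item, 0), n_nasa) == nasa_class
        and usage_class(other_counts.get(item, 0), n_other) == other_class
    )


# Hypothetical counts: 18 NASA collections (Table 1) and an assumed 3 Other collections.
nasa = {"item_a": 18, "item_b": 17, "item_c": 4}
other = {"item_a": 3, "item_b": 3}
print(filter_items(nasa, other, 18, 3, "all", "all"))    # All NASA and All Other
print(filter_items(nasa, other, 18, 3, "some", "all"))   # All Other and Some NASA
print(filter_items(nasa, other, 18, 3, "some", "none"))  # No Other and Some NASA
```

Note that under this strict classification an item in zero NASA collections falls in "none" rather than "some", so whether the 0% row of Table 2 belongs in the All Other and Some NASA view depends on how the spreadsheet filter defines "some".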

Other Filters

 

 

 

 

 

 
