...
/gmi:acquisitionInformation/gmi:instrument/gmi:type
/gmd:identificationInfo/gmd:pointOfContact/gmd:organisationName
/gmd:identificationInfo/gmd:abstract
/gmd:identificationInfo/gmd:descriptiveKeywords/gmd:keyword
/gmd:identificationInfo/gmd:resourceConstraints/gmd:useLimitation
/gmd:contentInfo/gmd:processingLevelCode/gmd:code
/gmd:identificationInfo/gmd:pointOfContact/gmd:individualName
/gmi:acquisitionInformation/gmi:instrument/eos:sensor/eos:type
/gmd:contact/gmd:organisationName
/gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:code
/gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:description
/gmd:identificationInfo/gmd:processingLevel/gmd:code
/gmi:acquisitionInformation/gmi:platform/gmi:description
/gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode
/gmd:identificationInfo/gmd:processingLevel/gmd:code
/gmd:identificationInfo/gmd:citation/gmd:edition
/gmi:acquisitionInformation/gmi:platform/gmi:description
/gmd:identificationInfo/gmd:descriptiveKeywords/gmd:keyword
/gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:description
/gmi:acquisitionInformation/gmi:instrument/gmi:type
/gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:code
/gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode
/gmd:contentInfo/gmd:processingLevelCode/gmd:code
/gmd:identificationInfo/gmd:citation/gmd:identifier/gmd:version
...
Table 2 lists the fields in the NASA group that contain the 'Not provided' flag, together with the fraction of records from each data provider that carry that value. For four fields the flag accounts for more than 50% of the content (Average column). Records with these missing values were excluded from further analysis.
Table 2. Occurrences of missing data ('Not provided') in NASA collections
Paths - Data Provider | Count | CDDIS | GES_DISC | LAADS | LANCEMODIS | LARC | LARC_ASDC | LPDAAC_ECS | NSIDC_ECS | NSIDCV0 | SEDAC | Average |
Number of Records | | 4 | 1044 | 50 | 154 | 305 | 2 | 11 | 19 | 783 | 16 | |
/gmi:acquisitionInformation/gmi:instrument/gmi:type | 10 | 1.00 | 1.00 | 0.78 | 0.49 | 0.83 | 1.00 | 0.91 | 1.00 | 0.75 | 1.00 | 86% |
/gmd:contentInfo/gmd:processingLevelCode/gmd:code | 8 | 0.00 | 0.00 | 0.90 | 0.95 | 0.02 | 0.00 | 0.91 | 1.00 | 1.00 | 1.00 | 64% |
/gmd:identificationInfo/gmd:processingLevel/gmd:code | 8 | 0.00 | 0.00 | 0.90 | 0.95 | 0.02 | 0.00 | 0.91 | 1.00 | 1.00 | 1.00 | 64% |
/gmd:identificationInfo/gmd:abstract | 6 | 0.00 | 0.90 | 0.90 | 0.92 | 1.00 | 1.00 | 0.91 | 0.00 | 0.00 | 0.00 | 63% |
/gmi:acquisitionInformation/gmi:platform/gmi:description | 6 | 0.00 | 0.00 | 0.66 | 0.00 | 0.30 | 1.00 | 0.09 | 0.00 | 1.00 | 0.00 | 34% |
/gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:description | 5 | 0.00 | 0.87 | 0.68 | 0.01 | 0.00 | 0.00 | 0.91 | 0.00 | 1.00 | 0.00 | 39% |
/gmd:identificationInfo/gmd:descriptiveKeywords/gmd:keyword | 4 | 0.00 | 0.00 | 0.66 | 0.00 | 0.30 | 1.00 | 0.00 | 0.00 | 0.53 | 0.00 | 28% |
/gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode | 4 | 0.00 | 0.00 | 0.00 | 0.01 | 0.17 | 0.00 | 0.09 | 0.00 | 0.22 | 0.00 | 5% |
/gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:code | 4 | 0.00 | 0.00 | 0.66 | 0.00 | 0.30 | 1.00 | 0.00 | 0.00 | 0.53 | 0.00 | 28% |
/gmd:identificationInfo/gmd:aggregationInfo/gmd:aggregateDataSetName/gmd:citedResponsibleParty/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL | 4 | 0.00 | 1.00 | 0.00 | 1.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 1.00 | 44% |
/gmi:acquisitionInformation/gmi:instrument/eos:sensor/eos:type | 3 | 0.00 | 0.00 | 0.00 | 0.01 | 0.04 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0% |
/gmd:identificationInfo/gmd:pointOfContact/gmd:individualName | 2 | 0.00 | 0.00 | 0.04 | 0.00 | 0.00 | 0.00 | 0.09 | 0.00 | 0.00 | 0.00 | 1% |
/gmd:identificationInfo/gmd:resourceConstraints/gmd:useLimitation | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 1.00 | 0.00 | 0.00 | 0.00 | 0.00 | 22% |
/gmd:contact/gmd:organisationName | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 | 11% |
/gmd:identificationInfo/gmd:pointOfContact/gmd:organisationName | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 | 11% |
/gmd:distributionInfo/gmd:distributor/gmd:distributorTransferOptions/gmd:onLine/gmd:linkage/gmd:URL | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 1.00 | 0.00 | 0.00 | 11% |
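The per-provider fractions in a table of this kind can be computed with a simple tally. The following is a minimal sketch, not the procedure used in this analysis: it assumes each metadata record has already been parsed into a dictionary with a hypothetical `provider` key and a hypothetical `fields` mapping from path to the list of values found at that path. The function name `missing_fractions` is likewise illustrative.

```python
from collections import Counter, defaultdict

MISSING_FLAG = "Not provided"


def missing_fractions(records):
    """Return {path: {provider: fraction of that provider's records
    whose values at `path` include the missing-data flag}}.

    Assumed (hypothetical) record layout:
        {"provider": "LAADS", "fields": {"/gmd:...": ["value", ...]}}
    """
    totals = Counter()               # provider -> number of records
    flagged = defaultdict(Counter)   # path -> provider -> flagged records
    for record in records:
        provider = record["provider"]
        totals[provider] += 1
        for path, values in record["fields"].items():
            if MISSING_FLAG in values:
                flagged[path][provider] += 1
    # Providers with no flagged record for a path get a fraction of 0.0.
    return {
        path: {p: flagged[path][p] / totals[p] for p in totals}
        for path in flagged
    }
```

Only paths that carry the flag in at least one record appear in the result, which mirrors a table that lists only the affected fields.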
Table 3 lists the fields in the SciOps group that contain the 'Not provided' flag, together with the fraction of SciOps records from each data provider that carry that value. For eight fields the flag accounts for more than 50% of the content (Average column), and for three fields (the two processing level codes and the platform description) the value in every record is 'Not provided'. Records with these missing values were excluded from further analysis.
Table 3. Occurrences of missing data ('Not provided') in SciOps collections
Paths - Data Provider | Count | UNEPDEWA | CN | DOIUSGSPUBS | KR | NZ | UK | IAI-DIS | LTER | DOIUSGSSESC | DOE | UCAR | NSIDC | UNEPROAP | JP | USC | ACADIS | BCO-DMO | ICSU | UEA | UMD | DOIUSGSGD | COLUMBIA | USAP | IOBIS | AR | Average |
/gmd:identificationInfo/gmd:processingLevel/gmd:code | 25 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 100% |
/gmi:acquisitionInformation/gmi:platform/gmi:description | 25 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 100% |
/gmd:contentInfo/gmd:processingLevelCode/gmd:code | 25 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 100% |
/gmd:contentInfo/gmd:dimension/gmd:otherProperty/gco:Record/eos:AdditionalAttributes/eos:AdditionalAttribute/eos:reference/eos:description | 25 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.99 | 1.00 | 0.96 | 1.00 | 1.00 | 0.95 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 0.85 | 1.00 | 1.00 | 99% |
/gmd:identificationInfo/gmd:citation/gmd:edition | 25 | 0.96 | 1.00 | 0.64 | 0.98 | 1.00 | 1.00 | 1.00 | 1.00 | 0.97 | 0.76 | 1.00 | 0.76 | 1.00 | 0.99 | 0.99 | 1.00 | 0.99 | 0.96 | 1.00 | 0.96 | 0.44 | 0.99 | 1.00 | 0.72 | 1.00 | 92% |
/gmd:identificationInfo/gmd:citation/gmd:identifier/gmd:version | 25 | 0.96 | 1.00 | 0.64 | 0.98 | 1.00 | 1.00 | 1.00 | 1.00 | 0.97 | 0.76 | 1.00 | 0.76 | 1.00 | 0.99 | 0.99 | 1.00 | 0.99 | 0.96 | 1.00 | 0.96 | 0.44 | 0.99 | 1.00 | 0.72 | 1.00 | 92% |
/gmd:identificationInfo/gmd:descriptiveKeywords/gmd:keyword | 25 | 0.99 | 0.13 | 0.47 | 0.64 | 0.49 | 0.58 | 0.94 | 0.99 | 0.96 | 0.45 | 0.25 | 0.75 | 0.86 | 0.49 | 0.99 | 1.00 | 0.26 | 0.86 | 0.01 | 0.89 | 0.45 | 0.87 | 0.99 | 0.46 | 0.53 | 65% |
/gmi:acquisitionInformation/gmi:platform/gmi:identifier/gmd:code | 25 | 0.99 | 0.13 | 0.47 | 0.64 | 0.49 | 0.58 | 0.94 | 0.99 | 0.96 | 0.45 | 0.25 | 0.75 | 0.86 | 0.49 | 0.99 | 1.00 | 0.26 | 0.86 | 0.01 | 0.89 | 0.45 | 0.87 | 0.99 | 0.46 | 0.53 | 65% |
/gmd:identificationInfo/gmd:status/gmd:MD_ProgressCode | 25 | 1.00 | 0.57 | 0.07 | 0.87 | 0.01 | 0.94 | 0.09 | 0.01 | 0.00 | 0.59 | 0.01 | 0.28 | 1.00 | 0.89 | 0.01 | 1.00 | 0.71 | 0.03 | 0.99 | 0.82 | 0.05 | 0.07 | 0.01 | 0.19 | 0.20 | 42% |
/gmi:acquisitionInformation/gmi:instrument/gmi:type | 23 | 0.02 | 0.90 | 0.22 | 0.73 | 0.56 | 0.67 | 0.11 | 0.01 | 0.12 | 0.37 | 0.06 | 0.78 | 0.36 | 0.50 | 0.00 | 0.00 | 0.82 | 0.04 | 0.04 | 0.51 | 0.21 | 0.10 | 0.01 | 0.38 | 0.43 | 32% |
This analysis compares item usage (elements and attributes) in the 18 NASA collections with that in the 25 SciOps collections. The evaluation identifies items that exist in a collection as well as items that are complete in a collection. An item exists in a collection if it is present in at least one of the collection's metadata records; it is complete in a collection if it is present in all of them.
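The two definitions translate directly into an "any" test and an "all" test. The sketch below is illustrative only: it assumes each record is represented as a set of the item paths it contains, and the function names `exists_in` and `complete_in` are hypothetical.

```python
def exists_in(collection, item):
    """True if `item` is present in at least one record of the collection."""
    return any(item in record for record in collection)


def complete_in(collection, item):
    """True if `item` is present in every record of the collection.

    An empty collection is treated as not complete (an assumption of this
    sketch), since all() over no records would otherwise return True.
    """
    return bool(collection) and all(item in record for record in collection)


# Made-up collection of two records, each a set of item paths.
collection = [
    {"/gmd:identificationInfo/gmd:abstract", "/gmd:contact/gmd:organisationName"},
    {"/gmd:identificationInfo/gmd:abstract"},
]
abstract_complete = complete_in(collection, "/gmd:identificationInfo/gmd:abstract")
contact_complete = complete_in(collection, "/gmd:contact/gmd:organisationName")
```

In this made-up collection the abstract is both existing and complete, while the contact organisation name exists but is not complete. Every complete item necessarily exists, so the complete items of a non-empty collection are a subset of its existing items.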
...