Interim Collection Comparison Report for Big Earth Data Initiative

Metadata Source: NASA Common Metadata Repository

Metadata Dialect: ISO 19115-2

Evaluation Target: People and Organizations

Ted Habermann – The HDF Group 

Metadata serves an essential function in connecting users to people and organizations to help them access, use, and understand data. The ESDIS Common Metadata Repository (CMR) recognizes the importance of this type of information and includes two related elements in the Unified Metadata Model (UMM) Common Profile: Responsibility and Party. As elements in the Common Profile, these elements are included in all other UMM Profiles.

The Responsibility element broadly defines responsibilities related to data resources using the position of the element in the metadata model hierarchy. The UMM-Common Profile defines five responsibilities: Metadata Contact, Resource Author / Originator, Point of Contact, Distributor, and Processor. Each of these responsibilities can have multiple people or organizations (termed parties) associated with it. A RoleCode that is chosen from the standard ISO Codelist describes details of the roles of those parties.

Understanding usage, completeness and consistency of Responsibilities and Parties in ESDIS metadata is an important first step towards providing consistent and complete services to users of those data. This report provides an initial assessment of these characteristics.

We examined 4180 metadata records from 17 collections extracted from the CMR during March 2015 and 2126 metadata records from 15 collections extracted from the CMR during October 2015 (Collections Examined). For each recommended responsibility we provide a table that gives the average number of occurrences for elements of the associated party in each of these collections. A value of 1 or more typically (although not necessarily) indicates that the element is included one or more times in each record in a collection. A value < 1.0 is typically the percentage of records in a collection that include the metadata element. Empty cells indicate values of 0 – the element is completely missing from the collection.

 

Contact Information in the CMR

Metadata Contact:

ISO path: /gmi:MI_Metadata/gmd:contact/gmd:CI_ResponsibleParty

The Metadata Contact responsibility gives the party that is responsible for creating and maintaining the metadata. The xPath of the source of this information in the DIF and ECHO dialects is given in Table 1. 

Table 1. Sources for Metadata Contact information

Dialect

Source

DIF

/DIF/Personnel[Role=”DIF AUTHOR”]

ECHO

  1. /*/ArchiveCenter
  2. /*/Contacts/Contact[contains(Role,'DIF AUTHOR')]

 

This table shows the frequency of occurrence of elements of the Metadata Contact Responsibility / Parties for 25 collections in the CMR. The data indicates that all of the collections include the name of the organization that is responsible for the metadata (gmd:organisationName) and a roleCode (gmd:role/gmd:CI_RoleCode) for the organization. In general, no contact information is provided for these organizations with the exception of CDDIS.

This difference reflects the source of the information in ECHO. Metadata Contacts that originate as ArchiveCenter have only the organisationName while those that originate as contacts generally have more complete information.

 

Table 2. Occurrences of Metadata Contact Responsibility

 Number of Records161255938103231044193401306154406606128554882237841325121660320211130
ElementElement PathASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA
Administrative Areagmd:contact/gmd:contactInfo/gmd:address/gmd:administrativeArea  .11                      
Citygmd:contact/gmd:contactInfo/gmd:address/gmd:city  .11                      
Countrygmd:contact/gmd:contactInfo/gmd:address/gmd:country  .11                      
Delivery Pointgmd:contact/gmd:contactInfo/gmd:address/gmd:deliveryPoint  .21                      
Emailgmd:contact/gmd:contactInfo/gmd:address/gmd:electronicMailAddress  .11                      
Postal Codegmd:contact/gmd:contactInfo/gmd:address/gmd:postalCode  .11                      
Descriptiongmd:contact/gmd:contactInfo/gmd:onlineResource/gmd:description  .11                      
 URLgmd:contact/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL 1.01.111.041.001.011.001.01.26 .50.75.001.00.041.34 1.63    .08 1.02
Protocolgmd:contact/gmd:contactInfo/gmd:onlineResource/gmd:protocol 1.01.111.041.001.011.001.01.26 .50.75.001.00.041.34 1.63    .08 1.02
Organization Namegmd:contact/gmd:organisationName1.001.011.001.041.001.011.001.01.281.00.501.001.001.001.001.351.001.631.00.801.001.001.001.001.02
Contact Rolegmd:contact/gmd:role/gmd:CI_RoleCode1.001.011.001.041.001.011.001.01.281.00.501.001.001.001.001.351.001.631.00.801.001.001.001.001.02

Feb 2016 Version

Resource Author / Originator:

ISO path: /gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:CI_Citation/gmd:citedResponsibleParty/gmd:CI_ResponsibleParty/

The Resource Author / Originator Responsibility gives the party that is responsible for creating the dataset. This is typically the Principal Investigator for the project or their Institution. The xPath of the source of this information in the DIF and ECHO dialects is given in Table 3.

Table 3. Sources for Resource Author / Originator information.

Dialect

Source

DIF

  1. /DIF/Organization
  2. /DIF/Personnel[contains(Role,'Investigator')]
  3. /DIF/Data_Set_Citation/Dataset_Creator

ECHO

  1. /*/Contacts/Contact[contains(Role,'Data Originator')]
  2. /*/Contacts/Contact[contains(Role,'Producer')]
  3. /*/Contacts/Contact[contains(Role,'Investigator')]
  4. /*/Contacts/Contact[contains(Role,'INVESTIGATOR')]+

 

Table 4 shows the frequency of occurrence of elements of the Resource Author / Originator Responsibility / Parties for 25  collections in the CMR.

 

Table 4. Occurrences of Resource Author / Originator Responsibility

 Number of Records161255938103231044193401306154406606128554882237841325121660320211130
ElementElement PathASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA
Administrative Areagmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:administrativeArea           .16  .82 1.41        
Citygmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:city           .16  .82 1.41        
Countrygmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:country           .16  .82 1.41        
Delivery Pointgmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:deliveryPoint           .16  .82 1.41        
Emailgmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:electronicMailAddress           .16  .82 1.49        
Postal Codegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:address/gmd:postalCode           .16  .82 1.41        
Contact Instructiongmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:contactInstructions           .16  .82 .90        
Hoursgmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:hoursOfService           .12    .71        
Faxgmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:phone/gmd:facsimile           .00    .02        
Phonegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:contactInfo/gmd:phone/gmd:voice           .20  .82 .89        
Individual Namegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:individualName           .16    1.17        
Organizational Namegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:organisationName              .82 .32        
Position Namegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:positionName           .16    .57        
Role Codegmd:identificationInfo/gmd:citation/gmd:citedResponsibleParty/gmd:role/gmd:CI_RoleCode           .16  .82 1.49        

Feb 2016 Version

Point of Contact:

ISO path: /gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:pointOfContact/gmd:CI_ResponsibleParty/

 

The Point of Contact Responsibility is responsible for answering scientific questions about the dataset. Often this is a data manager at the archive that houses the dataset. The xPath of the source of this information in the DIF and ECHO dialects is given in Table 5.

 

Table 5. Sources for Point of Contact information.

Dialect

Source

DIF

  1. /DIF/Organization
  2. /DIF/Personnel[contains(Role,’Technical Contact’)]

ECHO

  1. /*/ArchiveCenter
  2. /*/Contacts/Contact[contains(Role,'TECHNICAL CONTACT')]

 

Table 6 shows the frequency of occurrence of elements of the Point of Contact Responsibility / Parties for 25 collections in the CMR.

This difference reflects the source of the information in ECHO. Metadata Contacts that originate as ArchiveCenter have only the organisationName while those that originate as contacts generally have more complete information.

Table 6. Occurrences of Point of Contact Responsibility

 Number of Records161255938103231044193401306154406606128554882237841325121660320211130
ElementElement PathASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA
Administrative Areagmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:administrativeArea 3.01.21.111.783.202.001.491.24 1.442.35.011.00.072.50 4.26    .23 1.90
Citygmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:city 3.08.211.992.573.232.001.511.24 1.442.35.011.00.072.55 4.57    .23 1.91
Countrygmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:country 3.11.212.012.573.242.001.541.24 1.442.35.011.00.072.58 4.42    .23 1.91
Delivery Pointgmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:deliveryPoint 6.51.424.103.968.332.002.433.72 4.288.33.033.00.283.65 12.64    .23 4.50
Emailgmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:electronicMailAddress 2.97.211.521.393.252.001.642.24 2.102.35.012.00.072.58 4.21    .23 1.93
Postal Codegmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:address/gmd:postalCode 3.05.211.992.573.222.001.511.24 1.442.35.011.00.072.53 4.41    .23 1.91
Faxgmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:phone/gmd:facsimile 2.28 1.992.571.79 .131.22 1.41.35.00 .071.91 3.35      1.08
Phonegmd:identificationInfo/gmd:pointOfContact/gmd:contactInfo/gmd:phone/gmd:voice 2.68 2.002.572.792.00.131.35 1.462.35.011.00.142.31 3.74    .23 1.91
Individual Namegmd:identificationInfo/gmd:pointOfContact/gmd:individualName 3.14.112.062.573.312.001.561.43 2.882.35.012.00.082.82 4.83    .23 1.93
Organizational Namegmd:identificationInfo/gmd:pointOfContact/gmd:organisationName1.003.861.003.713.392.024.003.903.561.003.071.751.002.001.054.201.005.381.00.801.001.001.161.003.82
Role Codegmd:identificationInfo/gmd:pointOfContact/gmd:role/gmd:CI_RoleCode1.007.821.216.366.355.327.006.327.101.007.454.111.014.001.137.191.0910.691.00.801.001.001.391.006.51

Feb 2016 Version


Distribution Contact:

ISO Path: /gmi:MI_Metadata/gmd:distributionInfo/gmd:MD_Distribution/gmd:distributor/gmd:MD_Distributor

/gmd:distributorContact/gmd:CI_ResponsibleParty/

 

The Distribution Contact Responsibility contains the party responsible for answering questions about the distribution of the dataset. This is typically the Archive Center for the dataset. The xPath of the source of this information in the DIF and ECHO dialects is given in Table 7. 

 

Table 7. Sources for Distribution Contact Responsibility information.

Dialect

Source

DIF

  1. /DIF/Data_Center/Data_Center_Name

ECHO

  1. /*/ArchiveCenter
  2. /*/Contacts/Contact[contains(Role,'Archive')]
  3. /*/Contacts/Contact[contains(Role,’DATA CENTER CONTACT’)]
  4. /*/Contacts/Contact[contains(Role,'Distributor')]
  5. /*/Contacts/Contact[contains(Role,'User Services')]
  6. /*/Contacts/Contact[contains(Role,'GHRC USER SERVICES')]
  7. /*/Contacts/Contact[contains(Role,'ORNL DAAC User Services')]

 

Table 8 shows the frequency of occurrence of elements of the Distribution Contact Responsibility / Parties for 25 collections in the CMR.

 

Table 8. Occurrences of the Distributor Contact Responsibility

 Number of Records161255938103231044193401306154406606128554882237841325121660320211130
ElementElement PathASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA
Administrative Areagmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:administrativeArea  .89        .361.00 .96 .91   1.00 .921.00 
Citygmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:city  .89        .361.00 .96 .91   1.00 .921.00 
Countrygmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:country  .89        .361.00 .96 .91   1.00 .921.00 
Delivery Pointgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:deliveryPoint  .89        .361.00 .96 .91   1.00 .921.00 
Emailgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:electronicMailAddress1.00 .89      1.00 .361.00 .96 .91 1.00 1.001.09.921.00 
Postal Codegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:address/gmd:postalCode  .89        .361.00 .96 .91   1.00 .921.00 
URLgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL 1.01 1.041.00 1.001.01.72 .50    1.34 1.63    .08 1.02
Protocolgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:onlineResource/gmd:protocol 1.01 1.041.00 1.001.01.72 .50    1.34 1.63    .08 1.02
Phonegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:phone/gmd:voice1.00 1.79        .361.00 .96 .91   2.001.091.844.00 
Individual Namegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:individualName  .89        .11    .01   1.001.09.92  
Organization Namegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:organisationName 1.01.891.041.00 1.001.01.72 .50.251.99 .961.35.911.631.00.80 1.091.001.001.02
Role Codegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:role/gmd:CI_RoleCode1.001.01.891.041.00 1.001.01.721.00.50.361.99 .961.35.911.631.00.801.001.091.001.001.02
Contact Instructionsgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:contactInstructions           .36  .96 .91      1.00 
Hoursgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:hoursOfService           .36  .96 .91     .921.00 
Faxgmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:contactInfo/gmd:phone/gmd:facsimile           .36  .96 .91        
Position Namegmd:distributionInfo/gmd:distributor/gmd:distributorContact/gmd:positionName           .11    .01    1.09   

Feb 2016 Version

 

Processor

ISO path: /gmi:MI_Metadata/gmd:dataQualityInfo/gmd:DQ_DataQuality/gmd:lineage/gmd:LI_Lineage

/gmd:processStep/gmd:LI_ProcessStep/gmd:processor

 

The Processor Responsibility gives the party that is responsible for processing the dataset. It is included in the lineage metadata.  The xPath of the source of this information in the DIF and ECHO dialects is given in Table 9. 

 

Table 9. Sources for Processor Responsibility information.

Dialect

Source

DIF

  1. TBD

ECHO

  1. /*/ProcessingCenter

 

Table 10 shows the frequency of occurrence of elements of the Processor Responsibility / Parties for 25 collections in the CMR.

 

Table 10. Occurrences of Processor Responsibility

 Number of Records161255938103231044193401306154406606128554882237841325121660320211130
ElementElement PathASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA
Organization Namegmd:dataQualityInfo/gmd:lineage/gmd:processStep/gmd:processor/gmd:organisationName.98       .72 .50.25  .96 .91   1.001.00.921.00 
Role Codegmd:dataQualityInfo/gmd:lineage/gmd:processStep/gmd:processor/gmd:role/gmd:CI_RoleCode.98       .72 .50.25  .96 .91   1.001.00.921.00 
URLgmd:dataQualityInfo/gmd:lineage/gmd:processStep/gmd:processor/gmd:contactInfo/gmd:onlineResource/gmd:linkage/gmd:URL        .70 .50              
Protocolgmd:dataQualityInfo/gmd:lineage/gmd:processStep/gmd:processor/gmd:contactInfo/gmd:onlineResource/gmd:protocol        .70 .50              

Feb 2016 Version

E-Mail Addresses

The data shown above clearly indicates that the completeness of contact information varies significantly across collections and responsibilities. The standards all include extensive physical contact information, e.g. cities, addresses, and postal codes. This reflects the prevalence of physical mail delivery when these standards were created. Now electronic delivery dominates, so e-mail addresses are more likely to be helpful than physical addresses. We examined the occurrence of e-mail addresses for responsibilities in all 15 collections. Table 12 gives the results of this analysis.

 

Table 12. Completeness of contact email addresses

 161255938103231044193401306154406606128554882237841325121660320211130
ElementASFAU_AADCCDDISESAEUMETSATGES_DISCISROJAXALAADSLANCEAMSR2LANCEMODISLARCLARC_ASDCLM_FIRMSLPDAAC_ECSNOAA_NCEINSIDC_ECSNSIDCV0OB_DAACOMINRTORNL_DAACPODAACSEDACUSGS_EROSUSGS_LTA

Metadata Contact

  .11                      

Distribution Contact

1.00 .89      1.00 .361.00 .96 .91 1.00 1.001.09.921.00 

Point of Contact

 2.97.211.521.393.252.001.642.24 2.102.35.012.00.072.58 4.21    .23 1.93

Resource Author / Originator

           .16  .82 1.49        
Processor                         

Feb 2016 Version