
Element Description

The Science Keywords element allows relevant Earth science keywords to be associated with a dataset to better enable data search and discovery. The Science Keywords are chosen from a controlled keyword hierarchy maintained in the Keyword Management System (KMS). A list of valid Science Keywords can be found here: https://gcmd.earthdata.nasa.gov/kms/concepts/concept_scheme/sciencekeywords?format=csv

(It is important to note that the "EARTH SCIENCE SERVICES" keywords included at the top of the list in the csv file should not be used as Science Keywords. Valid Science Keywords start with the "EARTH SCIENCE" Category). 
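
As a rough illustration of the note above, the following Python sketch reads KMS CSV text and keeps only the keyword paths whose Category is "EARTH SCIENCE". The column names (Category, Topic, Term, Variable_Level_1, and so on) and the presence of a leading version line are assumptions about the CSV layout, and the function name load_science_keywords is hypothetical; check them against the file you actually download.

```python
import csv
import io

# Assumed column names for the KMS-controlled levels of the CSV file.
HIERARCHY_COLUMNS = [
    "Category", "Topic", "Term",
    "Variable_Level_1", "Variable_Level_2", "Variable_Level_3",
]


def load_science_keywords(csv_text):
    """Return the set of valid keyword paths (tuples) found in KMS CSV text."""
    lines = csv_text.splitlines()
    # Assumption: the header row is the first row that starts with "Category";
    # anything before it (such as a version line) is skipped.
    start = next(
        i for i, line in enumerate(lines)
        if line.startswith(("Category", '"Category"'))
    )
    reader = csv.DictReader(io.StringIO("\n".join(lines[start:])))
    paths = set()
    for row in reader:
        levels = [(row.get(col) or "").strip().upper() for col in HIERARCHY_COLUMNS]
        # Drop trailing empty levels so a Term-level row becomes a 3-tuple.
        while levels and not levels[-1]:
            levels.pop()
        # "EARTH SCIENCE SERVICES" rows are not valid Science Keywords.
        if not levels or levels[0] != "EARTH SCIENCE":
            continue
        paths.add(tuple(levels))
    return paths
```

The returned tuples can then be used as a lookup set when checking whether a keyword hierarchy in a metadata record exists in the KMS.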

Best Practices

Science keywords are important for the precise search and retrieval of data, and should accurately represent the dataset being described. As a rule of thumb, science keywords should represent the scientific parameters being provided in the data as well as any broader conceptual terms that may aid in describing the data. At a minimum, one science keyword hierarchy must be provided, and this hierarchy must go down to the 'Term' level of detail. The 'Detailed Variable' element is the only science keyword element that is not controlled by the KMS. The 'Detailed Variable' keyword should only be used if there is a very specific parameter provided in the data which is not adequately described by keywords in the KMS. If a particular science keyword is missing from the KMS, it is possible to put in a request to have it added. The KMS is managed by the Global Change Master Directory (GCMD) and new keyword requests may be made through the GCMD Keywords Community Forum.

All positions in the science keyword hierarchy must be populated until the desired level of detail is reached. Skipping or leaving blank a position in the keyword hierarchy will render the keyword invalid. The only exception to this is the 'Detailed Variable' element; a Detailed Variable keyword may be provided as long as it is preceded by the required Category, Topic, and Term keywords. Science keywords are not case sensitive.  

        

Examples:

ScienceKeywords/Category: "EARTH SCIENCE"

ScienceKeywords/Topic: "ATMOSPHERE"

ScienceKeywords/Term: "CLOUDS"

ScienceKeywords/VariableLevel1: "TROPOSPHERIC/LOW LEVEL CLOUDS (OBSERVED/ANALYZED)"

ScienceKeywords/VariableLevel2: "STRATOCUMULUS"

ScienceKeywords/VariableLevel3: "STRATOCUMULUS CUMILIFORMIS"

ScienceKeywords/DetailedVariable: "STRATOCUMULUS VESPERALIS"


ScienceKeywords/Category: "EARTH SCIENCE"

ScienceKeywords/Topic: "BIOSPHERE"

ScienceKeywords/Term: "ECOLOGICAL DYNAMICS"

ScienceKeywords/VariableLevel1: "FIRE ECOLOGY"

ScienceKeywords/VariableLevel2: "FIRE MODELS"
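
The hierarchy rule described in the best practices above (Category, Topic, and Term are required, no level may be skipped, and Detailed Variable is exempt from the no-gap rule) can be sketched in Python as follows. This is a minimal illustration, not the CMR implementation; the function name hierarchy_errors and the message wording are hypothetical.

```python
# KMS-controlled levels in hierarchy order. DetailedVariable is deliberately
# excluded because it may follow Term directly.
HIERARCHY = [
    "Category", "Topic", "Term",
    "VariableLevel1", "VariableLevel2", "VariableLevel3",
]
REQUIRED = HIERARCHY[:3]


def hierarchy_errors(keyword):
    """Return a list of problems found in one UMM-style science keyword dict."""
    errors = []
    for name in REQUIRED:
        if not (keyword.get(name) or "").strip():
            errors.append(f"{name} is required")
    first_gap = None
    for name in HIERARCHY:
        populated = bool((keyword.get(name) or "").strip())
        if not populated and first_gap is None:
            first_gap = name  # remember the first blank level
        elif populated and first_gap is not None:
            # A deeper level is filled in although a shallower one is blank.
            errors.append(f"{name} is populated but {first_gap} is blank")
    return errors
```

For example, a keyword that provides VariableLevel2 without VariableLevel1 is reported as invalid, while a keyword with only Category, Topic, Term, and DetailedVariable passes.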


Element Specification

At least one Science Keyword is required, and more Science Keywords may be provided (Cardinality: 1..*). For every Science Keyword that is provided, the sub-elements Category, Topic, and Term are required.

Model | Element | Type | Usable Valid Values | Constraints | Required? | Cardinality
UMM-C | ScienceKeywords/Category | String | Science Category Keywords | KMS controlled | Yes | 1
UMM-C | ScienceKeywords/Topic | String | Science Topic Keywords | KMS controlled | Yes | 1
UMM-C | ScienceKeywords/Term | String | Science Term Keywords | KMS controlled | Yes | 1
UMM-C | ScienceKeywords/VariableLevel1 | String | Science Variable_Level_1 Keywords | KMS controlled | No | 0..1
UMM-C | ScienceKeywords/VariableLevel2 | String | Science Variable_Level_2 Keywords | KMS controlled | No | 0..1
UMM-C | ScienceKeywords/VariableLevel3 | String | Science Variable_Level_3 Keywords | KMS controlled | No | 0..1
UMM-C | ScienceKeywords/DetailedVariable | String | | 1 - 80 characters (Uncontrolled/Free-Text) | No | 0..1


Metadata Validation and QA/QC

All metadata entering the CMR goes through the below process to ensure metadata quality requirements are met. All records undergo CMR validation before entering the system. The process of QA/QC is slightly different for NASA and non-NASA data providers. Non-NASA providers include interagency and international data providers and are referred to as the International Directory Network (IDN).

[Flowchart: Metadata Evaluation Workflow]

Please see the sections below for flowchart details.


GCMD Metadata QA/QC

  • Manual Review
    • Identify errors, discrepancies, or omissions.
    • Verify that all pertinent keywords have been applied.
    • Verify that existing facets and other controlled keyword values are consistent and suitable for the data.
  • Automated Review
    • Check that the field has been populated.
    • Check that the field is populated with a valid value from KMS.
    • Check that the field value is not a duplicate.
    • Check that the 'Detailed_Variable' field length is not greater than 80 characters.
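
The automated review checks listed above can be approximated with a short Python sketch. It is only an illustration under stated assumptions: valid_paths stands in for a lookup set built from the KMS (its contents in any example are hypothetical), the function names and message wording are invented here, and comparisons are made case-insensitively because science keywords are not case sensitive.

```python
LEVELS = [
    "Category", "Topic", "Term",
    "VariableLevel1", "VariableLevel2", "VariableLevel3",
]
MAX_DETAILED_VARIABLE_LENGTH = 80


def keyword_path(keyword):
    """Upper-cased tuple of the populated KMS-controlled levels, in order."""
    values = [(keyword.get(level) or "").strip().upper() for level in LEVELS]
    return tuple(value for value in values if value)


def automated_review(keywords, valid_paths):
    """Return findings for a list of UMM-style science keyword dicts."""
    findings = []
    if not keywords:
        findings.append("no science keywords provided")
    seen = set()
    for index, keyword in enumerate(keywords):
        path = keyword_path(keyword)
        detailed = (keyword.get("DetailedVariable") or "").strip()
        # Valid value check: the controlled part must exist in the KMS.
        if path not in valid_paths:
            findings.append(f"keyword {index}: not a valid KMS value")
        # Duplicate check: same controlled path and same Detailed Variable.
        identity = (path, detailed.upper())
        if identity in seen:
            findings.append(f"keyword {index}: duplicate")
        seen.add(identity)
        # Length check on the free-text Detailed Variable.
        if len(detailed) > MAX_DETAILED_VARIABLE_LENGTH:
            findings.append(f"keyword {index}: DetailedVariable exceeds 80 characters")
    return findings
```

Passing an empty list reports the missing element, which corresponds to the "field has been populated" check.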

CMR Validation

  • This element is required and at least 1 science keyword must exist.
  • For every science keyword the sub elements of Category, Topic, and Term must exist.
  • All science keyword sub-elements except for DetailedVariable must be valid according to the keyword management system. Currently the CMR issues a warning if this constraint is violated.

ARC Metadata QA/QC

ARC Priority Matrix

Priority Categorization | Justification

Red = High Priority Finding

This element is categorized as highest priority when:

  • No Science Keywords are provided.
  • A Science Keyword does not align with the KMS.
    • The Science Keyword does not exist in the KMS.
    • A keyword(s) is missing from the keyword hierarchy.
    • A keyword(s) is placed in the incorrect position of the keyword hierarchy (e.g. a Variable Level 2 keyword is placed in the Variable Level 1 field).
  • A Science Keyword provided is not appropriate for the dataset.

Yellow = Medium Priority Finding

This element is categorized as medium priority when:

  • A recommendation is made to add a relevant Science Keyword to the metadata.
  • A recommendation is made to add to an existing keyword in the metadata (i.e. to extend a keyword hierarchy down to a more detailed level).

Blue = Low Priority Finding

Not Applicable

Green = No Findings/Issues

The element is provided and follows all applicable criteria specified in the best practices section above.

ARC Automated Checks

ARC uses the pyQuARC library for automated metadata checks. Please see the pyQuARC GitHub for more information. 

Dialect Mappings

DIF 9 (Note: DIF-9 is being phased out and will no longer be supported after 2018)

DIF 10

Science_Keywords are required. An unlimited number of Science Keywords may be provided (Cardinality: 1..*).

Specification | UMM-C Element | Path | Type | Constraints | Required in DIF 10? | Cardinality | Notes
DIF 10 | ScienceKeywords/Category | /DIF/Science_Keywords/Category | String | KMS controlled | Yes | 1 | The category keyword will always be "EARTH SCIENCE"
DIF 10 | ScienceKeywords/Topic | /DIF/Science_Keywords/Topic | String | KMS controlled | Yes | 1 |
DIF 10 | ScienceKeywords/Term | /DIF/Science_Keywords/Term | String | KMS controlled | Yes | 1 |
DIF 10 | ScienceKeywords/VariableLevel1 | /DIF/Science_Keywords/Variable_Level_1 | String | KMS controlled | No | 0..1 |
DIF 10 | ScienceKeywords/VariableLevel2 | /DIF/Science_Keywords/Variable_Level_2 | String | KMS controlled | No | 0..1 |
DIF 10 | ScienceKeywords/VariableLevel3 | /DIF/Science_Keywords/Variable_Level_3 | String | KMS controlled | No | 0..1 |
DIF 10 | ScienceKeywords/DetailedVariable | /DIF/Science_Keywords/Detailed_Variable | String | 1 - 80 characters (Uncontrolled/Free-Text) | No | 0..1 |


Example Mapping

DIF 10

<Science_Keywords>
  <Category>EARTH SCIENCE</Category>
  <Topic>BIOSPHERE</Topic>
  <Term>VEGETATION</Term>
  <Variable_Level_1>VEGETATION INDEX</Variable_Level_1>
  <Variable_Level_2>NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)</Variable_Level_2>
  <Detailed_Variable>0.9 DENSITY</Detailed_Variable>
</Science_Keywords>
<Science_Keywords>
  <Category>EARTH SCIENCE</Category>
  <Topic>BIOSPHERE</Topic>
  <Term>VEGETATION</Term>
  <Variable_Level_1>EVERGREEN VEGETATION</Variable_Level_1>
</Science_Keywords>

UMM

ScienceKeywords: [
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "VEGETATION INDEX",
    VariableLevel2: "NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)",
    DetailedVariable: "0.9 DENSITY"
  },
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "EVERGREEN VEGETATION"
  }
],
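
The DIF 10 mapping shown above is a straightforward rename of child elements, which the following Python sketch illustrates using the standard library xml.etree.ElementTree module. It is not the CMR translation code; the function names are hypothetical, and namespace prefixes are simply stripped so the sketch works whether or not the record declares a default namespace.

```python
import xml.etree.ElementTree as ET

# DIF 10 child element name -> UMM-C field name, per the table above.
DIF10_TO_UMM = {
    "Category": "Category",
    "Topic": "Topic",
    "Term": "Term",
    "Variable_Level_1": "VariableLevel1",
    "Variable_Level_2": "VariableLevel2",
    "Variable_Level_3": "VariableLevel3",
    "Detailed_Variable": "DetailedVariable",
}


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def dif10_science_keywords_to_umm(xml_text):
    """Return a list of UMM-style dicts, one per Science_Keywords element."""
    root = ET.fromstring(xml_text)
    keywords = []
    for element in root.iter():
        if local_name(element.tag) != "Science_Keywords":
            continue
        keyword = {}
        for child in element:
            umm_name = DIF10_TO_UMM.get(local_name(child.tag))
            text = (child.text or "").strip()
            if umm_name and text:
                keyword[umm_name] = text
        keywords.append(keyword)
    return keywords
```

Because the input must be a single well-formed XML document, the two Science_Keywords elements from the example need to sit inside a parent element such as DIF before being passed to the function.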
ECHO 10

Science Keywords are required. An unlimited number of Science Keywords may be provided (Cardinality: 1..*).

Specification | UMM-C Element | Path | Type | Constraints | Required in ECHO10? | Cardinality | Notes
ECHO 10 | ScienceKeywords/Category | /Collection/ScienceKeywords/ScienceKeyword/CategoryKeyword | String | KMS controlled | Yes | 1 | The category keyword will always be "EARTH SCIENCE"
ECHO 10 | ScienceKeywords/Topic | /Collection/ScienceKeywords/ScienceKeyword/TopicKeyword | String | KMS controlled | Yes | 1 |
ECHO 10 | ScienceKeywords/Term | /Collection/ScienceKeywords/ScienceKeyword/TermKeyword | String | KMS controlled | Yes | 1 |
ECHO 10 | ScienceKeywords/VariableLevel1 | /Collection/ScienceKeywords/ScienceKeyword/VariableLevel1Keyword/Value | String | KMS controlled | No | 0..1 |
ECHO 10 | ScienceKeywords/VariableLevel2 | /Collection/ScienceKeywords/ScienceKeyword/VariableLevel1Keyword/VariableLevel2Keyword/Value | String | KMS controlled | No | 0..1 |
ECHO 10 | ScienceKeywords/VariableLevel3 | /Collection/ScienceKeywords/ScienceKeyword/VariableLevel1Keyword/VariableLevel2Keyword/VariableLevel3Keyword/Value | String | KMS controlled | No | 0..1 |
ECHO 10 | ScienceKeywords/DetailedVariable | /Collection/ScienceKeywords/ScienceKeyword/DetailedVariableKeyword | String | Uncontrolled (Free-Text) | No | 0..1 |


Example Mapping

ECHO 10

<ScienceKeyword>
  <CategoryKeyword>EARTH SCIENCE</CategoryKeyword> 
  <TopicKeyword>BIOSPHERE</TopicKeyword> 
  <TermKeyword>VEGETATION</TermKeyword> 
  <VariableLevel1Keyword>
    <Value>VEGETATION INDEX</Value>
      <VariableLevel2Keyword>
        <Value>NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)</Value>
      </VariableLevel2Keyword>
  </VariableLevel1Keyword>
  <DetailedVariableKeyword>0.9 DENSITY</DetailedVariableKeyword>
</ScienceKeyword>
<ScienceKeyword>
  <CategoryKeyword>EARTH SCIENCE</CategoryKeyword> 
  <TopicKeyword>BIOSPHERE</TopicKeyword> 
  <TermKeyword>VEGETATION</TermKeyword> 
  <VariableLevel1Keyword>
    <Value>EVERGREEN VEGETATION</Value>
  </VariableLevel1Keyword>
</ScienceKeyword>

UMM

ScienceKeywords: [
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "VEGETATION INDEX",
    VariableLevel2: "NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)",
    DetailedVariable: "0.9 DENSITY"
  },
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "EVERGREEN VEGETATION"
  }
],
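
Unlike DIF 10, ECHO 10 nests the variable levels inside one another, so the flattening step is the interesting part of this mapping. The Python sketch below shows one way to do it with xml.etree.ElementTree. It assumes the ECHO 10 record uses no XML namespace, as in the example above, and the function names are hypothetical.

```python
import xml.etree.ElementTree as ET


def text_of(parent, child_name):
    """Return the stripped text of a direct child, or None if absent or empty."""
    if parent is None:
        return None
    text = parent.findtext(child_name)
    text = text.strip() if text else ""
    return text or None


def echo10_science_keyword_to_umm(element):
    """Flatten one ECHO 10 ScienceKeyword element into a UMM-style dict."""
    level1 = element.find("VariableLevel1Keyword")
    level2 = level1.find("VariableLevel2Keyword") if level1 is not None else None
    level3 = level2.find("VariableLevel3Keyword") if level2 is not None else None
    candidates = {
        "Category": text_of(element, "CategoryKeyword"),
        "Topic": text_of(element, "TopicKeyword"),
        "Term": text_of(element, "TermKeyword"),
        "VariableLevel1": text_of(level1, "Value"),
        "VariableLevel2": text_of(level2, "Value"),
        "VariableLevel3": text_of(level3, "Value"),
        "DetailedVariable": text_of(element, "DetailedVariableKeyword"),
    }
    # Keep only the fields that were actually provided.
    return {name: value for name, value in candidates.items() if value}


def echo10_science_keywords_to_umm(xml_text):
    """Convert every ScienceKeyword element found in the given XML text."""
    root = ET.fromstring(xml_text)
    return [echo10_science_keyword_to_umm(e) for e in root.iter("ScienceKeyword")]
```

The explicit "is not None" tests matter here because an ElementTree element without children evaluates as false, so a plain truthiness check could skip a valid element.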



ISO 19115-2 MENDS

Science Keywords are required. An unlimited number of Science Keywords may be provided (Cardinality: 1..*).

Specification: ISO 19115-2 MENDS
UMM-C Element: ScienceKeywords/Category, ScienceKeywords/Topic, ScienceKeywords/Term, ScienceKeywords/VariableLevel1, ScienceKeywords/VariableLevel2, ScienceKeywords/VariableLevel3, ScienceKeywords/DetailedVariable
Path: /gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gco:CharacterString (list each value of the keyword hierarchy delimited by &gt;)
Type: String
Notes: KMS controlled. This is where the entire keyword hierarchy should be listed. Each keyword in the hierarchy must be separated by "&gt;". If any keyword is missing and there exists a keyword later in the hierarchy (such as DetailedVariable), use NONE to fill in the values in between. The CMR will not translate the NONE values; they are only used to place each keyword in its correct position in the hierarchy.

Specification: ISO 19115-2 MENDS
Path: /gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gmd:type/MD_KeywordTypeCode[@codeListValue="theme"]
Type: Codelist
Notes: codeList=https://cdn.earthdata.nasa.gov/iso/resources/Codelist/gmxCodelists.xml#MD_KeywordTypeCode. Select the value "theme" from the codelist. This codelist value does not directly map to a UMM element; choosing "theme" indicates to CMR that the Science Keywords should be mapped.



Example Mapping

ISO 19115-2 MENDS

<gmi:MI_Metadata>
  ...
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:descriptiveKeywords>
        <gmd:MD_Keywords>
          <gmd:keyword>
            <gco:CharacterString>EARTH SCIENCE&gt;BIOSPHERE&gt;VEGETATION&gt;VEGETATION INDEX&gt;NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)&gt;NONE&gt;0.9 DENSITY</gco:CharacterString>
          </gmd:keyword>
          <gmd:keyword>
            <gco:CharacterString>EARTH SCIENCE&gt;BIOSPHERE&gt;VEGETATION&gt;EVERGREEN VEGETATION</gco:CharacterString>
          </gmd:keyword>
          <gmd:type>
            <gmd:MD_KeywordTypeCode codeList="https://cdn.earthdata.nasa.gov/iso/resources/Codelist/gmxCodelists.xml#MD_KeywordTypeCode" codeListValue="theme">theme</gmd:MD_KeywordTypeCode>
          </gmd:type>
        </gmd:MD_Keywords>
      </gmd:descriptiveKeywords>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
  ...

UMM

ScienceKeywords: [
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "VEGETATION INDEX",
    VariableLevel2: "NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)",
    DetailedVariable: "0.9 DENSITY"
  },
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "EVERGREEN VEGETATION"
  }
],
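
In the ISO dialects the whole hierarchy lives in one delimited string, with NONE acting as a position filler. The Python sketch below illustrates splitting such a string into UMM fields and building it back. It assumes the string has already been read through an XML parser, so the "&gt;" entity appears as a plain ">" character, and that no keyword value itself contains ">". The function names are hypothetical.

```python
# Position of each UMM-C field within the delimited ISO keyword string.
ISO_ORDER = [
    "Category", "Topic", "Term",
    "VariableLevel1", "VariableLevel2", "VariableLevel3",
    "DetailedVariable",
]


def iso_keyword_to_umm(keyword_string):
    """Split a '>'-delimited hierarchy into a UMM-style dict, dropping NONE."""
    parts = [part.strip() for part in keyword_string.split(">")]
    return {
        name: value
        for name, value in zip(ISO_ORDER, parts)
        if value and value.upper() != "NONE"
    }


def umm_to_iso_keyword(keyword):
    """Join a UMM-style dict into the delimited form, padding gaps with NONE."""
    values = [(keyword.get(name) or "").strip() for name in ISO_ORDER]
    # Index of the deepest populated position; nothing is emitted beyond it.
    last = max((i for i, value in enumerate(values) if value), default=-1)
    return ">".join(value or "NONE" for value in values[: last + 1])
```

When the string is written back into an XML document with a serializer, the ">" characters are typically escaped again automatically, so the helpers do not need to handle the entity themselves.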



ISO 19115-2 SMAP

Science Keywords are required. An unlimited number of Science Keywords may be provided (Cardinality: 1..*).

Specification: ISO 19115-2 SMAP
UMM-C Element: ScienceKeywords/Category, ScienceKeywords/Topic, ScienceKeywords/Term, ScienceKeywords/VariableLevel1, ScienceKeywords/VariableLevel2, ScienceKeywords/VariableLevel3, ScienceKeywords/DetailedVariable
Path: /gmd:DS_Series/gmd:seriesMetadata/gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gco:CharacterString (list each value of the keyword hierarchy delimited by &gt;)
Type: String
Notes: KMS controlled. This is where the entire keyword hierarchy should be listed. Each keyword in the hierarchy must be separated by "&gt;". If any keyword is missing and there exists a keyword later in the hierarchy (such as DetailedVariable), use NONE to fill in the values in between. The CMR will not translate the NONE values; they are only used to place each keyword in its correct position in the hierarchy.

Specification: ISO 19115-2 SMAP
Path: /gmi:MI_Metadata/gmd:identificationInfo/gmd:MD_DataIdentification/gmd:descriptiveKeywords/gmd:MD_Keywords/gmd:keyword/gmd:type/MD_KeywordTypeCode[@codeListValue="theme"]
Type: Codelist
Notes: codeList=https://cdn.earthdata.nasa.gov/iso/resources/Codelist/gmxCodelists.xml#MD_KeywordTypeCode. Select the value "theme" from the codelist. This codelist value does not directly map to a UMM element; choosing "theme" indicates to CMR that the Science Keywords should be mapped.


Example Mapping

ISO 19115-2 SMAP

...
<gmd:DS_Series>
  <gmd:seriesMetadata>
    <gmi:MI_Metadata>
      ...
      <gmd:identificationInfo>
        <gmd:MD_DataIdentification>
          <gmd:descriptiveKeywords>
            <gmd:MD_Keywords>
              <gmd:keyword>
                <gco:CharacterString>EARTH SCIENCE&gt;BIOSPHERE&gt;VEGETATION&gt;VEGETATION INDEX&gt;NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)&gt;NONE&gt;0.9 DENSITY</gco:CharacterString>
              </gmd:keyword>
              <gmd:keyword>
                <gco:CharacterString>EARTH SCIENCE&gt;BIOSPHERE&gt;VEGETATION&gt;EVERGREEN VEGETATION</gco:CharacterString>
              </gmd:keyword>
              <gmd:type>
                <gmd:MD_KeywordTypeCode codeList="https://cdn.earthdata.nasa.gov/iso/resources/Codelist/gmxCodelists.xml#MD_KeywordTypeCode" codeListValue="theme">theme</gmd:MD_KeywordTypeCode>
              </gmd:type>
            </gmd:MD_Keywords>
          </gmd:descriptiveKeywords>
        </gmd:MD_DataIdentification>
      </gmd:identificationInfo>
      ...

UMM

ScienceKeywords: [
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "VEGETATION INDEX",
    VariableLevel2: "NORMALIZED DIFFERENCE VEGETATION INDEX (NDVI)",
    DetailedVariable: "0.9 DENSITY"
  },
  {
    Category: "EARTH SCIENCE",
    Topic: "BIOSPHERE",
    Term: "VEGETATION",
    VariableLevel1: "EVERGREEN VEGETATION"
  }
],