Page tree
Skip to end of metadata
Go to start of metadata


brandon whitehead


Inconsistency with terms in 'Science Keywords'

There appears to be an inconsistency in what is created/available for each type of file/download.  For example, there appears to be ~1940 science keywords in the downloadable RDF/XML. There appear to be 2000 science keywords in the downloadable XML file as well as querying https://gcmd.earthdata.nasa.gov/kms/concepts/concept_scheme/sciencekeywords directly.  The downloadable CSV for science keywords contains nearly 3300 unique IDs with terms.  


There appears to be terms missing.  Examples for reference include:

YARDANGS

URI: https://gcmd.earthdata.nasa.gov/kms/concept/dabc0fc5-acac-48df-b32e-02c9166e8385

This term shows 'in scheme' science keywords, but not in RDF/XML, the raw XML, or the direct query of science keywords.  This term is available at the URL (above), and is listed in the CSV file.


WATER QUALITY INDICES

URI: https://gcmd.earthdata.nasa.gov/kms/concept/f2130ca3-3587-4312-b6d4-138456b5ea78

Again, this term shows 'in scheme' science keywords, but not in RDF/XML, the raw XML, or the direct query of science keywords.  This term is available at the URL (above), and is listed in the CSV file.


I'm sure there are other examples.  Is this on the GCMD side, or am I missing something?

Thanks in advance.

  • No labels

4 Comments


  1. brandon whitehead

    We looked into this and the reason this is happening is that each of these requests can at most return 2,000 keywords for the XML/RDF versions of this resource.  In order to get the rest of the keywords, you'll need to include a page_num, i.e.,

    https://gcmd.earthdata.nasa.gov/kms/concepts/concept_scheme/sciencekeywords?page_num=1

    https://gcmd.earthdata.nasa.gov/kms/concepts/concept_scheme/sciencekeywords?page_num=2

    The documentation for this is here:

    Keyword Management Service Application Program Interface#2.12ConceptsInConceptSchemeResource

    Thanks

  2. Thanks for looking into it, Tyler Stevens   The max return did occur to me, but I failed to check the documentation (obviously).  Thank you for the reference (very helpful). 


    However, I do think the static downloads should incorporate the entire concept scheme — i.e all the terms/results. 

    The raw XML, RDF/XML and CSV files were all downloaded as static files from: https://gcmd.earthdata.nasa.gov/static/kms/   This is the "inconsistency" to which I was referring in my original request.  Apologies for the lack of clarity.

  3. user-33879

    brandon whitehead Thanks for noticing and reporting this, as I hadn't noticed it either. Tyler Stevens please please please restore a means of getting all the RDF for a concept scheme in one go. It's indeed "very misleading" that the link "Science Keywords [as RDF]" from the static page https://gcmd.earthdata.nasa.gov/static/kms/ now returns only a subset of the keywords, without that page otherwise giving a warning.

    1. We have a KMS API improvement ticket in to our development team to resolve this issue. If you are interested, you can track it at  KMS-217 - Getting issue details... STATUS