This page captures ideas and snippets of information about cloud analytics.

Problem Statements

Asynchronous services and workflows

Traditional data and service endpoints have been fairly static. Archives serve data from a generally predefined set of products that are either fixed or growing over time. Services are developed, published, and expected to remain available for long periods. How can we adapt to quickly provide access to a more fluid pool of data fed by new processing that the cloud makes possible? How can services be extended to include very long-running jobs, such as aggregating results?
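One common way to frame long-running jobs is a submit-then-poll pattern: the service accepts a request, hands back a job identifier immediately, and lets the client ask for status and results later. The sketch below illustrates that idea only; `JobService`, `aggregate_mean`, and the status strings are hypothetical names chosen for this example, and an in-memory thread pool stands in for whatever cloud backend would actually run the work.

```python
import uuid
from concurrent.futures import ThreadPoolExecutor


class JobService:
    """Hypothetical in-memory job service: submit returns at once, results come later."""

    def __init__(self, max_workers=2):
        self._pool = ThreadPoolExecutor(max_workers=max_workers)
        self._jobs = {}  # job id -> Future

    def submit(self, func, *args):
        """Queue the work and return a job id without waiting for completion."""
        job_id = str(uuid.uuid4())
        self._jobs[job_id] = self._pool.submit(func, *args)
        return job_id

    def status(self, job_id):
        """Report a coarse job state (the state names are assumptions)."""
        future = self._jobs[job_id]
        if future.done():
            return "failed" if future.exception() else "succeeded"
        return "running" if future.running() else "accepted"

    def result(self, job_id, timeout=None):
        """Block until the job finishes (or the timeout expires) and return its value."""
        return self._jobs[job_id].result(timeout=timeout)


def aggregate_mean(values):
    """Stand-in for an expensive aggregation over many granules."""
    return sum(values) / len(values)


service = JobService()
job_id = service.submit(aggregate_mean, [1.0, 2.0, 3.0, 6.0])
mean_value = service.result(job_id, timeout=10)
final_status = service.status(job_id)
```

In a real deployment the client would poll `status` over the network rather than block on `result`, and the job table would need to outlive any single process.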

Paradigm Changes

Do scientists have to develop new mental models of how data are processed to make better use of the cloud environment? How much can (or should) the intricacies of distributed data analytics be hidden behind facades?

Assimilate new data

How do you feed new data into tools and workflows that have not been used for that kind of data before? What data formats, metadata, data structures, coordinate representations (time, space, spectral), or ancillary variables are needed?
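As one small illustration of the coordinate question, a workflow could check an incoming dataset's coordinate names against what it expects before trying to use it. This is a minimal sketch under stated assumptions: the required coordinate set and the alias table are invented for the example and are not a proposed standard.

```python
# Assumed requirements for a hypothetical workflow; not taken from any specification.
REQUIRED_COORDINATES = {"time", "latitude", "longitude"}

# Hypothetical aliases that map common short names onto the expected names.
COORDINATE_ALIASES = {"lat": "latitude", "lon": "longitude", "t": "time"}


def normalize_coordinates(names):
    """Lower-case each name and replace known aliases with the expected name."""
    return {COORDINATE_ALIASES.get(name.lower(), name.lower()) for name in names}


def missing_coordinates(names):
    """Return the required coordinates the dataset does not provide, sorted."""
    return sorted(REQUIRED_COORDINATES - normalize_coordinates(names))


missing = missing_coordinates(["Lat", "Lon", "wavelength"])
complete = missing_coordinates(["t", "lat", "lon"])
```

Here `missing` lists the coordinates a spectral dataset without a time axis would still need, while `complete` is empty because every required coordinate is present under an alias.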

Congruent spatio-temporal views

How can we present data from multiple sources so that consumers see a single, uniform view? Views can be pre-built, such as with datacubes, but can also be computed as needed.
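A view computed as needed can be as simple as resampling each source onto a shared axis at request time. The sketch below does this for a time axis with nearest-neighbour selection; the source names, sample values, and function names are all made up for illustration, and a real system would more likely rely on a library such as xarray for reindexing and interpolation.

```python
from bisect import bisect_left


def nearest_sample(times, values, t):
    """Return the value whose timestamp is closest to t (times must be sorted)."""
    i = bisect_left(times, t)
    if i == 0:
        return values[0]
    if i == len(times):
        return values[-1]
    before, after = times[i - 1], times[i]
    # Ties go to the earlier sample.
    return values[i] if (after - t) < (t - before) else values[i - 1]


def uniform_view(sources, common_times):
    """Resample every source onto the same time axis, on demand."""
    return {
        name: [nearest_sample(times, values, t) for t in common_times]
        for name, (times, values) in sources.items()
    }


# Two hypothetical sources sampled at different times.
sources = {
    "sensor_a": ([0, 10, 20, 30], [1.0, 2.0, 3.0, 4.0]),
    "sensor_b": ([0, 15, 30], [10.0, 20.0, 30.0]),
}
view = uniform_view(sources, common_times=[0, 10, 20, 30])
```

After this runs, every entry in `view` has one value per common time step, so a consumer can treat the two sources as if they shared a grid. The same idea extends to spatial and spectral axes, at the cost of choosing a resampling method for each.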

Solution Matrix

This table maps the problem statements above to the building blocks below. It is an experimental presentation that is likely to be superseded by a better way of matching building blocks to problem statements.


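| Building block | Asynchronous services & workflows | Paradigm changes | Assimilate new data | Congruent spatio-temporal views |
| --- | --- | --- | --- | --- |
| WCS 2.0 | | | | |
| WCS 2.1 | | | | |
| WCS-T | | | | |
| OPeNDAP | | | | |
| Open Data Cube | | | | |
| WPS 2.0 | | | | |
| WCPS 1.0 | | | | |
| WPS-T | | | | |
| Common Data Model | | | | |
| Cloud Optimized GeoTIFF | | | | |
| EO JSON | | | | |
| OGC CIS 1.0/1.1 | | | | |
| OGC DGGS | | | | |
| xarray | | | | |
| dask | | | | |
| dask.distributed | | | | |
| daskernetes | | | | |
| PyTables | | | | |
| Jupyter Notebooks | | | | |
| STAC | | | | |
| OpenAPI / Swagger | | | | |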


Building blocks

(Open to suggestions about better categories or names of categories!)

Data Access

Data Processing Services

Data Models & Formats

Data Libraries

Workflow and orchestration

Visualization & Interaction

Metadata & Catalogs

Interoperability Tools

References / Links

Other work

NASA

Other

Tutorials & Articles

Tutorials

Articles

Questions