We won't be using too many of these. Mostly it will look something like this:
acquire data from the EUMETCast Terrestrial push service, or scrape it from the internet using FTP or some form of staged subscription service. The data are primarily NetCDF, or NetCDF in XML wrappers, mostly from EUMETSAT, ESA or NASA
process the data using batched command-line modules strung together with Python, e.g. SNAP (in particular the GPF and SNAPPY) or SeaDAS,
and some of our own algorithms in Python
curate the raw & processed data using whatever database/hierarchy works best
do cool research with the data on water quality, Southern Ocean carbon cycles, fisheries, harmful algal blooms, etc.
serve the data up through application sites, something like
the EO eutrophication monitoring site
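
To make the acquire, process and curate steps above a bit more concrete, here is a rough Python sketch of how they might be strung together. Everything in it is an assumption made for illustration: the function names, the anonymous FTP login, the `level/YYYY/MM/DD` archive layout, and the idea that a SNAP graph file reads its input from the first positional source file passed to the `gpt` command line tool. Treat it as a starting point rather than the way we actually do it.

```python
"""Sketch of the acquire -> process -> curate steps; all names are hypothetical."""
from datetime import date
from ftplib import FTP
from pathlib import Path
import shutil
import subprocess


def fetch_new_files(host: str, remote_dir: str, local_dir: Path) -> list[Path]:
    """Download files from an FTP directory that are not yet held locally."""
    local_dir.mkdir(parents=True, exist_ok=True)
    fetched = []
    with FTP(host) as ftp:
        ftp.login()  # anonymous login (assumption); real services need credentials
        ftp.cwd(remote_dir)
        for name in ftp.nlst():
            target = local_dir / name
            if target.exists():
                continue  # already acquired on an earlier run
            with open(target, "wb") as fh:
                ftp.retrbinary(f"RETR {name}", fh.write)
            fetched.append(target)
    return fetched


def build_gpt_command(graph_xml: Path, source: Path, target: Path) -> list[str]:
    """Build a SNAP gpt call: graph file, -t target file, then the source file."""
    # Assumes the graph takes its input from the first positional source file.
    return ["gpt", str(graph_xml), "-t", str(target), str(source)]


def run_gpt(graph_xml: Path, source: Path, target: Path) -> None:
    """Run one processing step; raises CalledProcessError if gpt fails."""
    subprocess.run(build_gpt_command(graph_xml, source, target), check=True)


def archive_path(root: Path, level: str, sensing_date: date, filename: str) -> Path:
    """Place a product under root/level/YYYY/MM/DD (one possible hierarchy)."""
    return (root / level / f"{sensing_date:%Y}" / f"{sensing_date:%m}"
            / f"{sensing_date:%d}" / filename)


def curate(product: Path, root: Path, level: str, sensing_date: date) -> Path:
    """Move a raw or processed product into its slot in the archive."""
    dest = archive_path(root, level, sensing_date, product.name)
    dest.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(product), str(dest))
    return dest
```

Keeping `build_gpt_command` separate from `run_gpt` means the command can be logged or checked without SNAP installed, and a SeaDAS step could be swapped in by writing a second command builder with the same shape.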