Accessing Raw Data (raw_data_access)

Accessing Raw Data (raw_data_access)#

CrocoDash uses several datasets to setup the model. Data can be gathered directly from public datasources (including the CESM inputdata svn repository) or through helper functions in the CrocoDash Raw Data Access module. The Raw Data Access Module is an expandible, verifyable, object-oriented module with access functions to raw datasets used in CrocoDash. This document explains what it is and what you can do to add more to it, if you would like to!

Specific datasets can either be accessed directly through the data access module or be chosen by adding arguments to the case.configure_forcings function. Check out the demo here.

Users can check if datasets are accessible at this link.

Please see below for available datasets.

Data Product Registry#

Product

Description

Link

tpxo

TPXO (TOPEX/POSEIDON Global Tidal Ocean I think) is a public tidal model dataset

https://www.tpxo.net/global

cesm inputdata

The CESM Input SVN repo holds all files CrocoDash exposes publicly themselves at the following repo link

https://svn-ccsm-inputdata.cgd.ucar.edu/trunk/inputdata/ocn/mom/croc

gebco

GEBCO (General Bathymetric Chart of the Ocean) is a public dataset of global ocean bathymetry

https://www.gebco.net/data_and_products/gridded_bathymetry_data/#global

glofas

GLOFAS (Global Flood Awareness System) is a public river discharge/runoff Product

https://ewds.climate.copernicus.eu/datasets/cems-glofas-historical?tab=download

glorys

GLORYS (Global Ocean Physics Reanalysis) is a public dataset provided through the copernicus marine service.

https://data.marine.copernicus.eu/product/GLOBAL_MULTIYEAR_PHY_001_030/description

mom6_output

CESM had old runs that can be used for IC and OBC

https://gdex.ucar.edu/datasets/d267000/

seawifs

SEAWIFS is a Chlorophyll Dataset for MOM6

https://oceandata.sci.gsfc.nasa.gov/getfile/SEASTAR_SEAWIFS_GAC.19980201_20100228.L3m.MC.CHL.chlor_a.9km.nc

CrocoDash Data Access Module#

CrocoDash has a data_access module for accessing various datasets. Please see below for a table of available methods.

Data Access Registry#

Product

Function

Type

Description

gebco

get_gebco_data_with_python

python

Python request for global bathymetry data

gebco

get_gebco_data_script

script

get script to download global bathymetry data

glofas

get_global_data_with_python

python

Gets glofas raw data through the cdsapi package

glofas

get_processed_global_glofas_script_for_cli

script

Generates bash script for access to CESM Inputdata processed glofas data

glorys

get_glorys_data_from_rda

python

Gathers GLORYS data from RDA on computers with access to glade/rda

glorys

get_glorys_data_from_cds_api

python

Python request with copernicusmarine api

glorys

get_glorys_data_script_for_cli

script

Generates bash script for direct CLI run with the copernicusmarine package

mom6_output

get_mom6_data

python

Gets MOM6 Data from a given path (by default a POP-MARBL run)

seawifs

get_global_seawifs_script_for_cli

script

Generates bash script for direct CLI access to chlorophyll data (No Package Required)

seawifs

get_processed_global_seawifs_script_for_cli

script

Generates bash script for direct CLI access to processed chlorophyll data (No Package Required)

Want to add more?#

Interested in adding your own spin on a dataset? Maybe with alternative metadata? Check our our additional docs: Adding Raw Data Products