Skip to content

Basic Workflow

Basic Workflow

The workflow of EL-PASO consists of four main steps:

1. Download of raw data files
2. Extract data from downloaded files and store it as `el_paso.Variables`
3. Process the extracted variables and compute derived values
4. Save variables in a pre-defined data standard and file format

1. Download of raw data files

Raw data files are usually downloaded from an online repository hosted by an agency. This can be achieved by the internal download routine (download) or by using external tools such as SPEDAS or an API such as HAPI.

The download using the EL-PASO routine can be setup using a few lines of code:

EL-PASO download routine

You can learn more about the EL-PASO download routine in tutorial #1 located in tutorials/1_download_data_and_extracting_variables.ipynb.

Note

The file containing all the code snippets is located at examples/minimal_example.py

2. Extract data from downloaded files and store it as 'Variables'

A Variable in EL-PASO is a custom class which holds a numpy-array as data and metadata (unit, processing notes, etc).

We can turn the downloaded files into variables by calling the EL-PASO extract_variables_from_files routine.

The user has to provide information about under which name or column the variable is storred and its unit:

EL-PASO extract variables routine

You can learn more about the extraction of variables in tutorial #1 located in tutorials/1_download_data_and_extracting_variables.ipynb.

3. Process the extracted variables and compute derived values

The next step will consist of multiple processing steps, such as time binning or applying thresholds:

EL-PASO processing

We can also calculate derived quantities such as adiabatic invariants (mu, K, L*). The calculations use the IRBEM library as a backend and runs parallelized:

EL-PASO derived quantities

You can learn more about processing in tutorials #4 and #5 located in tutorials.

4. Save variables in a pre-defined data standard and file format

The last step saves the variables in a pre-defined format utizing a data standard.

The pre-defined format specifies under which names the variables are saved, if the variables are split up in multiple files, and if daily, monthly, or yearly files are created.

The data standard describes the dimensions, units, and names of the data set. For example, the standard describes that a measurement of the unidirectional differential electron flux has always dimensions of (Time, energy, pitch angle), a unit of 1/(keV str s cm^2), and is called FEDU. In this example, we are applying the PRBEM standard.

EL-PASO saving

You can learn more about different saving file formats and standars in tutorials #3 located in tutorials.