geoarches is a machine learning package for training, running and evaluating ML models on weather and climate data, developed by Guillaume Couairon and Renu Singh in the ARCHES team at INRIA (Paris, France).
geoarches's building blocks can be easily integrated into research ML pipelines. It can also be used to run the ArchesWeather and ArchesWeatherGen weather models.
geoarches is based on pytorch, pytorch-lightning and hydra for configuration. After the package is installed, you can use its modules in python code, but you can also call the main training and evaluating scripts of geoarches.
To develop your own models or make modifications to the existing ones, the intended usage is to write configurations files and pytorch lightning classes in your own working directory. Hydra will then discover your custom configs
folder, and you can point to your custom classes from your custom config files.
Link to a recent presentation of geoarches.
geoarches is meant to jumpstart your ML pipeline with building blocks for data handling, model training, and evaluation. This is an effort to share engineering tools and research knowledge across projects.
Data:
download/
: scripts that parallelize downloads and show how to use chunking to speed up read access.dataloaders/
: PyTorch datasets that read netcdf data and prepare tensors to feed into model training.
Model training:
backbones/
: network architecture that can be plugged into lightning modules.lightning_modules/
: wrapper around backbone modules to handle loss computation, optimizer, etc for training and inference (agnostic to backbone but specific to ML task).
Evaluation:
metrics/
: tested suite of iterative metrics (memory efficient) for deterministic and generative models.evaluation/
: scripts for running metrics over model predictions and plotting.
Pipeline:
main_hydra.py
: script to run training or inference with hydra configuration.documentation/
: quickstart code for training and inference from a notebook.
We use poetry for package dependencies. Use pipx to install poetry:
brew install pipx
pipx install poetry
Create an environment or activate the environment you are already using.
conda create --name weather python=3.11
conda activate weather
Move into the git repo and install dependencies:
cd geoarches
poetry install
Poetry, by default, installs the geoarches package in editable mode. Editable mode allows you to make changes to the geoarches code locally, and these changes will automatically be reflected in your code that depends on it.
We recommend making the following symlinks in the codebase folder:
ln -s /path/to/data/ data # Store data for training and evaluation.
ln -s /path/to/models/ modelstore # Store model checkpoints and model hydra configs.
ln -s /path/to/evaluation/ evalstore # Store intermediate model outputs for computing metrics.
ln -s /path/to/wandb/ wandblogs # Store Wandb logs.
If you want to store models and data in your working directory, or can also simply create regular folders.
Use following the script to download the 4 deterministic models (archesweather-m-...) and generative model (archesweathergen).
src="https://huggingface.co/gcouairon/ArchesWeather/resolve/main"
MODELS="archesweather-m-seed0 archesweather-m-seed1 archesweather-m-skip-seed0 archesweather-m-skip-seed1 archesweathergen"
for MOD in $MODELS; do
mkdir -p modelstore/$MOD/checkpoints
wget -O modelstore/$MOD/checkpoints/checkpoint.ckpt $src/${MOD}_checkpoint.ckpt
wget -O modelstore/$MOD/config.yaml $src/${MOD}_config.yaml
done
You can follow instructions in documentation/archesweather-tutorial.ipynb
to load the models and run inference with them. See documentation/archesweathergen_pipeline.md
to run training.
To compute brier score on ERA5 (needed to instantiate ArchesWeather models for inference or training), you will need to download ERA5 quantiles:
src="https://huggingface.co/gcouairon/ArchesWeather/resolve/main"
wget -O geoarches/stats/era5-quantiles-2016_2022.nc $src/era5-quantiles-2016_2022.nc
Your directory structure after following installation should look like this:
├── geoarches
│ ├── geoarches
│ │ ├── ...
└── your_own_project
├── ...
The recommended way to use the package is to depend on the package inside your own working directory, by importing them in your project code e.g.
from geoarches.dataloaders.era5 import Era5Forecast
ds = Era5Foreacast(path='data/era5_240/full',
load_prev=True,
norm_scheme='pangu')
Making edits directly in the geoarches package will make updates more difficult, but if you prefer this option, you can create a development branch so as to rebase it on future updates of geoarches. (See Contributing section).
We use Hydra to easily configure training experiments. main_hydra.py
is pointed to the configs/
folder which tells geoarches which dataloader, lightning module, backbone, and their arguments to run. You can also override arguments by CLI (see below for useful arguments). Please read Hydra documentation for more information.
To train model named default_run
, you can run
python -m geoarches.main_hydra \
module=archesweather \ # Uses module/archesweather.yaml
dataloader=era5 \ # Uses dataloader/era5.yaml
++name=default_run \ # Name of run, used for Wandb logging and checkpoint dir
This will start a training for the deterministic model ArchesWeather
on ERA5 data.
The model config will be saved to modelstore/default_run/config.yaml
and the model checkpoints will be saved to modelstore/default_run/checkpoints
.
Useful training options are
python -m geoarches.main_hydra \
++log=True \ # log metrics on weights and biases (See Wandb section below.)
++seed=0 \ # set global seed
++cluster.gpus=4 \ # number of gpus used for distributed training
++batch_size=1 \ # batch size per gpu
++max_steps=300000 \ # maximum number of steps for training, but it's good to leave this at 300k for era5 trainings
++save_step_frequency=50000 \ # if you need to save checkpoints at a higher frequency
To run on a SLURM cluster, you can create a configs/cluster
folder inside your working directory and put a custom_slurm.yaml
configuration file in it with custom arguments. Then you call tell geoarches to use this configuration file with
python -m geoarches.submit --config-dir configs cluster=custom_slurm
Find your API key under User settings in your account (https://docs.wandb.ai/support/find_api_key/) and set the Wandb environment variable in your ~/.bashrc
.
export WANDB_API_KEY="..."
Then tell geoaches to log to Wandb.
python -m geoarches.main_hydra \
++log=True \ # log metrics on weights and biases
++cluster.wandb_mode=offline \ # online allows machine with internet connection to log directly to wandb. Otherwise offline mode logs locally and requires a separate step to sync with wandb.
To run evaluation of a model (e.g. ArchesWeather) on the test set (2020), you can run
MODEL=archesweather-m
python -m geoarches.main_hydra ++mode=test ++name=$MODEL
It will automatically load the config file in modelstore/$MODEL
and load the latest checkpoint from modelstore/$MODEL/checkpoints
.
It will then run the metrics relevant for the loaded model (deterministic metrics for deterministic models and similarly for generative models)
Warning: if the provided model does not exist, it will not throw an error.
Useful options for testing:
python -m geoarches.main_hydra ++mode=test ++name=$MODEL \
++ckpt_filename_match=100000 \ # substring that should be present in checkpoint file name, e.g. here for loading the checkpoint at step 100000
++limit_test_batches=0.1 \ # run test on only a fraction of test set for debugging
++module.module.rollout_iterations=10 \ # autoregressive rollout horizon, in which case the line below is also needed
++dataloader.test_args.multistep=10 \ # allow the dataloader to load trajectories of size 10
++dataloader.test_args.
For testing the generative models, you can also use the following options:
++module.inference.num_steps=25 \ # num diffusion steps in generation
++module.inference.num_members=50 \ # num members in ensemble
++module.inference.rollout_iterations=10 \ # number of auto-regressive steps, 10 days by default.
-
Create a
configs/
folder in your own project folder for your hydra configuration files. In this folder, you can put your own configs, e.g. by copying config files from geoarches and modifying them. Please note the config files should be put in the appropriate folder (configs/cluster/
,configs/dataloader/
orconfigs/module/
). You will need a baseconfigs/config.yaml
. See geoarches for an example. -
Add new lightning modules or architectures in your working directory (we recommend putting lightning modules in a
lightning_modules
folder, and pytorch-only backbone architectures in abackbones
folder). To tell hydra to use these modules, you can create a module config filecustom_forecast.yaml
underconfigs/module/
and point to your new backbone and module classes:
module:
_target_: lightning_modules.custom_module.CustomLightningModule
...
backbone:
_target_: backbones.custom_backbone.CustomBackbone
...
You can of course mix and match your custom modules and backbones with the ones in geoarches.
- Training models only requires one to tell hydra to use your
configs
folder with
python -m geoarches.main_hydra --config-dir configs
You can compute model outputs and metrics separately. In that case, you first run evaluation as following:
python -m geoarches.main_hydra ++mode=test ++name=$MODEL \
++module.inference.save_test_outputs=False \
Then, to compute metrics, you can run evaluation/eval_multistep.py
which reads in inference output from xarray files, computes specified metrics, and dumps metrics to output_dir
. Example:
python -m geoarches.evaluation.eval_multistep \
--pred_path evalstore/modelx_predictions/ \
--output_dir evalstore/modelx_predictions/ \
--groundtruth_path data/era5_240/full/ \
--multistep 10 \
--metrics era5_ensemble_metrics --num_workers 2
Before running, you need to make sure the metrics are registered in evaluation/metric_registry.py
using register_metric(). You can find examples in the file. Example:
register_metric(
"era5_ensemble_metrics",
Era5EnsembleMetrics,
save_memory=True,
)
Metrics are registered with a name, class, and any arguments.
You can plot metrics for several models using the script plot.py
. Just specify where the computed metrics are stored (either .nc or .pt files). Example:
python -m geoarches.evaluation.plot --output_dir plots/ \ --metric_paths /evalstore/modelx/...nc /evalstore/modely/...nc --model_names_for_legend ModelX ModelY \ --metrics rankhist --rankhist_prediction_timedeltas 1 7 \ --figsize 10 4 --vars Z500 Q700 T850 U850 V850
If you want to contribute to geoarches, please see the Contributing section.
Many thanks to the authors of WeatherLearn for adapting the Pangu-Weather pseudocode to pytorch. The code for our model is mostly based on their codebase.