Benchmarking read throughput of anemoi-datasets Zarr stores, comparing Zarr 2 vs Zarr 3 across different parallelisation strategies and storage backends.
The suite measures how fast data can be read from Zarr datasets using threads, processes, and PyTorch DataLoader (with DDP). It includes heat tracking to ensure benchmarks read cold (uncached) data.
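The mechanics of such a throughput measurement can be sketched as follows. This is a minimal illustration, not the suite's actual code: the helper name and the temporary file are invented for the example, and a file read immediately after being written is typically still in the OS page cache, which is exactly the effect the suite's heat tracking is designed to detect and avoid.

```python
import os
import tempfile
import time

def timed_read(path, chunk=1 << 20):
    """Read a file sequentially in fixed-size chunks, timing the whole pass."""
    total = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        while block := f.read(chunk):
            total += len(block)
    return total, time.perf_counter() - start

# 16 MiB of incompressible data, so throughput is not inflated by compression
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(os.urandom(16 << 20))
    path = f.name

# First pass: likely still warm, since the file was just written.
# A real cold-read benchmark must evict or bypass the page cache.
first_bytes, first_s = timed_read(path)
# Second pass: almost certainly served from the page cache.
second_bytes, second_s = timed_read(path)
os.remove(path)

for label, n, s in [("1st pass", first_bytes, first_s),
                    ("2nd pass", second_bytes, second_s)]:
    print(f"{label}: {n / s / 1e6:.0f} MB/s")
```

The gap between cached and uncached reads is often an order of magnitude, which is why reporting warm-read numbers as storage throughput is misleading.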
See BENCHMARK.md for detailed results and methodology, and BENCHMARK_TORCH.md for further results using the PyTorch DataLoader.
The following dataset is publicly available and can be used to reproduce the benchmarks:
- era5-o96-1979-2023-6h-v8.zarr — https://data.ecmwf.int/anemoi-datasets/era5-o96-1979-2023-6h-v8.zarr
The higher-resolution datasets (N320, O1280) used in some benchmarks are not yet publicly available.
```shell
./run_test.sh <path-to-dataset.zarr> --mode threads --workers 1-2-4-8-16 -n 16
./run_test.sh <path-to-dataset.zarr> --mode processes --workers 1-2-4-8-16 -n 16
./run_test.sh <path-to-dataset.zarr> --mode torch --workers 1-2-4-8-16 -n 16 -g 4
```

Modes: `threads`, `processes`, `torch`, or `threads-processes`. Results are saved as JSONL files in `logs/`.
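The worker-count sweep behind such a run can be sketched in a few lines. This is a hypothetical illustration of the "threads" mode only, and the JSONL record fields (`mode`, `workers`, `bytes`, `seconds`) are invented for the example, not the suite's real log schema:

```python
import concurrent.futures
import json
import os
import tempfile
import time

CHUNK = 1 << 20  # 1 MiB per read task

def read_chunk(path, offset, size):
    """Read one chunk of the file; returns the number of bytes read."""
    with open(path, "rb") as f:
        f.seek(offset)
        return len(f.read(size))

# Stand-in for a dataset: a 32 MiB file of incompressible data.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(os.urandom(32 * CHUNK))
    path = f.name

records = []
for workers in (1, 2, 4, 8):
    start = time.perf_counter()
    with concurrent.futures.ThreadPoolExecutor(max_workers=workers) as pool:
        sizes = list(pool.map(lambda i: read_chunk(path, i * CHUNK, CHUNK),
                              range(32)))
    elapsed = time.perf_counter() - start
    # One JSONL record per configuration, appended to the run log.
    records.append({"mode": "threads", "workers": workers,
                    "bytes": sum(sizes), "seconds": elapsed})

os.remove(path)
print("\n".join(json.dumps(r) for r in records))
```

One record per line keeps the log appendable across runs, which is what makes globbing `logs/*` into a single plot straightforward.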
Use plot.py to visualise results. It reads the JSONL logs and produces PNG plots. Use -K to filter by dataset path and -o to set the output file:
```shell
./plot.py logs/* -K <path-to-dataset.zarr> -o results.png
```

Other useful options: `-k` for substring filtering (e.g. `-k "S3 | SSD"`), `--torch-only`, `--no-torch`.
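An `"S3 | SSD"` style filter over JSONL records can be sketched as below. This is an invented example of the idea, not plot.py's implementation; the record field names and values are made up for illustration:

```python
import json

# Three fake JSONL log lines with a hypothetical schema.
lines = [
    '{"label": "S3 threads", "workers": 4, "mbps": 210.0}',
    '{"label": "SSD threads", "workers": 4, "mbps": 950.0}',
    '{"label": "SSD torch", "workers": 8, "mbps": 1100.0}',
]
records = [json.loads(line) for line in lines]

def matches(label, pattern):
    """True if the label contains any '|'-separated alternative."""
    return any(alt.strip() in label for alt in pattern.split("|"))

# Keep records whose label mentions either backend.
both = [r for r in records if matches(r["label"], "S3 | SSD")]
# Narrower filter: only the DataLoader run.
torch_only = [r for r in records if matches(r["label"], "torch")]
print(len(both), len(torch_only))
```

Treating the pattern as a set of substring alternatives keeps the CLI simple while still letting one plot overlay several backends.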
Additionally, some simple benchmark tools can be found in `simple_benchmark/*`; no datasets are needed.
```shell
./simple_benchmark/run.sh --path /path/to/directory --chunk-size 1GB
```

Requirements:

- uv (dependencies are managed automatically via `uv run`)