A sequence-to-sequence autoregressive project

This repository aims to provide an efficient, user-friendly, and comprehensive framework for sequence-to-sequence machine learning applications, including but not limited to weather forecasting and time series analysis.

Our primary objective is to address the challenges of training large-scale machine learning models on extensive time-series datasets, for instance training a 70B ViT model on the 721x1440 full-resolution ERA5 dataset (40T = 3G x 100k).

Key Features

1. Parallel Training for Large Models

We support parallel training of large models through several backends, each at a different stage of testing:

- PyTorch-DDP (Tested)
- Huggingface Accelerate-DDP (Tested)
- Huggingface Accelerate-Deepspeed (Testing Required)
- Huggingface Accelerate-FSDP (Testing Required)
- Huggingface Accelerate-Megatron (Testing Required)
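
The DDP backends above rely on data parallelism: each worker computes gradients on its own shard of the batch, and the gradients are averaged before the optimizer step. The NumPy sketch below simulates that averaging in a single process for a linear least-squares model. It only illustrates the idea and is not the repository's training code; the names `local_gradient` and `all_reduce_mean` are hypothetical.

```python
import numpy as np


def local_gradient(w, x, y):
    """Gradient of 0.5 * mean((x @ w - y) ** 2) with respect to w."""
    residual = x @ w - y
    return x.T @ residual / len(y)


def all_reduce_mean(grads):
    """Stand-in for an all-reduce: average the per-worker gradients."""
    return sum(grads) / len(grads)


rng = np.random.default_rng(0)
x = rng.normal(size=(8, 3))
y = rng.normal(size=8)
w = np.zeros(3)

# Split the batch into equally sized shards, one per simulated worker.
num_workers = 4
x_shards = np.split(x, num_workers)
y_shards = np.split(y, num_workers)

worker_grads = [local_gradient(w, xs, ys) for xs, ys in zip(x_shards, y_shards)]
ddp_grad = all_reduce_mean(worker_grads)

# Reference: the gradient computed on the whole batch by a single worker.
full_grad = local_gradient(w, x, y)
```

With equally sized shards, `ddp_grad` matches `full_grad` up to floating-point error, which is why data-parallel training can reproduce single-device updates on the same global batch.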

2. Efficient Data Loading for Large Datasets

This framework supports efficient data loading for large datasets through:

- In-Memory Data Loading (Tested)
  - Runtime Loading (Tested): Saves time during program initiation
  - Shared Dataset (Tested): Conserves memory in distributed mode
- Server Memory Data Loading: Boosted via Ray
  - Shared Dataset (TODO)
  - Asynchronous updating of in-memory datasets (TODO)
  - Runtime updating of datasets and common buffer for efficient data sampling and infinite data size (TODO)
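
One way to read "runtime loading" is lazy loading: samples are read on first access and then kept in memory, so startup does not wait for the whole dataset. The sketch below illustrates that reading only. `LazyInMemoryDataset` and `fake_load` are hypothetical names, and the repository's own dataset classes may work differently.

```python
class LazyInMemoryDataset:
    """Loads each sample on first access and keeps it in memory afterwards."""

    def __init__(self, num_samples, load_fn):
        self.num_samples = num_samples
        self.load_fn = load_fn   # hypothetical loader, e.g. reads one time step from disk
        self._cache = {}         # index -> sample already held in memory
        self.load_count = 0      # number of real loads, kept for inspection

    def __len__(self):
        return self.num_samples

    def __getitem__(self, index):
        if not 0 <= index < self.num_samples:
            raise IndexError(index)
        if index not in self._cache:
            self._cache[index] = self.load_fn(index)
            self.load_count += 1
        return self._cache[index]


def fake_load(index):
    # Stand-in for an expensive disk read.
    return [float(index)] * 4


# Construction is immediate: nothing is read until a sample is requested.
dataset = LazyInMemoryDataset(num_samples=1000, load_fn=fake_load)
first = dataset[10]
again = dataset[10]  # served from memory, no second load
```

The trade-off is that the first epoch pays the loading cost sample by sample instead of all at once during initialization.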

3. Efficient Autoregressive Forward Mode

High-order autoregressive computing. (Tested)

$$
\left(
\begin{array}{cccccc}
X_{t}^{\text{O}} & X_{t+1}^{\text{I}} & X_{t+2}^{\text{II}} & X_{t+3}^{\text{III}} & X_{t+4}^{\text{IV}} & X_{t+5}^{\text{V}} \\
& X_{t+1}^{\text{O}} & X_{t+2}^{\text{I}} & X_{t+3}^{\text{II}} & X_{t+4}^{\text{III}} & X_{t+5}^{\text{IV}} \\
& & X_{t+2}^{\text{O}} & X_{t+3}^{\text{I}} & X_{t+4}^{\text{II}} & X_{t+5}^{\text{III}} \\
& & & X_{t+3}^{\text{O}} & X_{t+4}^{\text{I}} & X_{t+5}^{\text{II}} \\
& & & & X_{t+4}^{\text{O}} & X_{t+5}^{\text{I}} \\
& & & & & X_{t+5}^{\text{O}}
\end{array}
\right)
$$
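
The matrix can be read as a table of rollouts. Assuming the superscript O marks an observed state and the Roman numeral counts how many times the model has been applied, row r starts from the observation at step t+r and column c holds the resulting state at step t+c. A minimal pure-Python sketch of building such a table follows; `rollout_table` and `toy_model` are hypothetical names, not functions from this repository.

```python
def rollout_table(observations, model):
    """Return table[r][c]: the state at step c obtained by applying `model`
    (c - r) times to the observation at step r. Entries with c < r are None."""
    steps = len(observations)
    table = [[None] * steps for _ in range(steps)]
    for r in range(steps):
        state = observations[r]
        table[r][r] = state        # order 0: the observation itself
        for c in range(r + 1, steps):
            state = model(state)   # one more autoregressive step
            table[r][c] = state    # order (c - r) prediction for step c
    return table


def toy_model(x):
    # Hypothetical one-step model: doubles the state.
    return 2 * x


table = rollout_table([1, 3, 4, 8], toy_model)
for row in table:
    print(row)
```

Each column gathers the observation of one time step together with every prediction of that step at different orders, which is the set of quantities a high-order loss could compare.
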

Patch-wise forward pass, with overlapping patches aggregated back into the full field. (Tested)
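
As an illustration of overlap aggregation, the NumPy sketch below cuts a 2D field into overlapping square tiles and averages them back into the full field. It assumes plain averaging in the overlap regions and a stride that tiles the field exactly. The repository may weight or stitch patches differently, and `split_patches` and `aggregate_patches` are hypothetical names.

```python
import numpy as np


def split_patches(field, patch, stride):
    """Cut a 2D field into (row, col, tile) entries with tiles of size patch x patch."""
    h, w = field.shape
    tiles = []
    for i in range(0, h - patch + 1, stride):
        for j in range(0, w - patch + 1, stride):
            tiles.append((i, j, field[i:i + patch, j:j + patch].copy()))
    return tiles


def aggregate_patches(tiles, shape, patch):
    """Average overlapping tiles back into a full field.

    Assumes every cell is covered by at least one tile; otherwise the
    division below would hit a zero count.
    """
    total = np.zeros(shape)
    count = np.zeros(shape)
    for i, j, tile in tiles:
        total[i:i + patch, j:j + patch] += tile
        count[i:i + patch, j:j + patch] += 1
    return total / count


field = np.arange(36, dtype=float).reshape(6, 6)
tiles = split_patches(field, patch=4, stride=2)

# A real pipeline would run the model on each tile here; this sketch passes
# the tiles through unchanged, so aggregation should reproduce the input.
restored = aggregate_patches(tiles, field.shape, patch=4)
```

Passing tiles through unchanged is a convenient sanity check: if `restored` differs from `field`, the split and aggregation steps are not consistent with each other.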

4. Autoregressive Plugins for Enhanced Performance

We offer several autoregressive plugins and tricks to speed up training and improve performance, such as:

- Pseudo-future alignment via high-order loss (MLSE, MASE, and any order) (Tested)
- Jacobian Regularization to limit:
  - Naive computing and gradient modification (Testing Required)
  - Stochastic backward via Hutchinson Method (Testing Required)
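
The Hutchinson method estimates the squared Frobenius norm of a Jacobian from random probes, using the identity E[||J v||^2] = tr(J^T J) for probe vectors v with identity covariance. The sketch below shows the forward estimate only, in NumPy with a finite-difference Jacobian-vector product; an autograd-based product would normally replace it in a PyTorch training loop, and the backward pass is not shown. All function names and the toy model are hypothetical.

```python
import numpy as np


def jvp_finite_difference(f, x, v, eps=1e-5):
    """Approximate the Jacobian-vector product J(x) @ v by central differences."""
    return (f(x + eps * v) - f(x - eps * v)) / (2 * eps)


def hutchinson_jacobian_norm(f, x, num_samples, rng):
    """Estimate ||J(x)||_F^2 as the mean of ||J(x) v||^2 over Rademacher probes v."""
    total = 0.0
    for _ in range(num_samples):
        v = rng.choice([-1.0, 1.0], size=x.shape)  # random +/-1 probe vector
        jv = jvp_finite_difference(f, x, v)
        total += float(jv @ jv)
    return total / num_samples


A = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, -1.0]])


def toy_model(x):
    # Hypothetical model: a tanh nonlinearity on top of a linear map.
    return np.tanh(A @ x)


x0 = np.array([0.1, -0.2, 0.3])
rng = np.random.default_rng(0)
estimate = hutchinson_jacobian_norm(toy_model, x0, num_samples=2000, rng=rng)

# Exact value for comparison: J = diag(1 - tanh(A x)^2) @ A.
J = (1.0 - np.tanh(A @ x0) ** 2)[:, None] * A
exact = float(np.sum(J ** 2))
```

The estimate needs only Jacobian-vector products, so its cost grows with the number of probes rather than with the size of the full Jacobian, which is the appeal over the naive computation for large models.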

5. Various Large Models in PyTorch

Our framework supports various large models in PyTorch including:

- FourCastNet (Tested)
- GraphCast (Tested)
- ViT Model Series (Tested)
- Physics-Constrained Models: Convection Model (Testing Required)
- On-stamp Prediction (Testing Required)
- And much more...

This repository is continually evolving, with new features and models being added. We welcome contributions, suggestions, and feedback to enhance its functionality and user experience.

veya2ztn/Seq2SeqAutoregressiveModel
