This repository is a systematic effort to compare different parameter-efficient fine-tuning (PEFT) methods on the instruction tuning task. We use the Natural Instructions (NI) dataset as the benchmark. The technical report can be found [here](https://arxiv.org/abs/2411.16775).
PEFT method implementations are adapted from `adapter-transformers` and `peft`.
- For bash scripts to run all experiments, refer to the `scripts` folder.
- All experiment-launching scripts follow the format of `scripts/hfai/hp_run.sh`. For example, to run a LoRA experiment in development mode, run:

  ```
  bash scripts/hfai/hp_run.sh lora_peft dev
  ```

- We employ SuperNI as our training and evaluation dataset.
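The way such a dispatch script routes a method name and run mode could be pictured roughly as follows. This is a hypothetical sketch only: the argument handling, mode flag, and batch sizes are illustrative assumptions, not the contents of the actual `hp_run.sh`.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of an hp_run.sh-style dispatcher (illustrative only).
METHOD=${1:-lora_peft}   # e.g. lora_peft; other method names are assumptions
MODE=${2:-dev}           # dev = quick development run

# Pick a per-method batch size (values here are made up for illustration).
case "$METHOD" in
  lora_peft) BATCH_SIZE=8 ;;
  *)         BATCH_SIZE=4 ;;
esac

echo "launching $METHOD in $MODE mode (batch size $BATCH_SIZE)"
```

In this style, adding a new PEFT method only requires a new `case` branch rather than a new launcher script.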
To install the package:

```
conda create -n peft python=3.8
git clone https://github.com/hepengfe/peft-private.git
cd peft-private
git checkout release-v0.4.0-adapter
pip install -e .
```
- rouge-score: `pip install rouge-score`
- Make sure GPT2 is under `cache/saved_pretrained/gpt2` for evaluation.
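Before launching evaluation, it can be worth verifying that the local GPT-2 snapshot is actually in place. A minimal sketch of such a check follows; the required-file list is an assumption about a typical saved GPT-2 directory, not the repository's actual loading logic.

```python
from pathlib import Path

# Files typically present in a locally saved GPT-2 directory; this exact
# list is an assumption -- adjust it to your checkpoint layout.
REQUIRED_FILES = ["config.json", "vocab.json", "merges.txt"]

def gpt2_cache_ready(cache_dir: str) -> bool:
    """Return True if the local GPT-2 snapshot looks complete."""
    d = Path(cache_dir)
    return d.is_dir() and all((d / name).is_file() for name in REQUIRED_FILES)
```

For example, `gpt2_cache_ready("cache/saved_pretrained/gpt2")` should return `True` before evaluation is started.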
The platform we use is the hfai HPC. Each node is equipped with 8x A100 GPUs, and each of our experiments runs on a single node.
This codebase is highly optimized for the hfai platform and supports the following functionalities:
- Experiment Configuration and Submission: The `hp_run.sh` script allows flexible adjustment of the experiment name, batch size, and training framework based on the fine-tuning method. To launch multiple jobs based on `hp_run.sh`, refer to the scripts under the `scripts/hfai` folder.
- Checkpoint Management: Since the platform is preemptible, the codebase saves a checkpoint upon suspension and resumes from the last checkpoint. Each experiment is assumed to run until evaluation on the complete test dataset finishes.
- Training State Validation: When saving a checkpoint, we check the completeness of the training state, including the training random states. If the check fails, the latest checkpoint is deleted and the experiment must be re-run from the second-to-last checkpoint.
- System Messages and Debugging: We emit most system messages with `print` statements because they are better suited to multi-process debugging, and we suppress warnings containing the string `error` to avoid the job being killed.
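The validate-or-fall-back checkpoint behavior described above can be sketched as follows. This is a minimal illustration, not the repository's actual implementation: the function names and the marker-file scheme for detecting an interrupted save are assumptions.

```python
import json
from pathlib import Path

def save_checkpoint(ckpt_dir: Path, step: int, state: dict) -> None:
    """Write a checkpoint, then a marker file proving the write completed."""
    d = ckpt_dir / f"step-{step}"
    d.mkdir(parents=True, exist_ok=True)
    # Stand-in for model/optimizer/RNG state; a real run would save tensors too.
    (d / "state.json").write_text(json.dumps(state))
    (d / "COMPLETE").touch()  # written last: its presence marks a valid checkpoint

def latest_valid_checkpoint(ckpt_dir: Path):
    """Return the newest complete checkpoint, deleting any incomplete latest one."""
    ckpts = sorted(ckpt_dir.glob("step-*"), key=lambda p: int(p.name.split("-")[1]))
    while ckpts:
        last = ckpts.pop()
        if (last / "COMPLETE").exists():
            return last
        # Incomplete (e.g. preempted mid-save): delete it and fall back
        # to the second-to-last checkpoint.
        for f in last.iterdir():
            f.unlink()
        last.rmdir()
    return None
```

If the newest checkpoint was cut off by preemption before the marker was written, resumption silently falls back one checkpoint instead of loading a corrupt state.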
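Suppressing warnings whose text contains `error` can be done with a message filter from the standard `warnings` module. A minimal sketch follows; the exact pattern the repository matches is an assumption.

```python
import warnings

def surviving_warnings() -> list:
    """Emit two demo warnings and return the ones that pass the filter."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        # Drop any warning whose message contains "error", so a log scanner
        # does not mistake the warning for a real failure and kill the job.
        warnings.filterwarnings("ignore", message=r".*error.*")
        warnings.warn("benign CUDA error string inside a warning")  # suppressed
        warnings.warn("harmless progress note")                     # kept
        return [str(w.message) for w in caught]
```

The `message` argument is a regex matched against the start of the warning text, hence the leading `.*`.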
Here are some extra notes for the hfai platform:

- The default NI dataset directory is `../../data` due to hfai compatibility.
- PyTorch and CUDA compatibility: the hfai platform ships CUDA 11.3, while the peft setup requires `torch>=1.13.0`. We therefore use the corresponding CUDA 11.3 build, `torch==1.10.2+cu113`, in the peft setup.
If you find the codebase or my work valuable, please cite:
```
@misc{he2024parameterefficientinstructiontuning,
      title={Parameter Efficient Instruction Tuning: An Empirical Study},
      author={Pengfei He},
      year={2024},
      eprint={2411.16775},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2411.16775},
}
```