Beyond Token Pruning: Operation Pruning in Vision-Language Models

A tuning-free VLM/MLLM inference acceleration framework that searches to prune operations rather than tokens.

🔧 Installation

conda create -n gsop python=3.10 -y
conda activate gsop

cd lmms-eval
pip install -e .

cd ../LLaVA
pip install -e .

pip install easydict

For additional setup instructions, please refer to:

🚀 Usage

Inference

bash scripts/gsop_inference.sh

Search

bash scripts/gsop_search.sh

Some benchmarks (e.g., TextVQA) may produce results that differ from commonly reported metrics when run on lmms-eval. Please follow the evaluation setup detailed in Evaluation.md for those benchmarks.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
LLaVA		LLaVA
lmms-eval		lmms-eval
scripts		scripts
search_cfgs		search_cfgs
search_res/cls_att_g12o120_gqa_500q_llh_s1_step15		search_res/cls_att_g12o120_gqa_500q_llh_s1_step15
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Beyond Token Pruning: Operation Pruning in Vision-Language Models

🔧 Installation

🚀 Usage

Inference

Search

About

Uh oh!

Releases

Packages

Languages

zxcvfd13502/GSOP

Folders and files

Latest commit

History

Repository files navigation

Beyond Token Pruning: Operation Pruning in Vision-Language Models

🔧 Installation

🚀 Usage

Inference

Search

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages