Spatio Temporal Feature Injection

Image Editing with Diffusion Models in the domain of fashion
Abstract:
Image editing, specifically appearance or style transfer, is an area of computer vision that has seen significant growth due to recent advancements in image diffusion models. There are many different approaches to appearance transfer with fine-tuning, external network adapters and tuning-free methods. In this work, we focus on the latter, where we leverage the existing Stable Diffusion network with some modifications to the internal processing of the attention maps and latent vectors, without modifying the model weights. We propose an appearance transfer method based on partial masking and timestep-controlled appearance modulation, controlling the area and amount of appearance we transfer. The results on the benchmarks outperform current baseline models, which shows the method’s potential for future improvements.

Environment

Our code builds on the requirement of the diffusers library. To set up their environment, please run:

git clone https://github.com/lukakeso/spatio_temporal_feature_injection.git
cd STFI
conda env create -f environment/environment.yaml
conda activate STFI

(Optional) You may also want to install SAM-HQ to extract the instance masks:

pip install git+https://github.com/SysCV/sam-hq.git.

Please download the ViT-L HQ-SAM model from the provided link.

Appearance Transfer

python run.py \
--app_image_path example/0.jpg \                                 # appearance image 
--struct_image_path example/1.jpg \                              # atructure image 
--prompt "high-quality, detailed, realistic photo of clothes" \  # default prompt
--output_path results \                                          # output folder
--scenario 1                                                     # scenario number

Acknowledgement

Our code is largely based on the following open-source projects: DIFT, Cross-image-attention, Eye-for-an-eye.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
environment		environment
models		models
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
DBSCAN.py		DBSCAN.py
README.md		README.md
appearance_transfer_model_final.py		appearance_transfer_model_final.py
checkout_correct_modules.py		checkout_correct_modules.py
config.py		config.py
constants.py		constants.py
metrics.py		metrics.py
run.py		run.py
torch_metrics.py		torch_metrics.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spatio Temporal Feature Injection

Environment

Appearance Transfer

Acknowledgement

About

Uh oh!

Releases

Packages

Languages

lukakeso/spatio_temporal_feature_injection

Folders and files

Latest commit

History

Repository files navigation

Spatio Temporal Feature Injection

Environment

Appearance Transfer

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages