tcapelle commented Oct 1, 2022

Hey, I've had so much fun playing with this code =)
I refactored the code to do multiple things:

  • It uses mixed precision training (2x speed up).
  • Refactored the modules to make them more readable and to avoid repeated code.
  • For the moment, only the conditional model is refactored, with argparse and defaults. Added a bunch of parameters to the script's argparse.
  • Logging to Weights and Biases, so you get intermediate results and can follow training. For instance, this is my recent Cifar10 run.
  • Added scripts to download the datasets (Cifar10, Cifar100 and Cifar10_64px) from Kaggle.
  • Refactored the training script to make it modular, also added validation metrics if available.
  • Added checkpoint loading.
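
To make the "argparse and defaults" item concrete, here is a minimal sketch of the pattern: a config object holds the defaults, and any command-line flag overrides it. The parameter names and values below (`run_name`, `epochs`, `batch_size`, `lr`, `fp16`) are hypothetical and chosen for illustration; they are not necessarily the ones used in this PR.

```python
import argparse
from types import SimpleNamespace

# Hypothetical defaults for illustration only; the PR's real names/values may differ.
default_config = SimpleNamespace(
    run_name="ddpm_conditional",
    epochs=100,
    batch_size=10,
    lr=3e-4,
    fp16=True,
)


def parse_args(config, argv=None):
    """Override the defaults stored in `config` with any flags given in `argv`.

    When `argv` is None, argparse reads the flags from sys.argv.
    """
    parser = argparse.ArgumentParser(description="Training hyperparameters")
    parser.add_argument("--run_name", type=str, default=config.run_name)
    parser.add_argument("--epochs", type=int, default=config.epochs)
    parser.add_argument("--batch_size", type=int, default=config.batch_size)
    parser.add_argument("--lr", type=float, default=config.lr)
    # BooleanOptionalAction (Python 3.9+) generates both --fp16 and --no-fp16.
    parser.add_argument(
        "--fp16", action=argparse.BooleanOptionalAction, default=config.fp16
    )
    args = parser.parse_args(argv)
    # Copy the parsed values back onto the config object.
    for key, value in vars(args).items():
        setattr(config, key, value)
    return config
```

Calling `parse_args(default_config, ["--epochs", "5", "--no-fp16"])` would set `epochs` to 5 and `fp16` to `False` while leaving the other defaults untouched. Keeping the defaults in one object means the same config can be passed to the training code and to the experiment logger.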

Please take a look, and feel free to contact me.

  • Added an FFCV dataloading option; this makes training dramatically faster.

Owner

dome272 commented Oct 5, 2022

Hey, this is an awesome addition, and thank you so much for the work. The only problem I have is that people coming from the video might be confused when they suddenly see so much new code. As a result, I will probably push your code to a new branch. I'm very keen to try all of this out!

Author

tcapelle commented Oct 5, 2022

I know, but this is very cool because it produces a nice wandb workspace. You can see the logging of the run here:
https://wandb.ai/capecape/train_sd?workspace=user-capecape

Author

tcapelle commented Oct 21, 2022

Owner

dome272 commented Oct 24, 2022

Very, very cool. Thank you!! I added a reference to your repo and blog in the README.


noushineftekhari commented Oct 20, 2023

I am writing an article based on your code and video: https://wandb.ai/capecape/train_sd/reports/Training-a-Conditional-Diffusion-model-from-scratch--VmlldzoyODMxNjE3

Hello, can I ask why we have three .pt files after training? Which one is for testing, and did you use diffusion for feature representation?


noushineftekhari commented Oct 23, 2023 via email
