Skip to content

Remove HF_TOKEN dependency in E2E test #357

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

jack8558
Copy link
Collaborator

@jack8558 jack8558 commented Aug 5, 2025

Removing HF_TOKEN dependency in E2E test

  • Created tp save_hf_model_files_to_gcs to save huggingface model files in gcs
  • Saved tokenizers and llama3-8B's model weights and configs in gcs bucket (the weights and configs are needed for SFT e2e test)
  • Since huggingface library can't load directly from gcs, added util funciton copy_gcs_to_local which download in tmp directory
  • Removed HF_TOKEN on e2e test and cpu test

tp save_hf_model_files_to_gcs example:

tp save-hf-model-files-to-gcs \
  --repo-id "meta-llama/Meta-Llama-3-8B" \
  --gcs-path "gs://bucket" \
  --file-type "all" \
  --temp-dir /mnt/disks/tmp

#14

@jack8558 jack8558 changed the title DRAFT Remove HF_TOKEN dependency in E2E test Aug 5, 2025
@jack8558 jack8558 linked an issue Aug 8, 2025 that may be closed by this pull request
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Remove dependency on HuggingFace token
1 participant