Kubernetes Hugging Face Mirror

Deployment

helm repo add olah https://surajssd.github.io/k8s-hf-mirror
helm repo update
helm upgrade -i --wait \
    --create-namespace \
    --namespace olah \
    olah \
    olah/olah

Using the cache with vLLM

Ensure that the vLLM deployment has the following environment variable set:

export HF_ENDPOINT=http://olah.olah:18090

Testing Locally

Start a local port-forward to the service:

kubectl -n olah port-forward svc/olah 18090

Install the huggingface-cli:

virtualenv venv
source venv/bin/activate
pip install -U "huggingface_hub[cli]"

Run the following to use the deployment as a cache:

export HF_ENDPOINT=http://localhost:18090
rm -rf ~/.cache/huggingface/hub/models--facebook--mms-tts-sml/
huggingface-cli download facebook/mms-tts-sml

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.github/workflows		.github/workflows
olah		olah
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kubernetes Hugging Face Mirror

Deployment

Using the cache with vLLM

Testing Locally

About

Uh oh!

Releases 3

Packages

Uh oh!

Uh oh!

Languages

License

surajssd/k8s-hf-mirror

Folders and files

Latest commit

History

Repository files navigation

Kubernetes Hugging Face Mirror

Deployment

Using the cache with vLLM

Testing Locally

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Uh oh!

Languages

Packages