GGUF support #12
Open
Description
I deploy with Docker. I downloaded a 7B model file, and my environment configuration file looks like this:
CRIA_SERVICE_NAME=cria
CRIA_HOST=0.0.0.0
CRIA_PORT=3000
CRIA_MODEL_ARCHITECTURE=llama
# Used in docker-compose
CRIA_MODEL_PATH=/llama/llama-2-7b/consolidated.00.pth
CRIA_USE_GPU=true
CRIA_GPU_LAYERS=32
CRIA_ZIPKIN_ENDPOINT=http://zipkin-server:9411/api/v2/spans
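As a quick sanity check on the configuration above, the sketch below reads `CRIA_MODEL_PATH` from an env file and tests whether the file it points to begins with the four-byte `GGUF` magic that GGUF files start with. The helper names `parse_env` and `looks_like_gguf` are hypothetical and not part of cria. It is only an assumption, based on the `.pth` extension, that `consolidated.00.pth` is a raw PyTorch checkpoint rather than a GGUF file.

```python
from pathlib import Path

# GGUF files begin with these four ASCII bytes.
GGUF_MAGIC = b"GGUF"


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks, '#' comments, and lines without '='."""
    env: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        env[key.strip()] = value.strip()
    return env


def looks_like_gguf(path) -> bool:
    """Return True if the file at `path` starts with the GGUF magic bytes."""
    with Path(path).open("rb") as f:
        return f.read(4) == GGUF_MAGIC


if __name__ == "__main__":
    # Hypothetical usage: assumes a .env file in the current directory.
    env = parse_env(Path(".env").read_text())
    model_path = env.get("CRIA_MODEL_PATH", "")
    if model_path and Path(model_path).is_file():
        print(model_path, "GGUF" if looks_like_gguf(model_path) else "not GGUF")
    else:
        print("model file not found:", model_path)
```

Run inside the container, where `/llama/...` is mounted, this shows whether the configured file is in GGUF format or would need converting first.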