Running AI Workbench with Docker

The published container image bundles the TypeScript runtime and the React web UI into a single process on port 8080. This guide covers the end-user "try it" path; for source-from-Node development see the top-level README.md.

Quickstart

curl -O https://raw.githubusercontent.com/datastax/ai-workbench/main/docker-compose.yml
docker compose up

Open http://localhost:8080. The first compose up pulls ghcr.io/datastax/ai-workbench:latest (multi-arch — amd64 and arm64). Subsequent boots are near-instant.

Without compose

If you'd rather not use compose:

docker run --rm \
  -p 8080:8080 \
  -v workbench-data:/var/lib/workbench \
  -e WORKBENCH_CONFIG=/app/examples/workbench.docker.yaml \
  ghcr.io/datastax/ai-workbench:latest

Same behavior: file-backed persistence on the named volume, healthcheck baked into the image.

Persistence

The compose stack mounts a named volume workbench-data at /var/lib/workbench. The bundled workbench.docker.yaml pins the control plane to the file driver rooted there, so workspaces / agents / KBs / jobs survive docker compose down.

Where the volume lives

docker volume inspect ai-workbench_workbench-data

On Docker Desktop this is inside the VM disk; on Linux Engine it's under /var/lib/docker/volumes/.

Backup and restore

# Backup
docker run --rm \
  -v ai-workbench_workbench-data:/data \
  -v "$(pwd):/backup" \
  alpine tar czf /backup/workbench-backup.tgz -C /data .

# Restore (into a fresh stack)
docker compose down -v
docker compose up -d --no-start
docker run --rm \
  -v ai-workbench_workbench-data:/data \
  -v "$(pwd):/backup" \
  alpine tar xzf /backup/workbench-backup.tgz -C /data
docker compose up -d

Reset to a clean slate

docker compose down -v
docker compose up -d

-v removes the named volume; the stack restarts with no workspaces.

Configuration

The runtime reads workbench.yaml for backend selection and per-feature toggles (auth, chat, MCP). Environment variables override matching keys at boot — see docs/configuration.md for the full schema.

Precedence (highest wins): shell env > container env > .env > workbench.yaml defaults.

Astra DB credentials

Workspace-scoped Astra credentials resolve from process env at use time, so adding them to .env is enough — no yaml edit required:

# .env (next to docker-compose.yml)
ASTRA_DB_API_ENDPOINT=https://<db-id>-<region>.apps.astra.datastax.com
ASTRA_DB_APPLICATION_TOKEN=AstraCS:...

docker compose up -d --force-recreate reloads env.

When you create a workspace in the UI / API with credentialsRef: { token: "env:ASTRA_DB_APPLICATION_TOKEN" }, the runtime resolves the ref from these vars.

Note: the bundled workbench.docker.yaml pins the control plane (where workspace / agent / KB metadata is stored) to the local file volume. To put the control plane itself in Astra, use the override file pattern below to point WORKBENCH_CONFIG at a custom yaml with controlPlane: { driver: astra, ... }.

OpenRouter chat

Generate a key at https://openrouter.ai/keys, add it to .env:

OPENROUTER_API_KEY=sk-or-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx  # secret-scan: allow

One key reaches OpenRouter's 300+ models. For fully offline use, point the chat: block at the ollama provider instead — Ollama runs locally and needs no key.

Ollama on the host

Inside the container, localhost is the container itself — so an Ollama server on your machine is not at localhost:11434 from the runtime's point of view. The bundled compose file handles this:

it maps host.docker.internal to the host gateway (extra_hosts: host-gateway, works on Docker Desktop and Linux Engine), and
it defaults OLLAMA_BASE_URL to http://host.docker.internal:11434/v1.

So docker compose up + Ollama running on the host with defaults just works. If Ollama listens somewhere else, set the env (shell or .env):

OLLAMA_BASE_URL=http://gpu-box.lan:11434/v1

A bare origin (http://gpu-box.lan:11434) is fine — the runtime appends the /v1 OpenAI-compatible path when no path is given. Per-service overrides win over the env: the LLM-service form's Endpoint base URL field (and the API's endpointBaseUrl) pin a specific service to a specific server.

Linux note: Ollama binds to 127.0.0.1 by default, which is not reachable from the container even via the gateway. Start it with OLLAMA_HOST=0.0.0.0 ollama serve (or systemd override) so it listens on the gateway interface too.

Then either uncomment the chat: block in the bundled workbench.docker.yaml (via the override pattern) or bind-mount a custom yaml with chat: enabled. Without chat: and no agent-level LLM binding, POST .../messages returns 503 chat_disabled.

Override file pattern

docker-compose.override.yml (gitignored) is merged on top of docker-compose.yml automatically. Use it for the common deviations:

cp docker-compose.override.yml.example docker-compose.override.yml
# edit, then:
docker compose up -d --force-recreate

The example file shows three pre-canned snippets:

Build from source — replace image: with a build: block pointing at runtimes/typescript/Dockerfile.
Bind-mount a custom yaml — swap WORKBENCH_CONFIG to a path under /etc/workbench/ and bind-mount your file there read-only.
Bump log verbosity — set LOG_LEVEL=debug.

Upgrading

docker compose pull
docker compose up -d --remove-orphans

pull fetches the newest :latest (or whatever tag you've pinned via AI_WORKBENCH_TAG). Recreating preserves the named volume.

To pin a version:

AI_WORKBENCH_TAG=0.3.0 docker compose up -d

Healthcheck

The image ships with a HEALTHCHECK that hits /healthz every 30s. Status surfaces in docker ps:

docker compose ps
# NAME            STATUS                    PORTS
# ai-workbench    Up 2 minutes (healthy)    0.0.0.0:8080->8080/tcp

/readyz adds config-load gating; use it from external probes that need to know the runtime finished startup.

Troubleshooting

Port already in use. Override the host-side port:

AI_WORKBENCH_PORT=9090 docker compose up -d

Volume permission denied. The image pre-creates /var/lib/workbench owned by node (uid 1000), so a fresh named volume initializes writable on first use. If your volume was created by an image older than 0.5.4 (which left the mount point root-owned) or with a different uid, workspace creation fails with permission errors until you fix the ownership:

docker run --rm -v ai-workbench_workbench-data:/data alpine \
  chown -R 1000:1000 /data

(The image runs as node = uid 1000. Prefer this one-time chown over running the container as root via a user: "0:0" compose override.)

View logs.

docker compose logs -f workbench

Pull denied / unauthorized. New GHCR packages default to private, so right after the first release the image isn't pullable without auth. A DataStax org member with admin:packages rights flips it with the bundled helper:

# One-time per gh login:
gh auth refresh -s admin:packages

# After the first release lands:
npm run release:make-public
# → "Flipped to public. `docker pull` now works without auth."

The script (scripts/ghcr-make-public.mjs) is idempotent — re-running on an already-public package is a no-op — and exits with a hint if the package doesn't exist yet (no release has shipped). Until the flip happens, authenticate manually:

echo $GITHUB_TOKEN | docker login ghcr.io -u <user> --password-stdin

Manifest unknown / no matching arch. The :latest tag is multi-arch (amd64 + arm64) as of the first release that ships this guide. Older tags may be amd64-only — pin a newer tag or rebuild from source via the override file.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running AI Workbench with Docker

Quickstart

Without compose

Persistence

Where the volume lives

Backup and restore

Reset to a clean slate

Configuration

Astra DB credentials

OpenRouter chat

Ollama on the host

Override file pattern

Upgrading

Healthcheck

Troubleshooting

FilesExpand file tree

docker.md

Latest commit

History

docker.md

File metadata and controls

Running AI Workbench with Docker

Quickstart

Without compose

Persistence

Where the volume lives

Backup and restore

Reset to a clean slate

Configuration

Astra DB credentials

OpenRouter chat

Ollama on the host

Override file pattern

Upgrading

Healthcheck

Troubleshooting