skill-harness

skill-harness is the setup repo for the 45ck agent workflow stack. It is primarily an installer and generator: most downstream repos receive copied Claude/Codex agent definitions or project setup changes rather than importing skill-harness as an application runtime.

It does five jobs:

installs the shared dependency-repo suite across pack repos, doctrine repos, and single-skill repos
installs the shared Claude and Codex agents
bootstraps project-level tooling with @45ck/noslop and 45ck/agent-docs
optionally bootstraps Beads, enabled by default in project setup
hosts embedded packs for suite-local or incubating capabilities

Project status

This repo is open source under the MIT License. It is actively evolving, and the default branch is the source of truth until formal release channels are established.

The root package.json is marked private because npm is used here for local policy and artifact scripts, not for publishing the CLI as an npm package. Build the Go CLI locally or from a release bundle.

Trust boundaries

skill-harness can write into user-level agent directories, clone or copy skill packs, install project tooling, run setup commands, and create generated review surfaces in target repos. Review scripts and dependency changes as supply-chain-sensitive.

The setup-project command runs inside the resolved setup directory. In --scope auto, that may be a detected monorepo root rather than the exact directory passed with --dir. It can:

create or update package.json
install @45ck/noslop and github:45ck/agent-docs as development dependencies
run agent-docs init, noslop init, bd init, and agent-docs install-gates --quality
install Beads if bd is not already available
write .claude/settings.json to allow Claude agent-team use
install the repo-local Beads worktree wrapper by default
create .skill-harness/, docs/artifacts/, generated/review/, helper scripts, and package scripts

Beads installation first uses an existing bd, then tries go install github.com/steveyegge/beads/cmd/bd@latest, then falls back to the upstream Beads install script (irm ... | iex on Windows or curl ... | bash on Unix). Use --skip-beads when you want to install or review Beads manually.

Use --install-only to install packages without running initialization commands, --scope workspace to avoid monorepo-root lifting, --skip-claude-settings to avoid changing Claude tool permissions, --beads-worktrees=false to skip the Beads worktree wrapper, and the other --skip-* flags to reduce setup side effects.

Dependency source and pinning details are documented in docs/dependency-provenance.md.

Generated HTML under generated/review/ is a human review surface only. Canonical decisions, specs, models, and evidence stay in source files under docs/, packs/, scripts, or target repo scaffolds.

What it can set up

Shared suite

remote dependency repos plus embedded local packs under packs/
shared skills synced into ~/.claude/skills/ and ~/.agents/skills/
supports repos exposed as skills/, packaged .claude/.agents mirrors, or a single root SKILL.md
shared Claude agents
shared Codex agents

Project tooling

Use the project setup command when you want a repo scaffolded with the 45ck tooling stack:

./skill-harness setup-project --dir path/to/project

That command:

auto-detects monorepo roots from workspace markers such as pnpm-workspace.yaml, package.json workspaces, nx.json, turbo.json, lerna.json, and rush.json
auto-detects npm, pnpm, yarn, or bun from lockfiles or packageManager
defaults to monorepo-root setup when the target directory is inside a detected monorepo
creates a package.json in the resolved setup directory if one does not exist yet
installs @45ck/noslop and github:45ck/agent-docs
installs the Beads CLI if it is not already present
runs agent-docs init
runs noslop init
runs bd init
runs agent-docs install-gates --quality
writes .claude/settings.json to allow Claude agent-team use unless --skip-claude-settings is passed
installs the repo-local Beads worktree wrapper unless --beads-worktrees=false is passed
scaffolds developer artifact guidance by default, with Markdown/TOON/JSON/YAML/specgraph sources, visual-source-first product/business/data/research/UX/mockup artifacts, UML/UWE/C4/evidence model sources as canonical source, generated HTML as a human review surface, and a manifest for provenance/freshness, including optional governed agent-loop scaffolding
writes .skill-harness/setup-proof.json with setup scope, package manager, selected profiles, tool statuses, check commands, generated paths, skipped capabilities, and Beads mode

Benchmark results

Controlled experiments measured the toolkit (specgraph + noslop + skills) against raw Claude Code. Full data in experiments/RESULTS.md. Current adoption audit notes are in docs/adoption-audit-2026-04-29.md.

Experiment	Toolkit	Baseline	Delta
Small greenfield (×2 runs)	31–32 / 35	19–20 / 35	+12
Large greenfield (3 modules)	32 / 35	19 / 35	+13
Maintenance / handoff	33 / 35	19 / 35	+14
Ambiguous brief	35 / 35	13 / 35	+22

The gap is driven by scope enforcement and traceability — not code quality (functional output was equal in every experiment). The largest signal came from the ambiguous brief: the toolkit forced scope decisions before code was written; the baseline built a 4-class framework for a task that needed a 3-function library. Treat these as controlled workflow results; downstream repos still need active hook, CI, and agent wiring before those practices are enforced.

Install the CLI

Prerequisites for local builds and wrapper scripts: Go, Git, and the target shell (bash or PowerShell). Node.js and Python are only needed for development and artifact checks.

Build locally

git clone https://github.com/45ck/skill-harness.git
cd skill-harness
go build -o skill-harness ./cmd/skill-harness

Windows:

git clone https://github.com/45ck/skill-harness.git
cd skill-harness
go build -o skill-harness.exe .\cmd\skill-harness

Use wrapper scripts

bash install.sh

.\install.ps1

The wrapper scripts are go run shortcuts. They still require Go and run the checked-out source tree.

Build a release bundle

Release bundles can ship the binary plus the repo files together so Go is not required.

Build them with:

python scripts/build_release.py --version v0.1.0

Main commands

Install the full shared suite

./skill-harness install --all

Install selected agents

./skill-harness install --agents=requirements-analyst,system-modeler,security-reviewer

Install selected packs only

./skill-harness install --packs=business-analysis-skills,documentation-evidence-skills --packs-only

Install doctrine or utility repos only

./skill-harness install --packs=frontier-agent-playbook,repo-branding-skill --packs-only

Use the interactive installer

./skill-harness install --interactive

Set up a project with noslop and agent-docs

./skill-harness setup-project --dir ../my-project

Install only the packages and skip initialization:

./skill-harness setup-project --dir ../my-project --install-only

Monorepo auto mode:

./skill-harness setup-project --dir ../my-monorepo/apps/web

Force workspace-local setup instead of lifting to the monorepo root:

./skill-harness setup-project --dir ../my-monorepo/apps/web --scope workspace

Override package manager if auto-detection is not what you want:

./skill-harness setup-project --dir ../my-monorepo --package-manager pnpm

Skip one tool:

./skill-harness setup-project --dir ../my-project --skip-agent-docs
./skill-harness setup-project --dir ../my-project --skip-noslop
./skill-harness setup-project --dir ../my-project --skip-beads
./skill-harness setup-project --dir ../my-project --skip-claude-settings
./skill-harness setup-project --dir ../my-project --beads-worktrees=false
./skill-harness setup-project --dir ../my-project --skip-artifacts
./skill-harness setup-project --dir ../my-project --skip-developer-artifacts

Install the repo-local Beads worktree wrapper (copies scripts/beads/bd.mjs + adds .trees/ to .gitignore):

./skill-harness beads-worktrees --dir ../my-project

Choose a developer artifact profile:

./skill-harness setup-project --dir ../my-project --developer-artifacts-profile auto
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile codex-app
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile claude-desktop
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile cli
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile tui
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile media
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile agent-loop
./skill-harness setup-project --dir ../my-project --developer-artifacts-profile none

Artifact profiles are guidance and scaffold settings, not a separate runtime. auto is the default and resolves to dual: canonical Markdown/TOON/specgraph sources plus optional generated review surfaces. Use codex-app or claude-desktop for desktop workflows with file-backed previews, cli or tui for terminal-heavy projects, media for source-backed demo and generated media review workflows, agent-loop for governed self-improving agent workflows with trace/eval receipts, and none, --skip-artifacts, or --skip-developer-artifacts for minimal projects. The shorter --artifact-profile media|agent-loop|markdown|html|dual|none form remains supported as an alias.

Developer artifact scaffolding also creates docs/artifacts/artifacts.manifest.json and scripts/check-artifact-manifest.mjs. Use the manifest to record source-backed generated views, including product briefs, business dashboards, data dictionaries, metric definitions, research evidence boards, high-fidelity UX prototypes, Mermaid, C4, UML-style, dependency, architecture-space, demo media, and agent-loop artifacts. Generated HTML should be the human artifact: a self-contained review page with clear sections, tabs, diagrams or static previews, screenshots or evidence images, text summaries, source links, and freshness metadata. It should embed pre-rendered diagrams, such as inline SVG, instead of loading browser diagram runtimes by default. Visual-source-first artifact families use canonical sources under docs/artifacts/source/product/, business/, data/, research/, and ux/, with generated reviews under matching generated/review/ subfolders. High-fidelity review is the default for UI, product, customer-facing workflow, and mockup artifacts; low-fidelity sketches are scratch unless captured as evidence. Agents should auto-detect model impact for engineering changes and update the relevant model source, manifest, and human HTML review artifact when code, API, workflow, dependency, deployment, UI structure, or agent behavior changes. The media profile also creates generated/media/ and docs/artifacts/templates/demo-artifact.md; generated media stays out of git by default. The agent-loop profile creates generated/agent-runs/, docs/artifacts/source/agent-loop-playbook.md, docs/artifacts/templates/agent-loop-artifact.md, and scripts/check-agent-loop-policy.mjs.

Modeling mode defaults to auto. Fresh developer-artifact setups resolve auto to UML-first modeling; existing harnessed repos keep their current behavior unless migrated. Use explicit modes when needed:

./skill-harness setup-project --dir ../my-project --modeling-mode auto
./skill-harness setup-project --dir ../my-project --modeling-mode uml-first
./skill-harness setup-project --dir ../my-project --modeling-mode baseline
./skill-harness setup-project --dir ../my-project --skip-modeling

UML-first mode keeps modeling inside the normal artifact system and adds docs/artifacts/source/models/, docs/artifacts/source/models/model-inventory.md, generated/review/models/, docs/artifacts/templates/model-diff-artifact.md, scripts/generate-model-review.mjs, scripts/open-artifact-review.mjs, scripts/check-model-artifact-policy.mjs, model-aware package scripts, and setup-proof check entries. model-view still exists in the base scaffold; UML-first adds stricter UML/UWE/C4/evidence policy, model-diff support, and generated human HTML review expectations. --enable-modeling remains as a legacy alias for --modeling-mode uml-first.

Use npm run models:review to regenerate model review HTML and npm run models:drift to fail on stale generated review files. CI should use the drift check after generation has been committed or uploaded as an artifact.

Use npm run models:open to open the generated model index through the default local surface. Agents and host apps can resolve the same target without launching a browser by running:

node scripts/open-artifact-review.mjs --json --print

In Codex app or Claude desktop, open the resolved artifact with the built-in browser/preview surface when available. If that surface blocks file:// URLs, serve generated/review/ through a local static server and open /models/index.html from http://127.0.0.1:<port>/.

Every setup-project run writes .skill-harness/setup-proof.json. Treat it as machine-readable install evidence: it records the resolved setup directory, monorepo lift, package manager, requested/effective artifact profile, initialized tools, available check commands, generated paths, and skipped capabilities. It is intentionally descriptive; run the recorded check commands for live conformance.

Validate installed agent dependencies

./skill-harness check --all

Development

Required local tools:

Go, using the version in go.mod
Node.js 22 or newer for generated artifact checks
Python 3.12 or newer for suite maintenance scripts

Core checks:

go test ./...
npm run artifacts:check
python scripts/check_suite_drift.py --check

The GitHub Actions workflow in .github/workflows/quality.yml also builds the CLI, validates JSON inputs, compiles Python scripts, checks suite drift, verifies generated model review HTML, and runs a hermetic setup-project smoke test.

Included agents

requirements-analyst
requirements-analyst-beads
ux-researcher
system-modeler
system-modeler-beads
software-architect
software-architect-beads
web-engineer
backend-engineer
test-designer
test-designer-beads
qa-automation-engineer
quality-reviewer
security-reviewer
security-reviewer-beads
pentest-reviewer
delivery-manager
delivery-manager-beads
research-writer
workflow-engineer

Agent-to-skill mapping lives in docs/agent-loadouts.md.

Included dependency repos

45ck/agile-delivery-skills
45ck/authentication-cryptography-skills
45ck/automation-testing-skills
45ck/backend-persistence-skills
45ck/business-analysis-skills
45ck/cloud-platform-operations-skills
45ck/code-review-inspection-skills
45ck/data-structures-algorithmic-reasoning-skills
45ck/deployment-release-skills
45ck/design-for-testability-skills
45ck/documentation-evidence-skills
45ck/enterprise-architecture-integration-skills
45ck/fagan-inspection-skill
45ck/frontier-agent-playbook - doctrine repo
45ck/hci-review-skill
45ck/llm-agent-security-skills
45ck/maintenance-evolution-skills
45ck/marketing-product-skills
45ck/non-functional-testing-skills
45ck/oop-code-structure-skills
45ck/pentest-security-testing-skills
45ck/project-management-skills
45ck/repo-branding-skill - single-skill repo
45ck/refactoring-code-smells-skills
45ck/research-literature-review-skills
45ck/security-engineering-skills
45ck/software-architecture-skills
45ck/software-quality-skills
45ck/uml-analysis-modelling-skills
45ck/verification-test-design-skills
45ck/web-engineering-skills
coding-workflow-skills (embedded)
design-tooling-skills (embedded)
integration-tooling-skills (embedded)
specgraph-skills (embedded)
noslop-skills (embedded)
developer-artifact-skills (embedded)
demo-production-skills (embedded)
agent-operating-skills (embedded)

Shared doctrine companion

45ck/frontier-agent-playbook is the suite-wide doctrine repo for frontier-capability priors, agentic thinking, anti-fallback checks, and LLM-first architecture.

Use it in two ways:

install the doctrine skills globally with ./skill-harness install --packs=frontier-agent-playbook --packs-only
copy AGENTS.md, CLAUDE.md, AGENT_INSTRUCTIONS.md, and llms.txt into a target project when you want repo-local doctrine surfaces

Codex helper plugin

The optional skill-harness helpers plugin packages Codex-oriented helper skills for agent selection, handoff planning, Beads task shaping, and third-party skill intake. It is a local helper bundle, not a replacement for the main .codex/agents/ loadouts installed by the CLI.

Tooling repos used here

Contributing and support

Start with CONTRIBUTING.md for local setup, validation, scope, and pull request expectations.
Use SUPPORT.md for public support requests.
Follow SECURITY.md for vulnerability reports and generated artifact safety expectations.
Follow CODE_OF_CONDUCT.md for collaboration standards.

Full toolkit

The standard full toolkit for a new project is specgraph + noslop. The setup-project command installs both automatically. For manual installation steps or to install the matching skill packs (specgraph-skills, noslop-skills), see AGENT_INSTRUCTIONS.md.

For other agents

If another agent needs to install this repo or use it as the setup entrypoint, point it at AGENT_INSTRUCTIONS.md.

Important files

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
.agents/plugins		.agents/plugins
.beads		.beads
.claude/agents		.claude/agents
.codex/agents		.codex/agents
.github		.github
.skill-harness		.skill-harness
cmd/skill-harness		cmd/skill-harness
docs		docs
experiments		experiments
generated/review		generated/review
packs		packs
plugins/skill-harness-helpers		plugins/skill-harness-helpers
scripts		scripts
.gitignore		.gitignore
AGENTS.md		AGENTS.md
AGENT_INSTRUCTIONS.md		AGENT_INSTRUCTIONS.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
banner.svg		banner.svg
go.mod		go.mod
install.ps1		install.ps1
install.sh		install.sh
llms.txt		llms.txt
logo.svg		logo.svg
package.json		package.json
social-preview.svg		social-preview.svg
uninstall.ps1		uninstall.ps1
uninstall.sh		uninstall.sh

Folders and files

Latest commit

History

Repository files navigation

skill-harness

Project status

Trust boundaries

What it can set up

Shared suite

Project tooling

Benchmark results

Install the CLI

Build locally

Use wrapper scripts

Build a release bundle

Main commands

Install the full shared suite

Install selected agents

Install selected packs only

Install doctrine or utility repos only

Use the interactive installer

Set up a project with noslop and agent-docs

Validate installed agent dependencies

Development

Included agents

Included dependency repos

Shared doctrine companion

Codex helper plugin

Tooling repos used here

Contributing and support

Full toolkit

For other agents

Important files

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages