This document defines the Agent Genome (or "DNA") format — the portable, versioned, evolvable unit of specialized capability in the Agent Drive ecosystem.
- Structured enough to be composable, auditable, and machine-processable.
- Rich enough to capture not just "what to do" but "why it works" and "how well it works".
- Evolvable — supports forking, merging, and mutation with clear provenance.
- Portable across different agent implementations (with adapters).
A Genome is a directory (or archive) containing:
genome/
├── manifest.yaml # Identity, version, lineage, applicability
├── framework.yaml # The core typed playbook
├── reasoning/ # High-signal patterns (causality, analogies, contradictions, etc.)
│ ├── patterns.jsonl
│ └── traces/
├── tools/ # Proven tool compositions and guardrails
│ └── compositions.yaml
├── evaluations/ # Performance data on reference tasks
│ └── reference-evals.json
├── artifacts/ # Example successful outputs
├── provenance/ # Full history of creation and improvements
│ └── lineage.json
└── schema/ # Validation schemas
genome:
id: security-incident-postmortem-v2
version: 2.3.1
content_hash: sha256:...
created: 2026-05-10
last_improved: 2026-05-23
authors:
- type: human
name: "the project maintainer"
- type: agent
id: "rich-node-042"
run: "eng-2026-05-18-postmortem-17"
applicability:
domains: [security, devops, reliability]
problem_signatures:
- "production incident with unclear root cause"
- "need for blameless + technically deep writeup"
dependencies:
genomes: [] # other genomes this one builds on
agent_capabilities: [tool_use, long_reasoning, file_analysis]
evaluation_score:
reference_tasks: 0.87
human_preference: 0.92
cost_efficiency: 0.78This is the core typed framework.yaml specification.
It defines deterministic, schema-validated steps.
See the seeded examples under genomes/examples/ for reference shapes.
Key addition in Agent Drive: steps can declare which genome sub-components they activate.
This is where we go beyond static playbooks and capture how good agents think.
Examples of what lives here:
- Causal chain templates that worked well
- Productive analogy mappings for the domain
- Contradiction detection heuristics
- Useful ways to structure a postmortem timeline
- Common failure modes and how to surface them early
These are stored in a machine-readable + human-readable form so both the Evolutionary Engine and other agents can use them.
Not just "this tool exists", but proven sequences and parallel patterns that delivered value, including:
- Orderings that reduce context pollution
- Parallelization strategies that are safe
- Guardrail combinations that caught bad outputs early
Every serious genome should carry data about how well it performs.
This creates selection pressure and lets the system know when a genome is degrading or improving.
Critical for trust and evolution.
lineage.json records:
- Parent genomes (when this was forked or merged)
- Improvement events (with links to the review run that proposed the change)
- Validation runs that confirmed the improvement
This gives us a true evolutionary tree instead of a bag of unrelated skills.
- Semantic versioning for the genome contract.
- Content-addressed hashes for exact reproducibility.
- Forking model similar to Git: you can fork a genome, improve it, and later propose merges or let the registry surface the best descendants.
Not every agent will natively understand the full genome format.
Agent Drive will provide:
- Export adapters → turn relevant parts of a genome into external skills, custom prompts, etc.
- Import scanners → pull useful patterns out of existing agent runs and package them as genomes.
Planned primitive operations on genomes:
scan(run)→ produce candidate genomefork(genome)→ create a new versioned descendantmerge(genome_a, genome_b)→ safe recombination (with conflict resolution)mutate(genome, proposal)→ apply an improvementevaluate(genome, reference_suite)→ score itselect(best_n, criteria)→ evolutionary selection
This v0.1 is intentionally a starting point. As we implement the scanner and evolutionary engine, the exact shapes of reasoning/, tools/, and evaluation data will be refined based on what actually proves valuable to extract and transfer.
The invariant we will protect: Genomes must be more than prompts or skills — they must be rich, structured, improvable packages of capability that compound across the ecosystem.
This spec will live in the repo and evolve with the code. Contributions to the format are very welcome.