Expert panel discussions for complex decisions
Claude becomes 3-7 domain experts who debate, challenge each other, and synthesize actionable recommendations through Hegelian dialectic.
Install โข Usage โข How It Works โข Research
npx skills add wyattowalsh/panel-debate-skillTip
After installation, the /panel-debate command becomes available in Claude Code.
/panel-debate "Should we migrate to microservices?"
/panel-debate size:5 depth:deep "Build vs buy our CRM?"
/panel-debate style:adversarial "GraphQL vs REST?"| Option | Values | Default | Description |
|---|---|---|---|
size |
3-7 |
auto | Number of experts (auto-scales with topic breadth) |
depth |
quick / standard / deep |
standard |
Discussion rounds: 1 / 2-3 / 4+ |
style |
collaborative / adversarial / academic |
collaborative |
Panel interaction tone |
Note
Low-complexity topics (e.g., "What port does PostgreSQL use?") trigger a warningโmulti-agent debate adds overhead without benefit for simple questions.
๐ Microservices Migration Panel
โญโ Panel Discussion: Microservices Migration โโโโโโโโโโโโโโโโฎ
โ Experts: Dr. Chen (Security), Kai Lindstrรถm (Platform), โ
โ Rashida Okoye (Ops), Sophia Martinez (Product) โ
โฐโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฏ
๐ค Dr. Chen (Security):
"Each microservice becomes a potential entry point. We need
zero-trust from day one."
๐ค Kai Lindstrรถm (Platform) [Contrarian]:
"Before we assume microservices, has anyone considered a
well-structured modular monolith? You get 80% of the benefits
without the operational overhead."
๐ค Rashida Okoye (responding to Kai):
"I've seen both approaches. With 15 engineers and only 3 with
distributed systems experience, Kai's point is well-taken."
๐ Round 1 Synthesis:
โข Agreement: Team capability matters more than architecture choice
โข Tension: Invest in microservices now vs. extract services later
โข Open question: What are our actual scaling bottlenecks?
โญโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฎ
โ [1] Continue [2] Follow-up [3] Redirect [4] Conclude โ
โฐโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฏ
flowchart TB
subgraph Input
A[๐ฏ Topic]
end
subgraph Validation
B{Complexity<br/>Score}
B -->|5-7: Low| C[โ ๏ธ Warn User]
C --> D{Proceed?}
D -->|No| E[Direct Answer]
D -->|Yes| F
B -->|8-15| F[โ Continue]
end
subgraph Panel["Panel Assembly"]
F --> G[Generate Experts]
G --> H{Diversity<br/>โฅ60?}
H -->|No| G
H -->|Yes| I[๐ญ Panel Ready]
end
subgraph Discussion
I --> J[Round N]
J --> K[Cross-Examination]
K --> L[๐ก๏ธ Contrarian Check]
L --> M[๐ Synthesis]
M --> N{Converged?}
N -->|No| O{Stalled?}
O -->|Yes| P[Adjust Panel]
P --> J
O -->|No| J
end
subgraph Output
N -->|Yes| Q[๐ Final Report]
end
A --> B
| State | Description | Exit Condition |
|---|---|---|
COMPLEXITY_CHECK |
Assess if topic warrants panel | Score calculated |
EXPERT_GENERATION |
Create diverse personas | Diversity โฅ60 |
DISCUSSION |
Facilitate debate rounds | Convergence or max rounds |
SYNTHESIS |
Generate recommendations | Report complete |
Important
Every panel must include three archetypes: Contrarian (challenges consensus), Synthesizer (connects perspectives), and Specialist (provides domain depth).
This skill synthesizes findings from peer-reviewed multi-agent debate research1.
| Finding | Source | Implementation |
|---|---|---|
| Diversity is THE dominant driver | Wu et al. 20252 | Diversity score โฅ60 required |
| Majority pressure suppresses correction | Wu et al. 20252 | Contrarian protection protocol |
| Heterogeneous > homogeneous agents | A-HMAD 20253 | Max 30% same-archetype |
| MAD helps complex, not simple tasks | ICLR 20254 | Complexity classifier |
| Confidence weighting improves synthesis | CISC 20255 | Weighted aggregation |
| 3 agents ร 2 rounds is effective | Du et al. 20246 | Default configuration |
๐ Detailed Research Summaries
"Improving Factuality and Reasoning through Multiagent Debate"
The foundational paper establishing that multiple LLM instances debating over rounds significantly improves reasoning:
- Cross-examination reduces hallucinations
- Performance scales with agent count and rounds
- 3 agents ร 2 rounds is cost-effective baseline
"Can LLM Agents Really Debate?"
Critical analysis revealing group diversity is THE dominant driverโmore important than speaking order or confidence visibility. Majority pressure suppresses correction, leading to conformity cascades.
Adaptive Heterogeneous Multi-Agent Debate
Heterogeneous specialized agents significantly outperform homogeneous teams. Simple majority voting underperforms quality-weighted aggregation.
Confidence Improves Self-Consistency
Prioritizing high-confidence reasoning paths reduces required samples by 40%+ while maintaining accuracy.
Caution
Research identifies these failure modesโpanel-debate-skill actively prevents them:
| Anti-Pattern | Problem | Mitigation |
|---|---|---|
| Conformity Cascade | LLMs drift toward majority, entrenching errors | Required contrarian + disagreement triggers |
| Devil's Advocate Overuse | Pure adversarial debate reduces accuracy | Synthesizer required, ~90% collaborative |
| False Consensus | Averaging positions loses nuance | Context-dependent synthesis, "CONTESTED" labels |
| Simple Task Overhead | MAD adds cost without benefit | Complexity classifier screens topics |
The synthesis mechanism uses Hegelian dialectic:
flowchart LR
T[Thesis<br/><i>Initial position</i>] --> A[Antithesis<br/><i>Challenge</i>]
A --> S[Synthesis<br/><i>Emergence</i>]
S -.->|"becomes next"| T2[New Thesis]
style T fill:#4a9eff,color:#fff
style A fill:#ff6b6b,color:#fff
style S fill:#51cf66,color:#fff
style T2 fill:#4a9eff,color:#fff,stroke-dasharray: 5 5
Each round's synthesis becomes the next round's thesis, enabling progressive refinement rather than simple compromise.
panel-debate-skill/
โโโ SKILL.md # Entry point (~150 lines)
โโโ AGENTS.md # AI agent instructions
โโโ CLAUDE.md # โ symlink to AGENTS.md
โโโ references/
โ โโโ research-foundations.md
โ โโโ expert-generation.md
โ โโโ turn-taking.md
โ โโโ synthesis-patterns.md
โ โโโ output-formats.md
โโโ examples/
โโโ architecture-decision.md
โโโ business-strategy.md
โโโ security-implementation.md
Note
The skill uses progressive disclosure: SKILL.md contains lean execution logic; reference files are loaded on-demand for depth.
See CONTRIBUTING.md for guidelines.
Quick Test Commands
# Install locally
npx skills add ./
# Test complexity rejection
/panel-debate "What port does PostgreSQL use?"
# Test standard panel
/panel-debate "Redis vs Memcached?"
# Test deep panel
/panel-debate depth:deep "Microservices migration strategy"Footnotes
-
Full citations in references/research-foundations.md โฉ
-
Wu et al. "Can LLM Agents Really Debate?" arXiv:2511.07784 โฉ โฉ2
-
A-HMAD "Adaptive Heterogeneous Multi-Agent Debate" Springer โฉ
-
Du et al. "Improving Factuality through Multiagent Debate" arXiv:2305.14325 โฉ