Skip to content

Commit e8f7778

Browse files
committed
minor tweak for readme
1 parent af8f4f1 commit e8f7778

File tree

2 files changed

+4
-36
lines changed

2 files changed

+4
-36
lines changed

README.md

Lines changed: 3 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -456,12 +456,9 @@ We're building a hosted version:
456456
## Installation
457457

458458
```bash
459-
# Basic installation
459+
# Install (includes skills testing)
460460
pip install evalview
461461

462-
# With skills testing (Claude Code / Codex)
463-
pip install evalview[skills]
464-
465462
# With HTML reports (Plotly charts)
466463
pip install evalview[reports]
467464

@@ -793,7 +790,7 @@ evalview/
793790
794791
**The first CI/CD testing framework for AI agent skills.**
795792
796-
Skills are the new plugins. With 25k+ skills on marketplaces and enterprises deploying to thousands of employees, skills need the same testing rigor as any other code.
793+
Skills are the new plugins. With thousands of skills on marketplaces and enterprises deploying skills to employees, they need the same testing rigor as any other code.
797794
798795
EvalView lets you validate skill structure and test skill behavior **automatically on every commit**—before your skill reaches users.
799796
@@ -946,31 +943,6 @@ Summary: ✓
946943
Total tokens: 3,847
947944
```
948945
949-
### Add to CI
950-
951-
Block bad skills before they reach users:
952-
953-
```yaml
954-
# .github/workflows/skills.yml
955-
name: Skill Tests
956-
on: [push, pull_request]
957-
958-
jobs:
959-
test-skills:
960-
runs-on: ubuntu-latest
961-
steps:
962-
- uses: actions/checkout@v4
963-
- run: pip install evalview
964-
965-
# Validate structure
966-
- run: evalview skill validate ./skills/ -r --strict
967-
968-
# Test behavior
969-
- run: evalview skill test ./tests/skills/*.yaml
970-
env:
971-
ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
972-
```
973-
974946
### Why Test Skills?
975947
976948
**You can test skills manually in Claude Code. So why use EvalView?**
@@ -987,7 +959,7 @@ Manual testing works for development. EvalView is for **automation**:
987959
988960
**Who needs automated skill testing?**
989961
990-
- **Skill authors** publishing to marketplaces (25k+ skills on SkillsMP)
962+
- **Skill authors** publishing to marketplaces
991963
- **Enterprise teams** rolling out skills to thousands of employees
992964
- **Open source maintainers** accepting contributions from the community
993965
- **Anyone** who wants CI/CD for their skills

pyproject.toml

Lines changed: 1 addition & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,7 @@ dependencies = [
3232
"pydantic>=2.5.0",
3333
"pyyaml>=6.0",
3434
"openai>=1.12.0",
35+
"anthropic>=0.39.0",
3536
"rich>=13.7.0",
3637
"httpx>=0.26.0",
3738
"python-dateutil>=2.8.2",
@@ -48,16 +49,11 @@ reports = [
4849
watch = [
4950
"watchdog>=3.0",
5051
]
51-
# Claude Code skills testing
52-
skills = [
53-
"anthropic>=0.39.0",
54-
]
5552
# All optional features
5653
all = [
5754
"jinja2>=3.0",
5855
"plotly>=5.0",
5956
"watchdog>=3.0",
60-
"anthropic>=0.39.0",
6157
]
6258
# Development dependencies
6359
dev = [

0 commit comments

Comments
 (0)