
thecapibara/code-jury

Code Jury [Vote]

4 independent AI judges evaluate your code, like America's Got Talent!

Works with Qwen Code, Claude Code, Gemini CLI, and any other AI coding assistant that supports custom commands/skills.

✨ Features

  • 🎭 4 Dynamic Judges: randomly selected from 10 personality types each time
  • 🌍 9 Languages: Ukrainian, English, Arabic, Russian, Polish, Spanish, German, French, Italian
  • ⚖️ Weighted Voting: experts in security, architecture, and testing carry more weight
  • 📊 Score & Verdict: 1-10 scale with a ✅/❌ verdict per judge
  • 🔄 Branch Tracking: tracks progress across multiple attempts on the same branch
  • 💾 Session Caching: remembers previous votes for the same diff
  • 📈 Issue Persistence: shows what got fixed, what's still broken, what's new
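Session caching "for the same diff" can be pictured as keying each stored vote by a hash of the diff text. The sketch below is only an illustration of that idea, not the skill's actual code: the function names, the one-JSON-file-per-key layout, and the result fields are all assumptions.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def diff_key(diff_text: str) -> str:
    """Return a stable cache key: the SHA-256 hex digest of the diff text."""
    return hashlib.sha256(diff_text.encode("utf-8")).hexdigest()


def save_vote(cache_dir: Path, diff_text: str, result: dict) -> Path:
    """Store a vote result under the key derived from the diff (hypothetical layout)."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    path = cache_dir / f"{diff_key(diff_text)}.json"
    path.write_text(json.dumps(result), encoding="utf-8")
    return path


def load_vote(cache_dir: Path, diff_text: str):
    """Return the cached result for this exact diff, or None on a cache miss."""
    path = cache_dir / f"{diff_key(diff_text)}.json"
    if not path.exists():
        return None
    return json.loads(path.read_text(encoding="utf-8"))


# Demo in a throwaway directory so nothing touches a real sessions/ folder.
with tempfile.TemporaryDirectory() as tmp:
    sessions = Path(tmp) / "sessions"
    diff = "--- a/app.py\n+++ b/app.py\n+print('hello')\n"
    save_vote(sessions, diff, {"average_score": 7.4, "passed": True})
    print(load_vote(sessions, diff))         # cached result for the same diff
    print(load_vote(sessions, diff + "\n"))  # None: any change yields a new key
```

Hashing the full diff means even a whitespace change counts as a new diff, which is the conservative choice for a review cache.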

🚀 Quick Install

One-line Installer (Recommended)

# Install for both Qwen & Claude
curl -fsSL https://raw.githubusercontent.com/thecapibara/vote-skill/main/install.sh | bash -s -- --remote

# Qwen Code only
curl -fsSL https://raw.githubusercontent.com/thecapibara/vote-skill/main/install.sh | bash -s -- --remote --qwen

# Claude Code only
curl -fsSL https://raw.githubusercontent.com/thecapibara/vote-skill/main/install.sh | bash -s -- --remote --claude

# Global installation
curl -fsSL https://raw.githubusercontent.com/thecapibara/vote-skill/main/install.sh | bash -s -- --remote --all --global

Clone & Install (Offline)

git clone https://github.com/thecapibara/vote-skill.git
cd vote-skill

# Both (default)
bash install.sh

# Qwen only
bash install.sh --qwen

# Claude only
bash install.sh --claude

# Global
bash install.sh --all --global

# Remote (download from GitHub)
bash install.sh --remote

Verify Installation

cd ~/.qwen/skills/vote  # or your project's .qwen/skills/vote
python3 scripts/select_judges.py --help

🎯 Usage

Qwen Code

Use the slash command:

/vote                     # Evaluate unstaged changes
/vote --mode lightning    # Quick check (fast, cheap)
/vote --mode thorough     # Deep review (slow, thorough)
/vote last commit         # Last commit
/vote last 3 commits      # Last 3 commits
/vote branch feature-auth # Branch changes

Or just ask naturally:

  • "Evaluate my changes"
  • "Check the last commit"
  • "Vote on my feature branch"

Claude Code

Use the slash command:

/vote                     # Evaluate unstaged changes
/vote --mode lightning    # Quick check (fast, cheap)
/vote --mode thorough     # Deep review (slow, thorough)
/vote last commit         # Last commit
/vote last 3 commits      # Last 3 commits
/vote branch feature-auth # Branch changes
/vote file script.py      # Specific file
/vote history             # View history

Or run the selector script directly:

python3 .claude/skills/vote/scripts/select_judges.py --help

Gemini CLI

Use the skill command:

/skills code-jury            # Evaluate unstaged changes (balanced: 2× Pro + 2× Flash)

Quality modes:

# Lightning: 4× Flash-Lite (fastest & cheapest)
/skills code-jury            # Ask for lightning mode

# Flash: 4× Flash (fast but capable)
/skills code-jury            # Ask for flash mode

# Balanced: 2× Pro + 2× Flash (default)
/skills code-jury            # Default mode

# Thorough: 4× Pro (maximum quality)
/skills code-jury            # Ask for thorough mode

Or run the selector script directly:

python3 .gemini/skills/code-jury/scripts/select_judges.py --platform gemini --help

👥 The Judges

Each vote uses 4 judges randomly selected from 10 personality types:

| Personality | Emoji | Weight | Claude/Qwen | Gemini | Expertise |
|---|---|---|---|---|---|
| Strict Critic | ⚖️ | 1.7x | Sonnet | 3.1 Pro | Architecture, errors, security |
| Supportive Mentor | 🏫 | 1.0x | Haiku | 3.1 Flash | Potential, best practices |
| Detail-Oriented Reviewer | 🔍 | 1.2x | Haiku | 3.1 Flash | Style, docs, tests, DRY |
| Creative Engineer | 🎨 | 1.0x | Haiku | 3.1 Flash | Creativity, elegance |
| Security Expert | 🛡️ | 2.0x | Sonnet | 3.1 Pro | Vulnerabilities, validation |
| Performance Optimizer | ⚡ | 1.5x | Sonnet | 3.1 Pro | Algorithms, memory, CPU |
| Testing Expert | 🧪 | 1.6x | Sonnet | 3.1 Pro | Unit tests, edge cases |
| Architecture Guru | 🏛️ | 1.8x | Sonnet | 3.1 Pro | SOLID, patterns, modularity |
| User Advocate | 👤 | 1.1x | Haiku | 3.1 Flash | API design, UX |
| Maintenance Focused | 🔧 | 1.2x | Haiku | 3.1 Flash | Readability, tech debt, legacy |
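Picking 4 of the 10 personality types without repeats can be sketched with `random.sample`. This is a minimal illustration, not the contents of `select_judges.py`: the `JUDGE_WEIGHTS` dict and the `pick_judges` function (including its `seed` parameter) are hypothetical names, with weights copied from the table.

```python
import random

# Weights copied from the judges table; the dict name is illustrative.
JUDGE_WEIGHTS = {
    "Strict Critic": 1.7,
    "Supportive Mentor": 1.0,
    "Detail-Oriented Reviewer": 1.2,
    "Creative Engineer": 1.0,
    "Security Expert": 2.0,
    "Performance Optimizer": 1.5,
    "Testing Expert": 1.6,
    "Architecture Guru": 1.8,
    "User Advocate": 1.1,
    "Maintenance Focused": 1.2,
}


def pick_judges(count=4, seed=None):
    """Pick `count` distinct personality types; pass a seed for a repeatable panel."""
    rng = random.Random(seed)
    return rng.sample(list(JUDGE_WEIGHTS), count)


panel = pick_judges()
for name in panel:
    print(f"{name} (weight: {JUDGE_WEIGHTS[name]}x)")
```

Because the panel changes on every run, the total weight in play also changes, so the same diff can face a stricter or a more lenient jury.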

Quality Modes

Each vote uses 4 judges. Choose the review depth. Modes vary by platform:

Claude / Qwen Code

| Mode | Models | Use case |
|---|---|---|
| ⚡ Lightning | 4× Haiku | Quick checks, smallest diff |
| ⚖️ Balanced | 2× Sonnet + 2× Haiku | Default, optimal balance |
| 🔍 Thorough | 4× Sonnet | Important reviews, security |

Gemini CLI

| Mode | Models | Use case |
|---|---|---|
| ⚡ Lightning | 4× Flash-Lite | Fastest & cheapest |
| 💨 Flash | 4× Flash | Fast but capable |
| ⚖️ Balanced | 2× Pro + 2× Flash | Default, optimal balance |
| 🔍 Thorough | 4× Pro | Maximum quality |

Usage: /vote --mode thorough or /vote --mode lightning
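The mode tables map naturally onto per-platform dictionaries. The sketch below shows one possible shape for such a mapping. It is an assumption for illustration only: the names `MODES_CLAUDE_QWEN`, `MODES_GEMINI`, `MODES_BY_PLATFORM`, and `model_lineup` are hypothetical and may not match what `select_judges.py` actually defines.

```python
# Model lineups per mode, transcribed from the tables (4 judges per vote).
MODES_CLAUDE_QWEN = {
    "lightning": ["haiku"] * 4,
    "balanced": ["sonnet"] * 2 + ["haiku"] * 2,
    "thorough": ["sonnet"] * 4,
}

MODES_GEMINI = {
    "lightning": ["flash-lite"] * 4,
    "flash": ["flash"] * 4,
    "balanced": ["pro"] * 2 + ["flash"] * 2,
    "thorough": ["pro"] * 4,
}

# Hypothetical platform keys; Claude and Qwen share the same lineups.
MODES_BY_PLATFORM = {
    "claude": MODES_CLAUDE_QWEN,
    "qwen": MODES_CLAUDE_QWEN,
    "gemini": MODES_GEMINI,
}


def model_lineup(platform, mode="balanced"):
    """Return the four model tiers for a platform and mode, or raise ValueError."""
    if platform not in MODES_BY_PLATFORM:
        raise ValueError(f"unknown platform: {platform!r}")
    modes = MODES_BY_PLATFORM[platform]
    if mode not in modes:
        raise ValueError(f"mode {mode!r} is not available on {platform!r}")
    return list(modes[mode])


print(model_lineup("gemini"))               # default balanced lineup
print(model_lineup("claude", "lightning"))  # four lightweight judges
```

Rejecting unknown modes explicitly matters here because `flash` exists only on Gemini CLI.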

Weighted Voting Explained

  • Judges with higher weight have more influence on the final verdict
  • Consensus detection: Issues mentioned by 3+ judges get extra priority
  • To pass: weighted "Yes" votes must exceed 60% of total weight
  • Example: Security Expert (2.0x) saying ❌ hurts more than Mentor (1.0x) saying ❌
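The pass rule and consensus detection described above can be sketched as follows. This is a minimal illustration under stated assumptions: the `Vote` dataclass, both function names, and the idea that issues arrive as already-normalized labels are hypothetical; only the 60% threshold, the "3+ judges" rule, and the weights come from this README.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Vote:
    judge: str
    weight: float
    approved: bool
    issues: tuple = ()  # assumed to be normalized issue labels


def weighted_verdict(votes, threshold=0.60):
    """Pass when the weighted share of Yes votes exceeds the threshold."""
    total = sum(v.weight for v in votes)
    if total <= 0:
        raise ValueError("total judge weight must be positive")
    share = sum(v.weight for v in votes if v.approved) / total
    return share > threshold, share


def consensus_issues(votes, min_judges=3):
    """Return issues raised by at least `min_judges` different judges."""
    counts = Counter(issue for v in votes for issue in set(v.issues))
    return sorted(issue for issue, n in counts.items() if n >= min_judges)


votes = [
    Vote("Security Expert", 2.0, False, ("missing input validation", "missing tests")),
    Vote("Architecture Guru", 1.8, True, ("missing tests",)),
    Vote("Detail-Oriented Reviewer", 1.2, True, ("missing tests", "magic numbers")),
    Vote("Supportive Mentor", 1.0, True, ()),
]

passed, share = weighted_verdict(votes)
print(f"Weighted Yes share: {share:.1%} -> {'pass' if passed else 'fail'}")
print("Consensus issues:", consensus_issues(votes))
```

In this panel the Yes weight is 4.0 out of 6.0 (about 66.7%), so the code passes despite the Security Expert's veto. If the Architecture Guru also voted No, the share would drop to 2.2 out of 6.0 and the vote would fail.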

📊 Example Output

🎭 Vote Results
============================================================

👩‍⚖️ Оксана (Strict Critic) (weight: 1.7x)
  ✅ Good error handling, new functionality added
  ❌ Missing tests
  📊 6.5/10 | ❌ No

👨‍🏫 Тарас (Supportive Mentor) (weight: 1.0x)
  ✅ Clean architecture, good function names
  ❌ Needs more comments
  📊 8/10 | ✅ Yes

🔍 Ірина (Detail-Oriented Reviewer) (weight: 1.2x)
  ✅ Follows code style
  ❌ Magic numbers at line 45
  📊 7/10 | ✅ Yes

🎨 Богдан (Creative Engineer) (weight: 1.0x)
  ✅ Elegant caching solution
  ❌ Could optimize the loop
  📊 8/10 | ✅ Yes

============================================================
📊 Summary: 3.2 ✅ Yes | 1.7 ❌ No (weighted)
🎯 Average Score: 7.4/10

🎉 Great! Code passed the vote!

💡 Tips for improvement:
  • Add tests
  • Avoid magic numbers
  • Optimize the loop

🔄 Branch Tracking

When working on a feature branch, the Vote skill tracks your progress:

  1. First vote → ❌ Fails (finds: no tests, magic numbers)
  2. Fix issues → Second vote → ✅ Shows what's fixed, what's still broken, what's new

📊 History on branch: feature-auth
============================================================

📋 Vote attempts:
  Attempt #1: 6.5/10 ❌
  Attempt #2: 7.5/10 ❌

📈 Trend: 6.5 → 7.5 (+1.0)

✅ Fixed issues:
  ✅ Added tests (was in attempt #1)

⚠️ Open issues:
  🔵 Deprecated function process()

============================================================
❌ Branch failing. Need to fix 1 issues.

🌍 Language Support

Judge names and personalities adapt to your language:

| Language | Flag | Example Names |
|---|---|---|
| Ukrainian | 🇺🇦 | Оксана, Тарас, Ірина |
| English | 🇬🇧 | Alex, Maya, Chris |
| Arabic | 🇸🇦 | عمر, ليلى, حسن |
| Russian | 🇷🇺 | Алексей, Елена, Дмитрий |
| Polish | 🇵🇱 | Kasia, Tomek, Agnieszka |
| Spanish | 🇪🇸 | Carlos, Sofía, Miguel |
| German | 🇩🇪 | Hans, Greta, Fritz |
| French | 🇫🇷 | Pierre, Camille, Luc |
| Italian | 🇮🇹 | Marco, Giulia, Luca |

📁 Project Structure

.qwen/skills/vote/           # Qwen Code skill
├── SKILL.md                 # Skill description
├── scripts/
│   ├── select_judges.py     # Main entry point: judge selection
│   ├── judge_profiles/      # 9 languages (names & personalities)
│   └── judge_review_guides/ # Detailed checklists for each judge type
├── sessions/                # Vote cache (gitignored)
└── branches/                # Branch history (gitignored)

.claude/
├── commands/
│   └── vote.md              # /vote slash command for Claude
├── agents/
│   ├── jury-judge-haiku.md  # Lightweight sub-agent
│   └── jury-judge-sonnet.md # Expert sub-agent
└── skills/vote/             # Claude Code skill
    └── scripts/             # Same scripts as Qwen Code

.gemini/skills/code-jury/    # Gemini CLI skill
├── SKILL.md                 # Skill description
└── scripts/                 # Same scripts as Qwen Code

.gemini/agents/
├── jury-judge-pro.md        # Expert sub-agent (Gemini 3.1 Pro)
└── jury-judge-flash.md      # Lightweight sub-agent (Gemini 3.1 Flash/Lite)

judges.json                  # Source of truth for judge weights, models, review guides
install.sh                   # Installer (--qwen, --claude, --gemini, --remote)
select_judges.py             # CLI: judge selection & localization

⚙️ Configuration

Key configuration files:

  • judges.json: judge weights, models (sonnet/haiku + pro/flash/flash-lite), and focus areas
  • select_judges.py: MODES_* dicts for quality mode behavior per platform
  • scripts/judge_review_guides/: detailed checklists for each judge type (10 files)
  • scripts/judge_profiles/: localized names & personalities (9 languages)

🔒 Privacy & Security

  • Session data and branch history are NOT committed to git
  • .gitignore files are included to protect sensitive information
  • All data stays local, with no external API calls (yet)

🛣️ Roadmap

  • Real LLM API integration (Claude, GPT, Qwen)
  • Custom judge personalities
  • Vote report export (Markdown/PDF)
  • Team sharing via git
  • CI/CD integration
  • Historical trends dashboard

🤝 Contributing

PRs welcome! Areas of interest:

  • More language profiles
  • New judge personality types
  • LLM integration
  • UI improvements

📝 License

MIT

👤 Authors

  • 💡 Idea: thecapibara
  • 🛠️ Implementation: Qwen Code + thecapibara
