Awesome AI Pentesting

A curated list of AI-powered tools, frameworks, and resources for penetration testing, bug bounty hunting, red teaming, and cybersecurity operations.

The intersection of Artificial Intelligence and offensive security is evolving rapidly. This list tracks the most impactful open-source projects that leverage LLMs, machine learning, and autonomous agents for cybersecurity.

Maintained by @skyvanguard — Cybersecurity & AI Researcher | TryHackMe Top 1% Global

Stars updated: February 2026

Autonomous AI Pentesting Agents
MCP Servers for Security
Bug Bounty & Recon AI
CTF Solvers & Challenge Tools
Vulnerability Analysis & Scanning
Red Team & Offensive AI
Blue Team & Defensive AI
AI Security (Securing AI Systems)
Fuzzing with AI/ML
Pentest Report Generation
OSINT with AI
Research Papers & Resources
Contributing

Autonomous AI Pentesting Agents

Fully autonomous agents that perform end-to-end penetration testing tasks with minimal human intervention.

Tool	Description	Stars
Shannon	Fully autonomous AI hacker using Claude Agent SDK. 96.15% success rate on XBOW Benchmark. Finds and exploits real vulnerabilities in web apps.
Strix	Open-source AI hackers that run code dynamically, find vulnerabilities, and validate them through actual proof-of-concepts.
PentestGPT	Automated penetration testing agentic framework powered by LLMs. Published at USENIX Security 2024.
CAI	Open-source framework for building AI agents for real-world security testing. Top-10 in Dragos OT CTF 2025.
PentestAgent	AI agent framework for black-box security testing with prebuilt attack playbooks for bug bounty, red-team, and pentest.
PentAGI	Fully autonomous AI agents system. Docker sandboxed, 20+ built-in tools (nmap, metasploit, sqlmap), Neo4j knowledge graph.
Nebula	AI-powered pentesting assistant for automating recon, note-taking, and vulnerability analysis with human decision checkpoints.
NeuroSploit	AI-powered pentest framework with 100 vulnerability types, isolated Kali containers, 9 specialized agent personas.
Pentest Copilot	AI-powered browser-based ethical hacking assistant with agentic architecture that runs commands in the pentest environment.
Auto-Pentest-GPT-AI	LLM-powered pentesting for your software with automated vulnerability discovery and exploitation.
AIDA	AI-Driven Security Assessment that connects AI to 400+ pentesting tools via MCP (Exegol container), enabling natural language-driven security workflows.

MCP Servers for Security

Model Context Protocol servers that bridge AI assistants with cybersecurity tools.

Tool	Description	Stars
HexStrike AI	Advanced MCP server with 150+ cybersecurity tools. Lets AI agents autonomously run pentesting, vuln discovery, and bug bounty tools.
MCP-Security-Checklist	Comprehensive security checklist for MCP-based AI tools by SlowMist. Input validation, audit capability, and risk assessment.
awesome-mcp-security	Curated collection of MCP security resources: tools, papers, videos, articles, and vulnerability information.
mcp-for-security	MCP server implementations for SQLMap, FFUF, NMAP, Masscan and more security testing tools.
mcp-security-hub	36 MCP servers with 175+ tools: Nmap, Ghidra, Nuclei, SQLMap, Hashcat. Recon, web sec, binary analysis, blockchain, AD.
awesome-cyber-security-mcp	Curated list of MCP servers focused on cybersecurity operations.
mcp-server-security-standard	Open, testable security control standard (MSSS) for certifying MCP servers with levels and evidence requirements.
pentestMCP	AI-powered penetration testing via MCP. 20+ security assessment tools (Nmap, Nuclei, ZAP, SQLMap) through MCP protocol.
PentestThinkingMCP	AI pentesting reasoning engine with Beam Search, MCTS, attack scoring for attack path planning and CTF solving.
mcp-vanguard	Security pentesting MCP Server with 22 tools. Windows/WSL bridge for Kali tools integration with Claude.

Bug Bounty & Recon AI

AI-enhanced tools for reconnaissance and bug bounty automation.

Tool	Description	Stars
Nuclei	Fast vulnerability scanner with YAML-based DSL. AI-powered template generation via `-ai` flag. 10k+ community templates.
AutoSploit	Automated mass exploiter combining Shodan/Censys/Zoomeye for target discovery with Metasploit for exploitation.
Mantis	Security framework automating discovery, reconnaissance, and vulnerability scanning. Chains multiple tools from domain input.
ReconAIzer	Burp Suite extension using OpenAI GPT to discover endpoints, params, URLs, and subdomains during bug bounty recon.
Sn1per	Attack surface management platform combining automated scanning with intelligent analysis.
HackBot	AI-powered cybersecurity chatbot using LLaMA for code analysis, scan analysis, and cybersecurity Q&A.
BigBountyRecon	58 different techniques using Google dorks and open source tools for initial reconnaissance.
awesome-bugbounty-tools	Curated list of bug bounty tools including AI-enhanced scanners and recon automation frameworks.

CTF Solvers & Challenge Tools

AI tools for solving Capture The Flag competitions and security challenges.

Tool	Description	Stars
AI-Goat	Learn AI security through vulnerable LLM CTF challenges. Run locally, no sign-ups or cloud fees needed.
ChatGPT on CTF	Research testing whether AI-LLMs can solve CTF challenges by running commands in test environments.
LLM-Security-CTF	Learn LLM/AI security through CTF challenges: prompt injection, jailbreaks, guardrail bypass, tool abuse.
EnIGMA	Enhanced Interactive Generative Model Agent. State-of-the-art on NYU CTF and CyBench. 3x more challenges than prior agents. (ICML 2025)	Paper
CTFAgent	LLM-powered CTF solver with plan-and-execute paradigm. Outperforms 88% of human CTF participants.	Paper
CSAW Agentic CTF	CSAW competition using generative autonomous AI to solve CTF challenges. Major academic AI security benchmark.	Competition

Vulnerability Analysis & Scanning

AI-powered vulnerability discovery and code analysis tools.

Tool	Description	Stars
Garak	NVIDIA's LLM vulnerability scanner. Probes for hallucination, data leakage, prompt injection, toxicity, and jailbreaks.
Vulnhuntr	Zero-shot vulnerability discovery using Claude AI. Analyzes Python call chains. Found 12+ zero-days in production code.
Agentic Security	LLM vulnerability scanner / AI red teaming kit. Protects AI systems from jailbreaks, fuzzing, and multimodal attacks.
agent-scan	Snyk's security scanner for AI agents, MCP servers and agent skills. Detects prompt injections and tool poisoning.
Agentic Radar	Security scanner for agentic systems. Workflow visualization, tool identification, MCP server detection, vulnerability mapping.
ModelScan	Scans ML models for unsafe code. Supports H5, Pickle, SavedModel, and more formats.
NVIDIA Vulnerability Analysis	NVIDIA blueprint for identifying and mitigating container security vulnerabilities with generative AI.
Semgrep	Lightweight static analysis for finding bugs and enforcing code standards with AI pattern matching.

Red Team & Offensive AI

Offensive AI tools, adversarial ML, and AI red teaming frameworks.

Tool	Description	Stars
Promptfoo	LLM red teaming, pentesting, and vuln scanning with 100+ attack types. Tests complete AI systems including RAG and agents.
Adversarial Robustness Toolbox	IBM's Python library for ML Security: evasion, poisoning, extraction, inference attacks and defenses.
llm-attacks	Universal and transferable attacks on aligned language models. Academic research on adversarial attacks against LLMs.
PyRIT	Microsoft's Python Risk Identification Tool for red teaming generative AI systems.
ai-exploits	Collection of real-world AI/ML exploits for responsibly disclosed vulnerabilities.
offensive-ai-compilation	Curated list of Offensive AI resources: tools, techniques, research papers, and attack methodologies.
DeepTeam	Framework for red teaming LLMs. 10+ adversarial attacks, supports OWASP Top 10 for LLMs and NIST AI RMF.
Social-Engineer-Toolkit	Open-source penetration testing framework designed for social engineering.
Caldera	MITRE's automated adversary emulation platform with AI-driven attack planning.

Blue Team & Defensive AI

AI tools for defense, detection, and incident response.

Tool	Description	Stars
Falco	Cloud Native Runtime Security by CNCF. Detects threats across containers, hosts, K8s, and cloud.
Guardrails AI	Adding guardrails to LLMs. Input/output validation, content moderation, and PII detection.
NeMo Guardrails	NVIDIA's toolkit for programmable guardrails on LLM systems. Content moderation, jailbreak detection.
LLM Guard	Security toolkit for LLM interactions. Scanners for PII, prompt injection, banned topics, data leakage.
GhostWriter	Report management platform for red/blue team operations with AI-assisted analysis.
Wazuh	Open-source security monitoring with AI-enhanced threat detection, SIEM, and XDR.
Sigma	Generic signature format for SIEM systems with AI-powered rule creation.
Shuffle	Open-source SOAR platform for security automation and orchestration.

AI Security (Securing AI Systems)

Tools for defending AI/LLM systems against prompt injection, jailbreaks, and emerging threats.

Tool	Description	Stars
Rebuff	LLM prompt injection detector. Multi-layered defense: heuristics, LLM-based detection, vector DB, canary tokens.
prompt-injection-defenses	Every practical and proposed defense against prompt injection attacks. Comprehensive reference.
Awesome-LLMSecOps	LLM Security Operations in one repo. Curated links covering the full LLMSecOps lifecycle.
awesome-mcp-security	Collection of MCP security resources: vulnerabilities, tools, papers, and mitigation strategies.

Fuzzing with AI/ML

Machine learning-guided fuzzing and automated test generation tools.

Tool	Description	Stars
OSS-Fuzz-Gen	Google's LLM-powered fuzzing via OSS-Fuzz. Generates fuzz targets for C/C++. 29% max line coverage increase.
FuzzyAI	CyberArk's automated LLM fuzzing tool for finding jailbreaks and security vulnerabilities in LLM APIs.
FuzzForge AI	AI-powered workflow automation platform for AppSec and fuzzing. Marketplace of security tools with intelligent agents.
AFL++	Superior fork of AFL with ML-guided mutation strategies. Industry standard for fuzzing.
EvoMaster	First open-source AI-driven tool for system-level test generation. Evolutionary algorithms for REST, GraphQL, RPC.
ToolFuzz	First framework for testing LLM agent tools. Combines fuzzing with LLMs for correctness evaluation.

Pentest Report Generation

AI tools for automated penetration testing report creation.

Tool	Description	Stars
PwnDoc	Pentest report generator for findings and customizable Docx reports with templates.
SysReptor	Customizable pentest reporting platform with AI-assisted writing and finding descriptions.
APTRS	Automated Pentest Reporting System. Python/Django-based. PDF/Excel generation, project tracking.
Faction	Pen test report generation and assessment collaboration platform.

OSINT with AI

AI-powered Open Source Intelligence tools.

Tool	Description	Stars
SpiderFoot	Automated OSINT platform. 200+ data sources, threat surface mapping, most modules need no API keys.
Sherlock	Hunt down social media accounts by username across 400+ social networks.
Taranis AI	Advanced OSINT tool using NLP-powered news collection and AI for situational analysis.
OSINTGPT	OSINT analysis tool leveraging GPT embeddings and vector search for document similarity.
theHarvester	E-mails, subdomains and names harvester for OSINT.
GHunt	Offensive Google framework for OSINT.
Recon-ng	Full-featured web reconnaissance framework with modular design.

Research Papers & Resources

Papers

PentestGPT: An LLM-empowered Automatic Penetration Testing Tool — USENIX Security 2024
LLM Agents can Autonomously Hack Websites — GPT-4 hacks 73% of test websites
Getting pwn'd by AI: Penetration Testing with LLMs — Systematic study
AutoAttacker: LLM Guided Automatic Cyber-attacks — Autonomous attack systems
EnIGMA: Enhanced Interactive Generative Model Agent for CTF — ICML 2025

Awesome Lists

List	Description	Stars
Awesome-GPT-Agents	GPT agents for cybersecurity: offensive, defensive, and analytical.
awesome-llm-security	Tools, documents, and projects about LLM Security.
Awesome-LLM4Cybersecurity	LLMs for cybersecurity: pentesting reasoning, exploit generation, vulnerability detection.
awesome-cybersecurity-agentic-ai	Autonomous security agents, frameworks, and research.
awesome-ai-cybersecurity	Resources for AI in cybersecurity. Collection for professionals and researchers.
awesome-pentest	General pentesting resources.

Standards & Frameworks

OWASP AI Security and Privacy Guide
MITRE ATLAS — Adversarial Threat Landscape for AI Systems
NIST AI Risk Management Framework

Contributing

Contributions welcome! Please read CONTRIBUTING.md before submitting a PR.

Criteria for Inclusion

Open-source with a GitHub repository
Uses AI/ML in a meaningful way for security
Actively maintained (commits within 12 months)
Has documentation

Disclaimer

Tools listed are for educational purposes and authorized security testing only. Always obtain proper authorization before testing systems you do not own. The maintainers are not responsible for misuse.

Maintained by @skyvanguard
_{If this list helped you, consider giving it a star!}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation