A curated list of AI-powered tools, frameworks, and resources for penetration testing, bug bounty hunting, red teaming, and cybersecurity operations.
The intersection of Artificial Intelligence and offensive security is evolving rapidly. This list tracks the most impactful open-source projects that leverage LLMs, machine learning, and autonomous agents for cybersecurity.
Maintained by @skyvanguard — Cybersecurity & AI Researcher | TryHackMe Top 1% Global
Stars updated: February 2026
- Autonomous AI Pentesting Agents
- MCP Servers for Security
- Bug Bounty & Recon AI
- CTF Solvers & Challenge Tools
- Vulnerability Analysis & Scanning
- Red Team & Offensive AI
- Blue Team & Defensive AI
- AI Security (Securing AI Systems)
- Fuzzing with AI/ML
- Pentest Report Generation
- OSINT with AI
- Research Papers & Resources
- Contributing
Fully autonomous agents that perform end-to-end penetration testing tasks with minimal human intervention.
| Tool | Description | Stars |
|---|---|---|
| Shannon | Fully autonomous AI hacker using Claude Agent SDK. 96.15% success rate on XBOW Benchmark. Finds and exploits real vulnerabilities in web apps. | |
| Strix | Open-source AI hackers that run code dynamically, find vulnerabilities, and validate them through actual proof-of-concepts. | |
| PentestGPT | Automated penetration testing agentic framework powered by LLMs. Published at USENIX Security 2024. | |
| CAI | Open-source framework for building AI agents for real-world security testing. Top-10 in Dragos OT CTF 2025. | |
| PentestAgent | AI agent framework for black-box security testing with prebuilt attack playbooks for bug bounty, red-team, and pentest. | |
| PentAGI | Fully autonomous AI agents system. Docker sandboxed, 20+ built-in tools (nmap, metasploit, sqlmap), Neo4j knowledge graph. | |
| Nebula | AI-powered pentesting assistant for automating recon, note-taking, and vulnerability analysis with human decision checkpoints. | |
| NeuroSploit | AI-powered pentest framework with 100 vulnerability types, isolated Kali containers, 9 specialized agent personas. | |
| Pentest Copilot | AI-powered browser-based ethical hacking assistant with agentic architecture that runs commands in the pentest environment. | |
| Auto-Pentest-GPT-AI | LLM-powered pentesting for your software with automated vulnerability discovery and exploitation. | |
| AIDA | AI-Driven Security Assessment that connects AI to 400+ pentesting tools via MCP (Exegol container), enabling natural language-driven security workflows. |
Model Context Protocol servers that bridge AI assistants with cybersecurity tools.
| Tool | Description | Stars |
|---|---|---|
| HexStrike AI | Advanced MCP server with 150+ cybersecurity tools. Lets AI agents autonomously run pentesting, vuln discovery, and bug bounty tools. | |
| MCP-Security-Checklist | Comprehensive security checklist for MCP-based AI tools by SlowMist. Input validation, audit capability, and risk assessment. | |
| awesome-mcp-security | Curated collection of MCP security resources: tools, papers, videos, articles, and vulnerability information. | |
| mcp-for-security | MCP server implementations for SQLMap, FFUF, NMAP, Masscan and more security testing tools. | |
| mcp-security-hub | 36 MCP servers with 175+ tools: Nmap, Ghidra, Nuclei, SQLMap, Hashcat. Recon, web sec, binary analysis, blockchain, AD. | |
| awesome-cyber-security-mcp | Curated list of MCP servers focused on cybersecurity operations. | |
| mcp-server-security-standard | Open, testable security control standard (MSSS) for certifying MCP servers with levels and evidence requirements. | |
| pentestMCP | AI-powered penetration testing via MCP. 20+ security assessment tools (Nmap, Nuclei, ZAP, SQLMap) through MCP protocol. | |
| PentestThinkingMCP | AI pentesting reasoning engine with Beam Search, MCTS, attack scoring for attack path planning and CTF solving. | |
| mcp-vanguard | Security pentesting MCP Server with 22 tools. Windows/WSL bridge for Kali tools integration with Claude. |
AI-enhanced tools for reconnaissance and bug bounty automation.
| Tool | Description | Stars |
|---|---|---|
| Nuclei | Fast vulnerability scanner with YAML-based DSL. AI-powered template generation via -ai flag. 10k+ community templates. |
|
| AutoSploit | Automated mass exploiter combining Shodan/Censys/Zoomeye for target discovery with Metasploit for exploitation. | |
| Mantis | Security framework automating discovery, reconnaissance, and vulnerability scanning. Chains multiple tools from domain input. | |
| ReconAIzer | Burp Suite extension using OpenAI GPT to discover endpoints, params, URLs, and subdomains during bug bounty recon. | |
| Sn1per | Attack surface management platform combining automated scanning with intelligent analysis. | |
| HackBot | AI-powered cybersecurity chatbot using LLaMA for code analysis, scan analysis, and cybersecurity Q&A. | |
| BigBountyRecon | 58 different techniques using Google dorks and open source tools for initial reconnaissance. | |
| awesome-bugbounty-tools | Curated list of bug bounty tools including AI-enhanced scanners and recon automation frameworks. |
AI tools for solving Capture The Flag competitions and security challenges.
| Tool | Description | Stars |
|---|---|---|
| AI-Goat | Learn AI security through vulnerable LLM CTF challenges. Run locally, no sign-ups or cloud fees needed. | |
| ChatGPT on CTF | Research testing whether AI-LLMs can solve CTF challenges by running commands in test environments. | |
| LLM-Security-CTF | Learn LLM/AI security through CTF challenges: prompt injection, jailbreaks, guardrail bypass, tool abuse. | |
| EnIGMA | Enhanced Interactive Generative Model Agent. State-of-the-art on NYU CTF and CyBench. 3x more challenges than prior agents. (ICML 2025) | Paper |
| CTFAgent | LLM-powered CTF solver with plan-and-execute paradigm. Outperforms 88% of human CTF participants. | Paper |
| CSAW Agentic CTF | CSAW competition using generative autonomous AI to solve CTF challenges. Major academic AI security benchmark. | Competition |
AI-powered vulnerability discovery and code analysis tools.
| Tool | Description | Stars |
|---|---|---|
| Garak | NVIDIA's LLM vulnerability scanner. Probes for hallucination, data leakage, prompt injection, toxicity, and jailbreaks. | |
| Vulnhuntr | Zero-shot vulnerability discovery using Claude AI. Analyzes Python call chains. Found 12+ zero-days in production code. | |
| Agentic Security | LLM vulnerability scanner / AI red teaming kit. Protects AI systems from jailbreaks, fuzzing, and multimodal attacks. | |
| agent-scan | Snyk's security scanner for AI agents, MCP servers and agent skills. Detects prompt injections and tool poisoning. | |
| Agentic Radar | Security scanner for agentic systems. Workflow visualization, tool identification, MCP server detection, vulnerability mapping. | |
| ModelScan | Scans ML models for unsafe code. Supports H5, Pickle, SavedModel, and more formats. | |
| NVIDIA Vulnerability Analysis | NVIDIA blueprint for identifying and mitigating container security vulnerabilities with generative AI. | |
| Semgrep | Lightweight static analysis for finding bugs and enforcing code standards with AI pattern matching. |
Offensive AI tools, adversarial ML, and AI red teaming frameworks.
| Tool | Description | Stars |
|---|---|---|
| Promptfoo | LLM red teaming, pentesting, and vuln scanning with 100+ attack types. Tests complete AI systems including RAG and agents. | |
| Adversarial Robustness Toolbox | IBM's Python library for ML Security: evasion, poisoning, extraction, inference attacks and defenses. | |
| llm-attacks | Universal and transferable attacks on aligned language models. Academic research on adversarial attacks against LLMs. | |
| PyRIT | Microsoft's Python Risk Identification Tool for red teaming generative AI systems. | |
| ai-exploits | Collection of real-world AI/ML exploits for responsibly disclosed vulnerabilities. | |
| offensive-ai-compilation | Curated list of Offensive AI resources: tools, techniques, research papers, and attack methodologies. | |
| DeepTeam | Framework for red teaming LLMs. 10+ adversarial attacks, supports OWASP Top 10 for LLMs and NIST AI RMF. | |
| Social-Engineer-Toolkit | Open-source penetration testing framework designed for social engineering. | |
| Caldera | MITRE's automated adversary emulation platform with AI-driven attack planning. |
AI tools for defense, detection, and incident response.
| Tool | Description | Stars |
|---|---|---|
| Falco | Cloud Native Runtime Security by CNCF. Detects threats across containers, hosts, K8s, and cloud. | |
| Guardrails AI | Adding guardrails to LLMs. Input/output validation, content moderation, and PII detection. | |
| NeMo Guardrails | NVIDIA's toolkit for programmable guardrails on LLM systems. Content moderation, jailbreak detection. | |
| LLM Guard | Security toolkit for LLM interactions. Scanners for PII, prompt injection, banned topics, data leakage. | |
| GhostWriter | Report management platform for red/blue team operations with AI-assisted analysis. | |
| Wazuh | Open-source security monitoring with AI-enhanced threat detection, SIEM, and XDR. | |
| Sigma | Generic signature format for SIEM systems with AI-powered rule creation. | |
| Shuffle | Open-source SOAR platform for security automation and orchestration. |
Tools for defending AI/LLM systems against prompt injection, jailbreaks, and emerging threats.
| Tool | Description | Stars |
|---|---|---|
| Rebuff | LLM prompt injection detector. Multi-layered defense: heuristics, LLM-based detection, vector DB, canary tokens. | |
| prompt-injection-defenses | Every practical and proposed defense against prompt injection attacks. Comprehensive reference. | |
| Awesome-LLMSecOps | LLM Security Operations in one repo. Curated links covering the full LLMSecOps lifecycle. | |
| awesome-mcp-security | Collection of MCP security resources: vulnerabilities, tools, papers, and mitigation strategies. |
Machine learning-guided fuzzing and automated test generation tools.
| Tool | Description | Stars |
|---|---|---|
| OSS-Fuzz-Gen | Google's LLM-powered fuzzing via OSS-Fuzz. Generates fuzz targets for C/C++. 29% max line coverage increase. | |
| FuzzyAI | CyberArk's automated LLM fuzzing tool for finding jailbreaks and security vulnerabilities in LLM APIs. | |
| FuzzForge AI | AI-powered workflow automation platform for AppSec and fuzzing. Marketplace of security tools with intelligent agents. | |
| AFL++ | Superior fork of AFL with ML-guided mutation strategies. Industry standard for fuzzing. | |
| EvoMaster | First open-source AI-driven tool for system-level test generation. Evolutionary algorithms for REST, GraphQL, RPC. | |
| ToolFuzz | First framework for testing LLM agent tools. Combines fuzzing with LLMs for correctness evaluation. |
AI tools for automated penetration testing report creation.
| Tool | Description | Stars |
|---|---|---|
| PwnDoc | Pentest report generator for findings and customizable Docx reports with templates. | |
| SysReptor | Customizable pentest reporting platform with AI-assisted writing and finding descriptions. | |
| APTRS | Automated Pentest Reporting System. Python/Django-based. PDF/Excel generation, project tracking. | |
| Faction | Pen test report generation and assessment collaboration platform. |
AI-powered Open Source Intelligence tools.
| Tool | Description | Stars |
|---|---|---|
| SpiderFoot | Automated OSINT platform. 200+ data sources, threat surface mapping, most modules need no API keys. | |
| Sherlock | Hunt down social media accounts by username across 400+ social networks. | |
| Taranis AI | Advanced OSINT tool using NLP-powered news collection and AI for situational analysis. | |
| OSINTGPT | OSINT analysis tool leveraging GPT embeddings and vector search for document similarity. | |
| theHarvester | E-mails, subdomains and names harvester for OSINT. | |
| GHunt | Offensive Google framework for OSINT. | |
| Recon-ng | Full-featured web reconnaissance framework with modular design. |
- PentestGPT: An LLM-empowered Automatic Penetration Testing Tool — USENIX Security 2024
- LLM Agents can Autonomously Hack Websites — GPT-4 hacks 73% of test websites
- Getting pwn'd by AI: Penetration Testing with LLMs — Systematic study
- AutoAttacker: LLM Guided Automatic Cyber-attacks — Autonomous attack systems
- EnIGMA: Enhanced Interactive Generative Model Agent for CTF — ICML 2025
| List | Description | Stars |
|---|---|---|
| Awesome-GPT-Agents | GPT agents for cybersecurity: offensive, defensive, and analytical. | |
| awesome-llm-security | Tools, documents, and projects about LLM Security. | |
| Awesome-LLM4Cybersecurity | LLMs for cybersecurity: pentesting reasoning, exploit generation, vulnerability detection. | |
| awesome-cybersecurity-agentic-ai | Autonomous security agents, frameworks, and research. | |
| awesome-ai-cybersecurity | Resources for AI in cybersecurity. Collection for professionals and researchers. | |
| awesome-pentest | General pentesting resources. |
- OWASP AI Security and Privacy Guide
- MITRE ATLAS — Adversarial Threat Landscape for AI Systems
- NIST AI Risk Management Framework
Contributions welcome! Please read CONTRIBUTING.md before submitting a PR.
- Open-source with a GitHub repository
- Uses AI/ML in a meaningful way for security
- Actively maintained (commits within 12 months)
- Has documentation
Tools listed are for educational purposes and authorized security testing only. Always obtain proper authorization before testing systems you do not own. The maintainers are not responsible for misuse.
Maintained by @skyvanguard
If this list helped you, consider giving it a star!