YAPP - Yet Another Pentest Parser

One parser to rule them all

A powerful Python library and CLI tool for parsing and processing multiple pentesting tool outputs.

YAPP is a comprehensive solution for parsing pentesting tool outputs (Nessus, Nmap, and Burp-soon) into structured JSON, with advanced consolidation capabilities, excel outputting, and both programmatic and command-line interfaces. Built as an extensible framework with modularity, efficiency, and ease of use in mind.

🎯 Why YAPP Exists

The pentesting industry has a multi-faceted tooling problem (CAPDEV). Most businesses treat capabilities development as an afterthought - they'd rather hire more people to combat performance and resource issues, rather than fix their piss-poor workflows and capabiltiies.

Why? 💰C.R.E.A.M.💰
- Unbilled consultant = cost to business
  - This is a bad mindset as business teaches us that you need to spend money to make money.
Result?
- Consultants working unpaid overtime because basic data processing eats half their day
- Delays, lots of them. These delays then cause a snowball effect where the next engagement is impacted.
- Cutting corners to meet deadlines.
- Burnout & general negative vibes.

✨ Features

🔧 Multi-Tool Support

Nessus XML: Full vulnerability parsing with consolidation and API formatting
Nmap XML: Service discovery with port filtering and flat JSON output
Extensible Framework: Easy to add new parsers following established patterns
Auto-Detection: Automatically identifies file types

🖥️ Dual Interface Design

CLI Tool: Beautiful command-line interface with colored output and tool-specific options
Python Library: Clean programmatic API for integration into your projects
One External Dependency: It used to be 0 deps but openpyxl is needed for xlsx

📊 Advanced Nessus Processing

Parse Nessus XML files into structured JSON/Python dictionaries
Advanced consolidation engine with smart vulnerability grouping
Plugin output pattern matching and filtering
Rule-based vulnerability categorization
API-ready output formatting with entity limiting
Excel report generation from consolidated findings

🗺️ Comprehensive Nmap Support

Parse Nmap XML into structured format with service details
Port status filtering (open, closed, filtered)
Flat JSON output for legacy tool compatibility
Service enumeration and script output capture

🎯 Intelligence & Analytics

Track vulnerabilities globally and per host (Nessus)
Comprehensive statistics and metrics for both tools
Multiple FQDN support per host
Detailed vulnerability information (CVE, CVSS, affected systems)
Human-readable output with plugin/service names
Consolidation rule logging -- easily debug consolidation rules by seeing what hasn't matched and why

⚡ Performance

Benchmark Results (Nessus):

File Size: 118 MB Nessus XML (1005 hosts, 214 findings, 17 remediations)
Processing Time: 9.18 seconds total (5.76s processing + 0.45s I/O)
Throughput: ~13 MB/second (total) / ~20 MB/second (processing only)
Memory Efficient: Low memory footprint with streaming parser
Includes: Full parsing + consolidation engine + API formatting + JSON output (3 files)

Tested on: WSL2 (Debian) on Windows host

🚀 Installation

For CLI Usage Only

# Install globally with pipx (recommended for CLI-only usage)
git clone https://github.com/FlyingPhish/YetAnotherPentestParser && cd YetAnotherPentestParser
pipx install .

OR

pipx install git+https://github.com/FlyingPhish/YetAnotherPentestParser.git
# pipx install git+https://github.com/FlyingPhish/YetAnotherPentestParser.git@branch

For Programmatic Usage

# In your virtual environment
pip install git+https://github.com/FlyingPhish/YetAnotherPentestParser.git
# pip install git+https://github.com/FlyingPhish/YetAnotherPentestParser.git@branch

Upgrading

# When installed using pipx
pipx upgrade yapp

# When installed using pip
pip install git+https://github.com/FlyingPhish/YetAnotherPentestParser.git --force-reinstall

💡 Usage

🖥️ Command Line Interface

Core Arguments:

-i, --input-file: Path to input file (required)
-t, --file-type: File type (auto, nessus, nmap) - auto-detects by default
-of, --output-folder: Output directory (default: ./output)
-on, --output-name: Custom output filename
--no-output: Display results only, don't write files
--version: What it says on the tin

Nessus Options:

-c, --consolidate: Enable vulnerability consolidation
-a, --api-output: Generate API-ready format (requires -c)
-r, --rules-file: Custom consolidation rules file
-el, --entity-limit: Maximum entities per API finding
--log-exclusions: Enable detailed exclusion logging to file during consolidation (Nessus only - used to debug rules)

Nmap Options:

-s, --port-status: Filter by port status (all, open, closed, filtered)
-fj, --flat-json: Generate flat JSON for legacy tool compatibility

Excel Options:

-e, --excel-output: Generate Excel report (Consolidated JSON only)

Basic parsing (auto-detects file type):

yapp -i scan.nessus
yapp -i scan.xml

Nessus with consolidation and API formatting:

yapp -i scan.nessus -c -a -el 10 # If finding has > 10 affected, the API output will just say 'refer to external document'
yapp -i scan.nessus -c -a

Generate Excel report from consolidated findings (separate run):

# First: Parse and consolidate
yapp -i scan.nessus -c

# Then: Generate Excel from consolidated JSON
yapp -i output/x_Consolidated_Findings.json -e

When using the -e flag on a consolidated JSON file, an Excel workbook is generated with one worksheet per consolidated finding. Each sheet contains:

Columns: FQDN, IP, Port, and one column per consolidated plugin showing Yes/No if that plugin affected the service
Rows: One row per affected service
Worksheets: One sheet per consolidated vulnerability (e.g., "SSL/TLS Protocol Weaknesses", "SSH Weaknesses")

Nmap with port filtering and flat JSON:

yapp -i scan.xml -s open --flat-json

Custom output with specific file type:

yapp -i scan.xml -t nmap -of ./reports -on results.json

Display results without saving files:

yapp -i scan.nessus --no-output -c

🐍 Python Library

from yapp import process_file

# Auto-detect and parse any supported file
results = process_file('scan.nessus')  # or scan.xml

# Nessus with full pipeline
nessus_results = process_file(
    'scan.nessus',
    consolidate=True,
    api_format=True,
    entity_limit=10
)

# Nmap with filtering and flat JSON
nmap_results = process_file(
    'scan.xml',
    port_status='open',
    flat_json=True
)

# Access parsed data
nessus_data = nessus_results['parsed']
consolidated = nessus_results.get('consolidated')
api_ready = nessus_results.get('api_ready')

nmap_data = nmap_results['parsed']
flat_json = nmap_results.get('flat_json')

For comprehensive examples, see Library Usage Examples and Library Documentation

🏗️ Project Structure

yapp/
├── __init__.py              # Main package API
├── cli.py                   # CLI interface
├── core/                    # Core processing modules
│   ├── __init__.py
│   ├── processor.py         # Main processing pipeline
│   ├── nessus_parser.py     # Nessus XML parsing
│   ├── nmap_parser.py       # Nmap XML parsing
│   ├── excel_formatter.py   # Excel logic
│   ├── consolidator.py      # Vulnerability consolidation
│   └── formatter.py         # API output formatting
├── utils/                   # Utility modules
│   ├── __init__.py
│   ├── file_utils.py        # File operations & detection
│   ├── json_utils.py        # JSON handling
│   ├── display.py           # CLI output formatting
│   └── logger.py            # Logging utilities
└── config/                  # Configuration
    ├── __init__.py
    ├── default_rules.json    # Default consolidation rules
    └── consolidation_README.md

🔬 Nessus Consolidation Engine

The consolidation engine intelligently groups related vulnerabilities, reducing noise and improving vulnerability management efficiency.

Features:

Smart Pattern Matching: Regex patterns for vulnerability names and plugin output
Plugin Output Filtering: Search actual Nessus plugin output content
Flexible Grouping: Group by IP, port, service, or custom criteria
Rule-Based Configuration: JSON rules for different vulnerability types
Advanced Logic: AND/OR pattern matching, exclusion rules
Entity Limiting: Control API output size with configurable entity limits

Common Consolidation Rules:

Outdated Software: Group software with version update patterns
Certificate Issues: Consolidate SSL/TLS certificate problems
Weak Encryption: Group protocol and cipher vulnerabilities
JavaScript Libraries: Separate web application library issues
Operating System: Group OS-specific updates and patches

📊 Excel Report Generation

Transform consolidated vulnerability data into structured Excel workbooks for easy analysis and validation.

Features:

Matrix Layout: One worksheet per vulnerability with Yes/No plugin indicators
Service-Level Detail: Each row shows FQDN, IP, Port, and which plugins affected it
Consolidation Validation: Quickly verify which plugins were grouped together
Analyst-Friendly Format: Familiar spreadsheet format for review and sign-off
Automatic Naming: Output filename matches input consolidated JSON file

Workflow:

Parse & Consolidate: yapp -i scan.nessus -c → Creates consolidated JSON
Generate Excel: yapp -i scan_Consolidated.json -e → Creates matching .xlsx file
Review: Open Excel workbook with one sheet per consolidated vulnerability

🔬 Nessus Consolidation Engine + API Formatter

You can use -a or --api-output, which transforms the results of your consolidation rules into a basic JSON structure that can be used with a reporting engine to turn Nessus results into findings within your pentest report engine. The output has been designed to work with my custom API for Ghostwriter. (DM me on Twatter (x) if you want more info on this)

🔬 Nmap Parser

You can see the proper output in the below sections

📋 Output Formats

Nessus Standard Parsed Output

{
  "context": {
    "scan_id": "string",
    "scan_start": "DD-MM-YYYY HH:MM:SS",
    "scan_duration": "H:MM:SS",
    "policy_name": "string"
  },
  "stats": {
    "hosts": {
      "total": 100,
      "credentialed_checks": 95,
      "multi_fqdn_hosts": 10
    },
    "vulnerabilities": {
      "total": 500,
      "by_severity": {
        "Critical": 5,
        "High": 25,
        "Medium": 150,
        "Low": 200,
        "None": 120
      }
    }
  },
  "hosts": {
    "1": {
      "ip": "string",
      "fqdns": ["string"],
      "os": "string",
      "scan_start": "string",
      "scan_end": "string",
      "credentialed_scan": bool,
      "vulnerabilities": {
        "Critical": int,
        "High": int,
        "Medium": int,
        "Low": int,
        "None": int
      },
      "ports": {
        "443/tcp": {
          "service": "string",
          "vulnerabilities": ["string"]
        }
      }
    }
  },
  "vulnerabilities": {
    "142960": {
      "name": "string",
      "family": "string",
      "severity": int,
      "risk_factor": "string",
      "cvss": {
        "base_score": float,
        "temporal_score": float,
        "vector": "string"
      },
      "cvss3": {
        "base_score": float,
        "temporal_score": float,
        "vector": "string"
      },
      "description": "string",
      "synopsis": "string",
      "solution": "string",
      "see_also": ["string"],
      "cve": [],
      "cwe": [],
      "xref": [],
      "affected_hosts": {
        "1": {
          "ip": "string",
          "fqdn": "string",
          "ports": ["string"],
          "plugin_output": "string"
        }
      }
    }
  }
}

Nessus Consolidated Output (with -c flag)

When using the -c flag, an additional consolidated findings file is generated:

{
  "consolidation_metadata": {
    "rules_applied": ["rule_name1", "rule_name2"],
    "original_plugins_count": int,
    "consolidated_count": int,
    "consolidation_timestamp": "string"
  },
  "consolidated_vulnerabilities": {
    "rule_name": {
      "title": "Human-Readable Title",
      "severity": int,
      "risk_factor": "string",
      "cvss": {},
      "cvss3": {},
      "consolidated_plugins": {
        "plugin_id": "Plugin Name"
      },
      "cve": [],
      "cwe": [],
      "solutions": [],
      "affected_services": {
        "192.168.1.100:443": {
          "ip": "string",
          "fqdn": "string", 
          "port": "string",
          "issues_found": [
            {
              "id": "plugin_id",
              "name": "Plugin Name"
            }
          ],
          "plugin_outputs": {
            "plugin_id": {
              "name": "Plugin Name",
              "output": "string"
            }
          }
        }
      }
    }
  }
}

Nmap Standard Parsed Output

{
  "context": {
    "scanner": "nmap",
    "scanner_version": "7.97",
    "scan_start": "string",
    "scan_end": "string",
    "scan_duration": "5m 30s",
    "scan_type": "syn"
  },
  "stats": {
    "hosts": {
      "total": 50,
      "unique_ips": 50,
      "by_status": {"up": 45, "down": 5}
    },
    "ports": {
      "by_status": {
        "open": 1203
      },
      "by_port": {
        "tcp/135": 50,
      }
    },
    "services": {
      "total": 150,
      "by_service": {"http": 25, "https": 20, "ssh": 30}
    }
  },
  "hosts": {
    "1": {
      "ip": "192.168.1.100",
      "hostname": "server.example.com",
      "status": "up",
      "ports": {
        "tcp/443": {
          "port_id": "443",
          "protocol": "tcp",
          "status": "open",
          "service_name": "https",
          "service_details": {
            "product": "Microsoft Windows RPC",
            "method": "probed",
            "conf": "10",
            "combined_info": "Microsoft Windows RPC"
          },
          "script_output": {}
        }
      },
      "port_summary": {
        "open": 15,
        "closed": 0,
        "filtered": 0
      }
    }
  },
  "services": {
    "service_1": {
      "host_ip": "10.10.0.12",
      "host_hostname": "",
      "port": "tcp/135",
      "port_status": "open",
      "service_name": "msrpc",
      "service_details": {
        "product": "Microsoft Windows RPC",
        "method": "probed",
        "conf": "10",
        "combined_info": "Microsoft Windows RPC"
      },
      "script_output": {}
    }
  }
}

Nmap Flat JSON Output (Legacy Compatibility)

[
  {
    "fqdn": "example.com",
    "ip": "192.168.1.1",
    "port": "TCP/80",
    "port_status": "open",
    "service": "http",
    "detailed_service_info": {
      "product": "nginx",
      "version": "1.18.0",
      "combined_info": "nginx 1.18.0",
      "extrainfo": "Ubuntu",
      "method": "probed",
      "conf": "10"
    },
    "script_output": {
      "http-server-header": "nginx/1.18.0 (Ubuntu)",
      "http-title": "Welcome to nginx!"
    }
  },
]

Nessus API-Ready Output (with Entity Limiting)

[
  {
    "type": "stock",
    "finding_id": 999,
    "affected_entities": "<p>192.168.1.100:443<br />server.example.com</p>"
  },
  {
    "type": "stock", 
    "finding_id": 1001,
    "affected_entities": "<p>Please refer to external document named 'replaceMe'.csv</p>" // This is what happens when you use -el and the result is > than your int
  }
]

🔧 Framework Extension

YAPP is designed as an extensible framework. Adding support for new pentesting tools follows a consistent pattern:

Create parser class in core/your_tool_parser.py
Update file detection in utils/file_utils.py
Add tool support to core/processor.py
Update CLI arguments and display functions
Export in module __init__.py files

See Module Expansion Guide for detailed instructions.

Supported Tools:

✅ Nessus (.nessus XML files)
✅ Nmap (.xml XML files)
🔄 Framework ready for: Masscan, Nuclei, OpenVAS, and more

📈 Roadmap

Current Version Features:

Future Enhancements:

Enhanced type annotations
Additional tool parsers
Advanced filtering and querying
Intergrate functionality from Nmap-Analysis
Intergrate functionality from NessCIS
(Maybe) AI reporting (finding + executive summary/consultants comments) using Fabric or something similar.

🤝 Contributing

We welcome contributions! See Module Expansion Guide for adding new parsers.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Follow existing patterns for consistency
Add tests and documentation
Submit a Pull Request

Key Design Principles:

KISS: Keep implementations simple and readable
DRY: Modular, reusable components
No External Dependencies: Pure Python implementation
Extensible: Framework-based architecture for easy expansion

📄 License

This project is licensed under the GNU Affero General Public License v3.0 - see the LICENSE file for details.

🙏 Acknowledgments

Built for the pentesting and security community.
Inspired by the need for a clean, dependency-free, multi-tool parser framework. We know that most companies skimp on internal R&D.

YAPP: Yet Another Pentest Parser - A unified framework for parsing pentesting tool outputs! 🔧🐍✨

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.github/badges		.github/badges
examples		examples
input		input
output		output
yapp		yapp
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

License

FlyingPhish/YetAnotherPentestParser

Folders and files

Latest commit

History

Repository files navigation

YAPP - Yet Another Pentest Parser

🎯 Why YAPP Exists

✨ Features

🔧 Multi-Tool Support

🖥️ Dual Interface Design

📊 Advanced Nessus Processing

🗺️ Comprehensive Nmap Support

🎯 Intelligence & Analytics

⚡ Performance

🚀 Installation

For CLI Usage Only

For Programmatic Usage

Upgrading

💡 Usage

🖥️ Command Line Interface

Basic parsing (auto-detects file type):

Nessus with consolidation and API formatting:

Generate Excel report from consolidated findings (separate run):

Nmap with port filtering and flat JSON:

Custom output with specific file type:

Display results without saving files:

🐍 Python Library

🏗️ Project Structure

🔬 Nessus Consolidation Engine

Features:

Common Consolidation Rules:

📊 Excel Report Generation

Features:

Workflow:

🔬 Nessus Consolidation Engine + API Formatter

🔬 Nmap Parser

📋 Output Formats

Nessus Standard Parsed Output

Nessus Consolidated Output (with -c flag)

Nmap Standard Parsed Output

Nmap Flat JSON Output (Legacy Compatibility)

Nessus API-Ready Output (with Entity Limiting)

🔧 Framework Extension

Supported Tools:

📈 Roadmap

Current Version Features:

Future Enhancements:

🤝 Contributing

Key Design Principles:

📄 License

🙏 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages