vram-calculator

Here are 11 public repositories matching this topic...

hlpun / Train-in-Silence

The first Task-Aware MCP server and automated VRAM calculator for LLM fine-tuning. Instantly snipe the cheapest, fastest GPUs across 10+ cloud providers.

python mcp cost-optimization fine-tuning llm mcp-server claude-code vram-calculator gpu-pricing

Updated May 8, 2026
Python

click6067-ship-it / fitllm-engine

Star

Accurate LLM memory/VRAM calculator — models sliding-window/linear/MoE attention & heterogeneous head_dim where naive calculators are 4–11× off. Apple Silicon + NVIDIA RTX · GGUF · F16/Q8/Q4 KV-cache quant · one MIT file. Powers fitllm.run.

inference nvidia moe quantization gemma mlx vram kv-cache apple-silicon llm llama-cpp local-llm ollama qwen gguf localllama memory-calculator vram-calculator

Updated Jun 4, 2026
JavaScript

PythonicVarun / canirun

Sponsor

Star

🚀 A lightweight CLI to estimate hardware requirements and quantization compatibility for Hugging Face models.

machine-learning python-cli quantization huggingface llm local-ai vram-calculator

Updated Jan 25, 2026
Python

2363186247 / MakeuownTools

Star

A collection of serverless, blazing-fast web tools: Docker Compose Generator, LLM VRAM Calculator, NAS RAID Capacity, and Nginx Reverse Proxy Builder.

selfhosted homelab docker-compose-generator pseo llm-tools satisfactory-calculator raid-calculator vram-calculator zfs-calculator reverse-proxy-generator off-grid-solar nginx-config-generator

Updated May 24, 2026
Astro

artvandelay / api-vs-selfhost-skill

Star

Anthropic-standard Skill — decide API-vs-self-host LLM costs and fine-tune ROI from any agent context (Claude Code, Cursor, Codex). Live GPU+API prices, deterministic local math.

self-hosting lora fine-tuning finetuning llm llm-cost agent-skills claude-code vram-calculator claude-skills claude-skill agent-skill codex-skill skill-md cursor-skills cursor-skill gpu-pricing llm-cost-calculator

Updated May 28, 2026
Python

joe0731 / hf_vram_calc

Star

A CLI tool for estimating GPU VRAM requirements for Hugging Face models, supporting various data types, parallelization strategies, and fine-tuning scenarios like LoRA.

gpu-memory vram huggingface pipeline-parallelism memory-estimation huggingface-models hugging-face-transformers huggingface-datasets vram-monitoring vram-calculator vram-memory-estimation

Updated Oct 22, 2025
Python

ShibaMeanu / HuggingModels

Star

macOS menu bar tool to explore Hugging Face models, detect GGUF/Safetensors configs, and calculate precise VRAM footprint and KV cache overhead.

macos swift ai quantization macos-app swiftui menu-bar-app huggingface llm safetensors gguf vram-calculator

Updated May 14, 2026
Swift

pipe1os / modelinfo-cli

Star

A lightweight CLI tool to inspect ML checkpoints (.safetensors, .gguf, .pt) and calculate inference VRAM, multi-GPU memory splits, and vLLM serving capacity.