Skip to content

laboratoiresonore/spellcaster

Repository files navigation

Spellcaster

Spellcaster

Type "hair" in GIMP. It selects the hair. Perfectly. In one second.
69 AI tools, one menu. GIMP · DaVinci Resolve · Darktable · chat UI · 100% local · zero config.

Spellcaster auto-detects your installed checkpoints and routes each generation to the right architecture. You never pick a sampler.

Windows   macOS   Linux

Releases License Stars Downloads Support on Patreon

Save a day of workI want to…5-click showcaseInstallThree UIsFAQDeep dive


📣 Status — April 2026

🆕 NewsDaVinci Resolve plugin is out — test it and report all bugs!
✅ What worksGIMP plugin is a powerhouse and a far better interface than ComfyUI for anything related to images
🔧 Current focusGlobal debugging and optimizing
⏭ NextSignal bridge  ·  Moar plugins / frontends (make a request)

Save a day of work, right now

Three things Spellcaster does in one click that most people spend an afternoon — or a whole day — on.


AI Select with SAM3 — earring mask

⚡ Let it make the picks

Type what you want selected — earring, hair, left shoe. Perfect mask, one second. No lasso, no zooming, no quick-mask dance.

Same trick runs everywhere else. Type what you want made, and Spellcaster picks the right model for the job, rewrites your prompt in that model's own language, filters out incompatible add-ons, sets up the guidance. You describe the outcome; it chooses the tools.

🖌️ GIMP · Select > AI Select by Description  ·  🧙 Guild · smart generate  ·  how the auto-picks work →

🔮 Resurrect — and rewrite — any photo

SUPIR restoration, one click. Damaged, compressed, low-res — faces, skin, texture all come back. Works on grandma's scanned photos and on the JPEGs you downloaded in 2006.

Then: do whatever you want to the picture. Point at the power line and it's gone. Say "add a coffee cup on the table" and it lands, matching light and perspective. Change the time of day. Colour in a black-and-white. Extend the canvas past its edges. Swap the face. Re-pose the subject. Every edit most photo apps took a decade to learn — done by describing it in English.

🖌️ GIMP · restore, erase, add, re-light  ·  📷 Darktable · batch across a whole shoot

SUPIR Restoration — before / after
Wan 2.2 Image-to-Video — breathing portrait

🎬 Animate a still — and drop it on the timeline

Wan 2.2 Image-to-Video. Pick a motion preset — zoom, parallax, a breathing portrait, 26 of them. The still moves. 2–5 seconds, 720p, rendered overnight if you want.

Or skip the still. Type a paragraph and LTX 2.3 gives you three seconds of cinema — fireballs, rain, a neon fly-through, a slow dolly-in. Straight from words to video.

Then the good part: Send to Resolve. The clip lands in your DaVinci bin, ready to cut. Or plan a whole sequence in the Guild — a Cinematographer wizard scaffolds twelve shots, renders them while you get coffee, and the full reel is waiting in Resolve's media pool when you get back.

🧙 Guild · Shotboard  ·  🖌️ GIMP · LTX + Wan  ·  🎬 Resolve · bridge plugin


POWERED BYfull matrix →

🎨 Image  Flux 2 Klein · Flux 1 Dev · Flux Kontext · Chroma · SDXL · Illustrious · Pony · Playground · SD 1.5 · SDXL Turbo · Z-Image Turbo
🎬 Video  Wan 2.2 I2V · LTX 2.3 · SeedVR
🧠 Helpers  SAM 3 · NormalCrafter · IC-Light · ReActor · Ollama

→ 69 tools across 19 models. See the full matrix in DEEP_DIVE.md


I want to…

Pick the thing you want. One tool per row. The Best in column tells you the fastest interface to launch it from — 🖌️ GIMP for pixel-accurate work, 📷 Darktable for RAW-first batching, 🧙 Wizard Guild for conversational prompts, 🎬 Resolve for timeline-aware video, 🎭 SillyTavern for in-chat renders.

AI Eraser Remove something from a photo
LaMa inpainting erases anything — tourists, power lines, ex-boyfriends — and fills the gap cleanly.
🖌️ GIMP — select + Ctrl+Alt+X ·
📷 Darktable — Send to GIMP first
Remove Background Cut out the subject
Three engines — rembg, BiRefNet (best for hair), BiRefNet Portrait. You get transparency. No cleanup needed.
🖌️ GIMPCtrl+Alt+B ·
🧙 Guild — "remove the background"
Colorize Colorize a B&W photo
Three engines — DDColor artistic/natural, or ControlNet + diffusion for the "restored family portrait" look.
🖌️ GIMP — one dialog ·
📷 Darktable — batch across shoots
IC-Light Change the lighting
IC-Light relighting. Pick "Golden Hour", "Neon", "Studio", 10 presets — the subject stays, the light changes.
🖌️ GIMP — Style menu, one preset click ·
🧙 Guild — "relight as sunset"
Face Restore Fix a face
Seven models — GPEN-2048, CodeFormer, GFPGAN, RestoreFormer++. Sharp eyes and skin back.
🖌️ GIMPCtrl+Alt+F ·
📷 Darktable — restore a whole folder
Face Identity Put your face on a character
ReActor, FaceID, PuLID, Flux 2 Headswap. Upload a reference, generate a new scene, the subject has your face.
🖌️ GIMP — upload + preset ·
🎭 ST/portrait in roleplay
Style Transfer Copy a style
IPAdapter style transfer. Point at any reference image — painting, photograph, illustration — and your image gets rewritten in that style.
🖌️ GIMP — reference image picker ·
🧙 Guild — drop a ref, chat it
SeedV2R Upscale Upscale with hallucinated detail
Nine upscalers: WaveSpeed SeedVR2 (best, 2K/4K), UltraSharp, RealESRGAN, Remacri, NMKD, Anime. Controllable hallucination.
🖌️ GIMPCtrl+Alt+U ·
📷 Darktable — Hybrid Blend panel
3D Normal Map Extract 3D surface geometry
NormalCrafter generates a 3D normal map from any 2D image. Use it in Blender, in game engines, or feed it back into Spellcaster to relight the scene.
🖌️ GIMP — Enhance ▸ 3D Normal Map ·
📷 Darktable — 3D / Relighting panel
Flux 2 Klein Edit with Flux 2 Klein
4-20 steps, photorealistic, best-in-class img2img and inpaint. Klein is the one people notice. 9 Klein-specific tools, all pre-tuned.
🖌️ GIMP — Flux 2 submenu, 9 tools ·
🧙 Guild — auto-routes via Portraitist wizard
Spellcaster — the mascot cat wizard Have fun and generate whatever the hell I want
Text to Image across 9 model families. Type "a cat wizard reading a spellbook." Get a cat wizard reading a spellbook. 25 scene presets.
🧙 Wizard Guild — just type it, the wizards route it ·
🖌️ GIMP — Text to Image dialog
Animate a still Animate a still into a clip
Wan 2.2 I2V (81 frames, 720p) or LTX 2.3. 26 motion presets — zoom, turntable, parallax, breathing portrait, falling petals.
🎬 Resolve — drop playhead, render to timeline ·
🖌️ GIMP — Image ▸ WAN I2V ·
🎭 ST/animate

Icon key · 🖌️ GIMP plugin · 📷 Darktable plugin · 🧙 Wizard Guild chat · 🎬 DaVinci Resolve plugin · 🎭 SillyTavern extension — all four talk to the same ComfyUI backend and share a gallery, so any result shows up everywhere.

Too many to list here. All 69 tools →


Five clicks, start to finish

A real pipeline, all built into the GIMP menu. Generate a scene → AI-select the subject → extract its 3D geometry → enhance the detail surgically → blend back in. Five menu items. Zero configuration. Takes longer to microwave a Hot Pocket — and unlike the Hot Pocket, the result doesn't make you question your life choices.

Step 1: Generate
1. Generate — SDXL creates the scene
Step 2: AI Select
2. AI Select — SAM3 isolates the statue
Step 3: Normal Map
3. 3D Normal Map — surface geometry extracted
Step 4: Enhance
4. Detail Enhance — surgical texture refinement on the isolated layer
Step 5: Blend Layers
5. Blend Layers — reintegrated into the original scene

Install

What you need

You need What it is Where to get it
ComfyUI The AI engine (runs in background, you never open it) github.com/comfyanonymous/ComfyUI
GIMP 3 Free image editor (the free Photoshop) gimp.org
A GPU with 4+ GB VRAM Runs the AI models You probably have one

No GPU on this machine? That's fine — if you have a gaming PC in another room, you can run ComfyUI there and Spellcaster on your laptop. Use the Remote Installer further down.

Get the installer

This is the one you want 99% of the time. Double-click and click Next a few times. It looks at your computer, downloads what's missing, sets up GIMP, and makes desktop shortcuts. About 370 MB — includes the wizard's own artwork (drawn locally by Spellcaster's own engine).

Windows   macOS   Linux

How to use it:

  1. Download it. Double-click. Wait. (First time, it asks Windows for permission — click Yes.)
  2. Click through the wizard. It auto-fills almost everything; you mostly just click Next. If something looks unfamiliar, leave it as the default — we picked smart defaults.
  3. Open GIMP. Go to the Filters menu → Spellcaster. Every AI tool is there. Pick one, click Generate, get a result.

That's the whole thing. If anything goes wrong, the installer tells you in plain English what went wrong and what to do.

Self-updating. Every Spellcaster installer below downloads the latest version from GitHub on every launch — you don't need to re-download new releases just to get bug fixes. Want to use the version baked into the .exe instead (e.g. you're offline)? Add --no-update when launching.

Comfortable with the command line? From source: git clone https://github.com/laboratoiresonore/spellcaster && cd spellcaster && python installer/install.py. Skip this line if you don't know what git is.


Other downloads (most people don't need these)

The main installer above does almost everything. These three smaller tools are for special situations:

Remote installer   Health checker   Repair tool

🌐 Remote Installer — for when ComfyUI lives on a different computer

Some people have a powerful PC with a beefy graphics card sitting in a closet, and use a quiet laptop on the couch. The Remote Installer is for that. It puts the GIMP plugin and chat UI on your laptop, but tells them "the actual AI brain is over there on the gaming PC". It also tries to find your gaming PC automatically.

Use it when: your ComfyUI is on a different computer on the same Wi-Fi/network as the one you're installing on. Skip it if ComfyUI is on the same machine you're installing onto — use the regular installer instead.

🩺 Health Checker — "is everything actually working?"

After installing, this tool runs a quick test on every feature you installed. It picks each one ("background remove", "face swap", "upscale"…), tries it for real on tiny test images, and tells you which ones work and which ones don't (and why). Takes about 1–5 minutes.

Use it when: you ran the installer and want to make sure everything actually works before opening GIMP. Or weeks later, when you've added new models to ComfyUI and want to confirm Spellcaster sees them. Or any time something feels broken.

🔧 Repair Tool — "the plugin disappeared from GIMP!"

Sometimes things break. GIMP doesn't show the Spellcaster menu anymore. Or a file got deleted. Or you tried to update and it half-worked. This tool re-downloads the GIMP plugin from GitHub and reinstalls it, without making you re-download every model again.

Use it when: Spellcaster used to work, then stopped, and you don't want to re-run the big installer. Try this first.


💡 Already installed Spellcaster, then added new models to ComfyUI?

You don't need to re-run the installer. Open GIMPFilters → Spellcaster → 🜍 Crypt → ↻ Refresh Models from Server. It re-checks your ComfyUI server and updates the model lists everywhere in about 2 seconds. New LoRAs, new checkpoints, new ControlNet files — all pulled in. The dropdowns in every Spellcaster dialog will show the new options the next time you open them.


Three ways to use it

GIMP plugin — select what to fix GIMP plugin — pick a preset GIMP plugin — result on a new layer

🖌️ GIMP plugin
The menu. 69 tools across Filters > Spellcaster. Select layers, paint masks, click a tool. Keyboard shortcuts (Ctrl+Alt+E/U/F/B/X) for the quick ones. Every output lands as a new layer. Non-destructive, reversible, composable.
Wizard Guild — chat UI Wizard Guild — second view Wizard Guild — third view

🧙 Wizard Guild
The chat UI. AI wizard characters walk you through every tool conversationally. "Restore this photo." "Make the sky orange." "Turn this into a video." No menus, no modes. How it works →
DaVinci Resolve plugin

🎬 DaVinci Resolve plugin
The timeline. Drop the playhead, type a prompt, get an LTX-2 clip back in the Media Pool. Gap-fill between clips with reference-aware rendering. Markers map to render profiles. How it works →

All three talk to the same ComfyUI backend. Every generated asset is visible from every interface. The mechanics →


💎 SillyTavern integration

SillyTavern — roleplay interface with Spellcaster

Spellcaster's SillyTavern plugin turns the roleplay surface into a fourth interface. Slash commands (/scene, /portrait, /animate) render Klein-2 stills and LTX-2 clips in-chat. A round-trip-safe Character Card Editor in GIMP lets you paint card art, edit the V2 metadata with LLM-scaffolded best-practice auto-optimization, and save spec-compliant chara PNGs without damaging the embedded data. SillyTavern upstream → · How it works →

⚗️ Experimental plugins — Photoshop, Krita, Blender, OBS Studio

Four additional plugins live in the repo but are not yet tested by the maintainer. They share the same plugin_base.SpellcasterPlugin as the main four and route through the Wizard Guild, so presence, telemetry, and cross-interface asset sharing work out of the box. Expect rough edges — please file issues / PRs if you run them.

Plugin Location Status
🎨 Photoshop (UXP panel) plugins/photoshop/ Smart Generate, img2img, Detail Hallucinate, Colorize B&W, Magic Eraser, AI Upscale, Remove Background, plus a ✨ Presets picker (product / portrait / social / background / detail / colorize) — all routed through the Guild's /api/run_builder. Selection-aware inpaint still requires the UXP batchPlay selection→bitmap dance and isn't wired yet.
🎨 Krita plugins/krita/ Full menu: txt2img, img2img, inpaint (uses the Krita selection as the mask), outpaint, IC-Light relight, 3D normal map, upscale, rembg, face restore, face swap from file, Detail Hallucinate, Colorize B&W, Magic Eraser, Style Transfer from file, LTX text-to-video, LTX image-to-video, WAN 2.2 image-to-video, plus a Presets… picker (anime, cinematic photo, concept art, textures, watercolour, oil-paint style, colorize, eraser, detail). Python-based; installs via pykrita.
🧊 Blender plugins/blender/ Sidebar panel in the 3D view with a Run Preset button at the top (PBR stone / wood / metal, sci-fi environment, fantasy landscape, HDRI skybox, character sheet, normal map, detail 4×). Operators: txt2img, img2img, outpaint, IC-Light, normal-map, Detail Hallucinate, Colorize, Magic Eraser, upscale, rembg, LTX T2V, LTX I2V, WAN I2V. Results land as Blender images (drop-in for material slots).
📺 OBS Studio plugins/obs/ Tools → Scripts → add spellcaster_obs.py. Text-first ops: generate scene backgrounds (Image source), transparent overlays (rembg on gen), short intro/BRB clips (LTX 2.3 text-to-video → Media source), Smart Generate (arch auto-pick), plus a Presets dropdown (cyberpunk / fantasy tavern / lo-fi / BRB / starting-soon / cyberpunk-flythrough / fantasy-reveal / abstract-particles) with a Run Selected Preset button. Canvas-input ops (img2img / inpaint / upscale-this-scene) intentionally skipped — OBS's Python API doesn't expose the preview pixel buffer cleanly.

All four feed the same SpeedCoach telemetry pipeline as the GIMP / Darktable / Resolve / SillyTavern plugins — dispatch rows land in dispatch_log.jsonl alongside every other frontend — so if you run them, you're helping tune the suggestion model for everyone.


For people who can't computer

If you can order food on your phone, you can use this. If you once successfully connected a printer on the first try, you're overqualified.

  • Installation? Automated. The installer sniffs your GPU, figures out what AI models your hardware can run, downloads them, installs everything, creates shortcuts. You click "Next" a few times.
  • Settings? Automated. Every tool has expert-tuned presets. Or run the Calibration Wizard — it shows you real images and asks "A or B?" Like an eye exam. Details →
  • Prompts? Automated. Type "a cat" — a local AI rewrites it into the flavour your model was trained on. SDXL wants tags. Flux wants paragraphs. Klein wants bullets. It does this for you. Details →
  • Model selection? Automated. The plugin detects what models are installed and picks the best one.
  • VRAM management? Automated. Video resolution auto-scales to fit your GPU. The LLM politely unloads itself during image generation. TeaCache acceleration silently injected.
  • Remote ComfyUI? Automated. Use spellcaster-remote-installer.exe — auto-discovers servers on your LAN, installs only the local plugins + Wizard Guild pointing at the remote.
  • Updates? Automated. The plugin checks GitHub on launch and silently patches itself.
  • Recovery? Automated. If an update corrupts the plugin, a 3-tier recovery system restores from backup, re-downloads from GitHub, or shows a visible error. GIMP never bricks.

Open GIMP. Go to the Spellcaster menu. Pick a tool. Click Generate. That's it.

Too intimidated by GIMP? The Wizard Guild is a chat interface where you just tell an AI wizard what you want. In English.


FAQ

What GPU do I need?

Any NVIDIA GPU with 4+ GB VRAM. AMD works too (ROCm/DirectML). The installer looks at your GPU and says "here's what you can run" — no guesswork. 4GB gets you the basics. 8GB is the sweet spot. 16GB unlocks the good stuff. 24GB and you're basically a wizard yourself.

Do I need to understand ComfyUI?

No. God no. That's the whole point. Every tool has expert-tuned presets. You never need to open ComfyUI, pick a sampler, write a negative prompt, or learn what "Euler ancestral CFG++ with Karras scheduling at 0.85 denoise" means. That sentence just gave you a headache. See? We saved you from that.

If you want to go deeper — every parameter is exposed, you can save custom presets, import raw ComfyUI workflows, and inject LoRAs. The power is there when you're ready for it.

Does anything leave my computer?

No. Nothing. Nada. Zero bytes. Your GPU does all the work, your images stay on your hard drive. The only network traffic is between GIMP and your own ComfyUI server, which can literally be localhost.

Can I use my own ComfyUI workflows?

Yes. Filters > Spellcaster Tools > Workflow Library runs any workflow JSON from GIMP.

ComfyUI on another machine?

Yes. The spellcaster-remote-installer.exe auto-discovers ComfyUI servers on your network (--scan), or pass the URL directly. Gaming PC in the closet, laptop on the couch — we have enabled laziness at an architectural level and we're proud of it. For multi-app coordination across multiple machines (Resolve on box A, GIMP on box B, SillyTavern on box C), see the optional Antenna service-mesh — most users never need it.

Can I have more than one LLM running at once?

Yes. ComfyUI's embedded LLM, Ollama, and a dedicated RP Kobold can all stay online together — SillyTavern keeps chatting with Kobold, the Guild uses ComfyUI for image prompts, Ollama handles install scaffolding. The pill picker decides who answers first; nobody gets stopped. Details →

How does the LoRA Shootout work?

Purpose-aware multi-sample renders. The LoRA registry is grouped into 20+ purpose buckets (skin_detail, style_photoreal, …), each with a subject-specific prompt (portrait / fullbody / macro / animal). You approve many LoRAs — not one winner — with keywords that the Guild auto-proposes when the keyword appears in a chat prompt. Auto-fallback tries up to 3 checkpoints of the same arch on failure. Scaffold details →

Can I talk to the Guild instead of typing?

Yes. Register a KoboldCpp in TTS/STT mode, then press-and-hold the 🎙️ button. Walkie-talkie — release to transcribe. Voice details →

Why "Spellcaster"?

Because every tool is a spell, every workflow is an incantation, your GPU is a familiar, and the entire project radiates the energy of someone who played too much D&D and then learned Python. Also "ComfyUI-GIMP-Middleware-With-69-Tools-And-A-Chat-UI-Full-Of-Wizards" didn't fit in the GitHub repo name.


Dig deeper

  • Full technical reference → DEEP_DIVE.md — all 69 tools enumerated, the 9-architecture registry, scaffold state machines, antenna service-mesh endpoints (optional, for advanced multi-machine setups), cross-interface backbone, prompt enhancement chain, privacy + boot safety details, every subsystem explained.
  • ComfyUI dependenciesDEPENDENCIES.md — 24 custom node packs Spellcaster uses, linked to upstream.

Love it? Share it.

Spellcaster is 100% free, 100% open-source, and 0% funded. If it saves you a day of clicking, the best way to pay it forward is to tell someone:

  • 🌟 Star the repo so more GIMP / Darktable / Resolve users find it
  • 📣 Post a screenshot of your first 5-click result with #Spellcaster on your favourite network
  • 🧙 Show a friend who spends too long on ComfyUI noodle graphs — 30 seconds to install, they'll thank you
  • 🐛 Open an issue if it broke, a PR if you fixed it, or a discussion if you just want to show off what you made
  • Tip the lab on Patreon if Spellcaster earned you an afternoon of sanity — it funds the next GPU and the next 5 a.m. debugging session

Word of mouth is the entire marketing budget. Make us famous.

Become a Patron


Credits & acknowledgements

Proudly vibe-coded as a pure pineapple-pen innovation. 🍍🖊️

Spellcaster doesn't reinvent the wheel — it duct-tapes together the best wheels the open-source AI community has ever built, then hides the duct tape behind a nice menu:

Core engine: ComfyUI by comfyanonymous — the node-based powerhouse that actually does everything

Models & architectures: Stability AI (SD 1.5, SDXL, SD3), Black Forest Labs (Flux), Flux 2 Klein, Wan 2.2, LTX Video, SeedVR2

Workflow pipelines: Elusarca's Klein 6-in-1 (Klein refiner, auto-inpaint, color match — used with permission), xb1n0ry's Comfy-Workflows (Wan 2.2 NAG + SLG, Klein 4-image-grid, Wan 2.2 block-swap low-VRAM, Qwen Image Edit 2509)

Face & identity: ReActor, IPAdapter, PuLID, ACE++, InsightFace, CodeFormer, GFPGAN, GPEN

Enhancement: Real-ESRGAN, SUPIR, IC-Light, DDColor, LaMa, NormalCrafter

Segmentation: SAM 2/3 (Meta), BiRefNet/RMBG, DepthAnything V3, Florence 2

ControlNet: ControlNet by lllyasviel, comfyui-controlnet-aux

Video: RIFE, GIMM-VFI, VHS, AnimateDiff

Acceleration: TeaCache, WaveSpeed/FBCache, LightX2V

LLM: Qwen3 (Alibaba), ComfyUI-QwenVL-Mod

Klein Enhancer: Flux2Klein-Enhancer — RefLatentController, TextRefBalance, ColorAnchor

Node packs: Impact Pack, ComfyUI-essentials, ComfyUI-GGUF, KJNodes, and dozens more — full list in DEPENDENCIES.md

Host apps: GIMP 3, Darktable, SillyTavern

Vibe coding assistant: Claude by Anthropic — wrote most of this while being yelled at


Made with unhealthy amounts of coffee, mass delusion, and a GPU that sounds like a jet engine.
If you've read this far, you're either installing it or writing a hate comment. Either way, we appreciate the engagement.

Love it? Share it. Star the repo. Fork it if you hate it. Either way, pass it on.