You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.
A prompt-aware LLM router that predicts which models can complete each request, then selects the cheapest capable one: 53.2% lower cost and +1.9 pts completion on our tested dataset.
Energy-efficient AI prompt routing system — sends simple prompts to lightweight models and reserves larger models for complex reasoning to cut compute, latency, and energy.
claude-router is a local prompt router that picks the right Claude model tier and prepends the right scaffold using local embeddings before you call the API. A deterministic routing layer for eval, research, content, and review prompts that helps teams stop overspending on Sonnet and Opus when Haiku plus structure is enough.
Intelligent LLM request router for Azure OpenAI — automatically routes prompts to GPT-4o-mini or GPT-4o based on complexity, task type, environment, and budget. Cuts AI API costs by 60-80% with zero impact on application quality.
Intelligent LLM router that dynamically routes prompts between local Ollama (Qwen) and cloud models (Gemini) using complexity scoring, semantic caching, and cost-aware decisioning.
Define your LLM providers once, then let llmroute dynamically route every prompt to the right model — a smart CLI switchboard for multi-provider AI workflows.
Intelligent LLM router that reduces AI API costs by up to 60% through smart model selection and caching. FastAPI service with multi-provider support (Gemini, Claude, OpenRouter) and Claude Desktop MCP integration.