Skip to content

Hypothesize-Tech/costkatana-cli

Repository files navigation

Cost Katana CLI πŸ₯·

AI in your terminal. Costs on your screen.

Chat with GPT-4, Claude, Geminiβ€”all from one command. See exactly what you spend, in real-time.


πŸš€ Get Started in 30 Seconds

Step 1: Install

npm install -g cost-katana-cli

Step 2: Initialize

cost-katana init

Step 3: Chat

cost-katana chat
You: Hello!
AI: Hi! How can I help you today?
πŸ’° Cost: $0.0001

You: Write me a Python function to sort a list
AI: Here's a Python function...
πŸ’° Session: $0.0023

That's it. You're now chatting with AI and tracking every cent. Usage and cost tracking is always onβ€”there is no option to disable it (required for usage attribution and cost visibility).


πŸ“– Tutorial: Master the CLI

Part 1: Interactive Chat

Start a conversation that remembers context:

cost-katana chat

CLI flags (before you start): use --thinking to enable Claude extended thinking on supported models (the gateway sizes the reasoning budget per task). Optionally set --thinking-effort to low, medium, high, or max (adaptive effort on Opus 4.6/4.7 and Sonnet 4.6).

cost-katana chat --thinking
cost-katana chat --thinking --thinking-effort high

In-session commands:

Command Action
help Show all commands
cost Display session cost
models Switch AI model
clear Clear chat history
save Export conversation
quit Exit chat

Part 2: Quick Questions

Skip chat mode for one-off questions:

# Get an instant answer
cost-katana ask "What is the capital of France?"

# Save output to file
cost-katana ask "Explain recursion" --output answer.md

# Use a specific model
cost-katana ask "Write a haiku" --model gpt-4

# Claude extended thinking (supported models only; effort optional)
cost-katana ask "Prove this inequality step by step" --thinking
cost-katana ask "Design a migration plan" --thinking --thinking-effort medium

Part 3: Choose Your Model

# Use Claude for creative writing
cost-katana chat --model claude-3-sonnet

# Use GPT-4 for complex reasoning
cost-katana chat --model gpt-4

# Use GPT-3.5 for simple tasks (10x cheaper)
cost-katana chat --model gpt-3.5-turbo

Part 4: Add a System Prompt

Shape the AI's personality:

# Code reviewer
cost-katana chat --system "You are a senior developer. Be concise and critical."

# Writing assistant
cost-katana chat --system "You are a professional copywriter. Focus on clarity."

# Tutor
cost-katana chat --system "You are a patient teacher. Explain concepts step by step."

Part 5: Enable Cost Optimization

# Cortex: 40-75% savings on long content
cost-katana chat --cortex

# Caching: 100% savings on repeated questions
cost-katana chat --cache

# Both together
cost-katana chat --cortex --cache

🎯 Essential Commands

Chat

cost-katana chat                              # Start interactive chat
cost-katana chat --model claude-3-sonnet      # Use specific model
cost-katana chat --system "Be concise"        # Add system prompt
cost-katana chat --cortex                     # Enable optimization
cost-katana chat --thinking                   # Claude extended thinking (supported models)
cost-katana chat --thinking --thinking-effort high

Ask

cost-katana ask "Your question"               # Quick answer
cost-katana ask "Question" --output file.md   # Save to file
cost-katana ask "Question" --model gpt-4      # Use specific model
cost-katana ask "Hard reasoning task" --thinking
cost-katana ask "Hard reasoning task" --thinking --thinking-effort medium

Analyze

cost-katana analyze                           # View spending summary
cost-katana analyze --days 7                  # Last 7 days
cost-katana analyze --export costs.csv        # Export to CSV

Models

cost-katana models                            # List all models
cost-katana models --provider openai          # Filter by provider
cost-katana models --prices                   # Show pricing

Compare

cost-katana compare "Your prompt" --models gpt-4,claude-3-sonnet,gemini-pro

Budget

cost-katana budget set --daily 10             # Set $10/day limit
cost-katana budget status                     # Check remaining budget
cost-katana budget alerts                     # View alerts

Config

cost-katana config                            # View current config
cost-katana config set model gpt-4            # Change default model
cost-katana config set temperature 0.7        # Set creativity level
cost-katana config set daily-limit 5          # Set spending limit

πŸ€– Supported Models

Provider Models
OpenAI GPT-5, GPT-4, GPT-4 Turbo, GPT-4o, GPT-3.5 Turbo, O1, O3
Anthropic Claude Sonnet 4.5, Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus
Google Gemini 2.5 Pro, Gemini 2.5 Flash, Gemini 1.5 Pro, Gemini 1.5 Flash
AWS Bedrock Claude, Titan, Mistral, Nova, Meta Llama (including Llama 4 Scout on Bedrock)
Others xAI Grok, DeepSeek, Mistral AI, Cohere, Meta Llama

Run cost-katana models for the complete list with pricing.


βš™οΈ Configuration

Environment Variables

# Option 1: Cost Katana API Key (Recommended)
export COST_KATANA_API_KEY="dak_your_key"
export COST_KATANA_MODEL="gpt-4"

# Option 2: Direct Provider Keys (self-hosted)
export OPENAI_API_KEY="sk-..."          # Required for GPT models
export GEMINI_API_KEY="..."             # Required for Gemini models
export AWS_ACCESS_KEY_ID="..."          # For AWS Bedrock
export AWS_SECRET_ACCESS_KEY="..."

⚠️ Self-hosted users: You must provide your own OpenAI/Gemini API keys.


πŸ’‘ Real-World Examples

Code Review Assistant

$ cost-katana chat --system "You are a senior developer. Be concise."

You: Review this code: [paste code]
AI: Issues found:
    1. Missing error handling on line 15
    2. Inefficient loopβ€”use map() instead
    3. Variable 'x' should be descriptive
πŸ’° Cost: $0.0045

You: How do I fix issue #2?
AI: Replace the for loop with: const results = items.map(item => transform(item));
πŸ’° Session: $0.0067

Content Writer with Optimization

$ cost-katana chat --model gpt-4 --cortex

You: Write a blog post about machine learning trends in 2024
AI: [Generates comprehensive 1500-word post]
πŸ’° Cost: $0.0123 (saved $0.041 with Cortex!)

Quick Research

$ cost-katana ask "Best Node.js weather API libraries"
> Popular options: axios + OpenWeatherMap, node-fetch + WeatherAPI...
πŸ’° Cost: $0.0002

$ cost-katana ask "Python datetime format examples" --output cheatsheet.md
βœ… Saved to cheatsheet.md
πŸ’° Cost: $0.0003

Reasoning-heavy questions (Claude extended thinking)

Use --thinking for one-shot asks or an entire chat session when you want visible extended reasoning on models that support it. Tune depth with --thinking-effort (low | medium | high | max).

$ cost-katana ask "Walk through the edge cases for this API design" --thinking --thinking-effort medium
> [Answer with deeper reasoning where the model supports it]
πŸ’° Cost: $0.00xx

$ cost-katana chat --thinking --system "You are a careful reviewer."
You: ...

Model Cost Comparison

$ cost-katana compare "Explain quantum computing" --models gpt-4,gpt-3.5-turbo,gemini-pro

πŸ“Š Cost Comparison

Model              Cost        Tokens    Latency
─────────────────────────────────────────────────
gpt-4              $0.0120     450       2.3s
gpt-3.5-turbo      $0.0012     420       0.8s
gemini-pro         $0.0003     435       1.1s

πŸ’‘ Recommendation: gemini-pro (40x cheaper than gpt-4)

πŸ’° Cost Optimization Tips

Strategy Savings Command
Use GPT-3.5 for simple tasks 90% --model gpt-3.5-turbo
Enable Cortex for long content 40-75% --cortex
Cache repeated queries 100% --cache
Use Gemini for high-volume 95% --model gemini-pro
# ❌ Expensive
cost-katana chat --model gpt-4

# βœ… Smart: Match model to task
cost-katana chat --model gpt-3.5-turbo

# βœ… Smarter: Add optimization
cost-katana chat --model gpt-3.5-turbo --cortex --cache

πŸ”§ Troubleshooting

"API key not found"

cost-katana init
# or
export COST_KATANA_API_KEY="dak_your_key"

"Model not available"

cost-katana models                    # See available models
cost-katana chat --model gpt-3.5-turbo  # Try alternative

"Rate limit exceeded"

# CLI auto-retries. If persistent:
cost-katana chat --model claude-3-haiku  # Switch provider

πŸ“Š Dashboard Integration

All CLI usage syncs to costkatana.com/dashboard:

  • Real-time cost tracking
  • Usage breakdown by model
  • Daily/weekly/monthly stats
  • Budget alerts
  • Optimization recommendations

πŸ“š More Examples

Explore 45+ complete examples:

πŸ”— github.com/Hypothesize-Tech/costkatana-examples

Section Description
CLI Examples Complete CLI guides
Cost Tracking Track costs across providers
Semantic Caching 30-40% cost reduction
Workflows Multi-step AI orchestration

πŸ†š Why Cost Katana CLI?

Feature Traditional CLIs Cost Katana CLI
Setup Multiple API keys One command: cost-katana init
Providers One per tool All providers, one command
Cost tracking ❌ βœ… Real-time
Optimization ❌ βœ… 40-75% savings
Dashboard ❌ βœ… Full analytics

πŸ“ž Support

Channel Link
Documentation docs.costkatana.com/cli
Dashboard costkatana.com
GitHub github.com/Hypothesize-Tech/costkatana-cli
Discord discord.gg/D8nDArmKbY
Email support@costkatana.com

πŸ“„ License

MIT Β© Cost Katana


Start chatting with AI in your terminal πŸ₯·

npm install -g cost-katana-cli
cost-katana init
cost-katana chat