Cost Katana CLI 🥷

AI in your terminal. Costs on your screen.

Chat with GPT-4, Claude, Gemini—all from one command. See exactly what you spend, in real-time.

🚀 Get Started in 30 Seconds

Step 1: Install

npm install -g cost-katana-cli

Step 2: Initialize

cost-katana init

Step 3: Chat

cost-katana chat

You: Hello!
AI: Hi! How can I help you today?
💰 Cost: $0.0001

You: Write me a Python function to sort a list
AI: Here's a Python function...
💰 Session: $0.0023

That's it. You're now chatting with AI and tracking every cent. Usage and cost tracking is always on—there is no option to disable it (required for usage attribution and cost visibility).

📖 Tutorial: Master the CLI

Part 1: Interactive Chat

Start a conversation that remembers context:

cost-katana chat

CLI flags (before you start): use --thinking to enable Claude extended thinking on supported models (the gateway sizes the reasoning budget per task). Optionally set --thinking-effort to low, medium, high, or max (adaptive effort on Opus 4.6/4.7 and Sonnet 4.6).

cost-katana chat --thinking
cost-katana chat --thinking --thinking-effort high

In-session commands:

Command	Action
`help`	Show all commands
`cost`	Display session cost
`models`	Switch AI model
`clear`	Clear chat history
`save`	Export conversation
`quit`	Exit chat

Part 2: Quick Questions

Skip chat mode for one-off questions:

# Get an instant answer
cost-katana ask "What is the capital of France?"

# Save output to file
cost-katana ask "Explain recursion" --output answer.md

# Use a specific model
cost-katana ask "Write a haiku" --model gpt-4

# Claude extended thinking (supported models only; effort optional)
cost-katana ask "Prove this inequality step by step" --thinking
cost-katana ask "Design a migration plan" --thinking --thinking-effort medium

Part 3: Choose Your Model

# Use Claude for creative writing
cost-katana chat --model claude-3-sonnet

# Use GPT-4 for complex reasoning
cost-katana chat --model gpt-4

# Use GPT-3.5 for simple tasks (10x cheaper)
cost-katana chat --model gpt-3.5-turbo

Part 4: Add a System Prompt

Shape the AI's personality:

# Code reviewer
cost-katana chat --system "You are a senior developer. Be concise and critical."

# Writing assistant
cost-katana chat --system "You are a professional copywriter. Focus on clarity."

# Tutor
cost-katana chat --system "You are a patient teacher. Explain concepts step by step."

Part 5: Enable Cost Optimization

# Cortex: 40-75% savings on long content
cost-katana chat --cortex

# Caching: 100% savings on repeated questions
cost-katana chat --cache

# Both together
cost-katana chat --cortex --cache

🎯 Essential Commands

Chat

cost-katana chat                              # Start interactive chat
cost-katana chat --model claude-3-sonnet      # Use specific model
cost-katana chat --system "Be concise"        # Add system prompt
cost-katana chat --cortex                     # Enable optimization
cost-katana chat --thinking                   # Claude extended thinking (supported models)
cost-katana chat --thinking --thinking-effort high

Ask

cost-katana ask "Your question"               # Quick answer
cost-katana ask "Question" --output file.md   # Save to file
cost-katana ask "Question" --model gpt-4      # Use specific model
cost-katana ask "Hard reasoning task" --thinking
cost-katana ask "Hard reasoning task" --thinking --thinking-effort medium

Analyze

cost-katana analyze                           # View spending summary
cost-katana analyze --days 7                  # Last 7 days
cost-katana analyze --export costs.csv        # Export to CSV

Models

cost-katana models                            # List all models
cost-katana models --provider openai          # Filter by provider
cost-katana models --prices                   # Show pricing

Compare

cost-katana compare "Your prompt" --models gpt-4,claude-3-sonnet,gemini-pro

Budget

cost-katana budget set --daily 10             # Set $10/day limit
cost-katana budget status                     # Check remaining budget
cost-katana budget alerts                     # View alerts

Config

cost-katana config                            # View current config
cost-katana config set model gpt-4            # Change default model
cost-katana config set temperature 0.7        # Set creativity level
cost-katana config set daily-limit 5          # Set spending limit

🤖 Supported Models

Provider	Models
OpenAI	GPT-5, GPT-4, GPT-4 Turbo, GPT-4o, GPT-3.5 Turbo, O1, O3
Anthropic	Claude Sonnet 4.5, Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus
Google	Gemini 2.5 Pro, Gemini 2.5 Flash, Gemini 1.5 Pro, Gemini 1.5 Flash
AWS Bedrock	Claude, Titan, Mistral, Nova, Meta Llama (including Llama 4 Scout on Bedrock)
Others	xAI Grok, DeepSeek, Mistral AI, Cohere, Meta Llama

Run cost-katana models for the complete list with pricing.

⚙️ Configuration

Environment Variables

# Option 1: Cost Katana API Key (Recommended)
export COST_KATANA_API_KEY="dak_your_key"
export COST_KATANA_MODEL="gpt-4"

# Option 2: Direct Provider Keys (self-hosted)
export OPENAI_API_KEY="sk-..."          # Required for GPT models
export GEMINI_API_KEY="..."             # Required for Gemini models
export AWS_ACCESS_KEY_ID="..."          # For AWS Bedrock
export AWS_SECRET_ACCESS_KEY="..."

⚠️ Self-hosted users: You must provide your own OpenAI/Gemini API keys.

💡 Real-World Examples

Code Review Assistant

$ cost-katana chat --system "You are a senior developer. Be concise."

You: Review this code: [paste code]
AI: Issues found:
    1. Missing error handling on line 15
    2. Inefficient loop—use map() instead
    3. Variable 'x' should be descriptive
💰 Cost: $0.0045

You: How do I fix issue #2?
AI: Replace the for loop with: const results = items.map(item => transform(item));
💰 Session: $0.0067

Content Writer with Optimization

$ cost-katana chat --model gpt-4 --cortex

You: Write a blog post about machine learning trends in 2024
AI: [Generates comprehensive 1500-word post]
💰 Cost: $0.0123 (saved $0.041 with Cortex!)

Quick Research

$ cost-katana ask "Best Node.js weather API libraries"
> Popular options: axios + OpenWeatherMap, node-fetch + WeatherAPI...
💰 Cost: $0.0002

$ cost-katana ask "Python datetime format examples" --output cheatsheet.md
✅ Saved to cheatsheet.md
💰 Cost: $0.0003

Reasoning-heavy questions (Claude extended thinking)

Use --thinking for one-shot asks or an entire chat session when you want visible extended reasoning on models that support it. Tune depth with --thinking-effort (low | medium | high | max).

$ cost-katana ask "Walk through the edge cases for this API design" --thinking --thinking-effort medium
> [Answer with deeper reasoning where the model supports it]
💰 Cost: $0.00xx

$ cost-katana chat --thinking --system "You are a careful reviewer."
You: ...

Model Cost Comparison

$ cost-katana compare "Explain quantum computing" --models gpt-4,gpt-3.5-turbo,gemini-pro

📊 Cost Comparison

Model              Cost        Tokens    Latency
─────────────────────────────────────────────────
gpt-4              $0.0120     450       2.3s
gpt-3.5-turbo      $0.0012     420       0.8s
gemini-pro         $0.0003     435       1.1s

💡 Recommendation: gemini-pro (40x cheaper than gpt-4)

💰 Cost Optimization Tips

Strategy	Savings	Command
Use GPT-3.5 for simple tasks	90%	`--model gpt-3.5-turbo`
Enable Cortex for long content	40-75%	`--cortex`
Cache repeated queries	100%	`--cache`
Use Gemini for high-volume	95%	`--model gemini-pro`

# ❌ Expensive
cost-katana chat --model gpt-4

# ✅ Smart: Match model to task
cost-katana chat --model gpt-3.5-turbo

# ✅ Smarter: Add optimization
cost-katana chat --model gpt-3.5-turbo --cortex --cache

🔧 Troubleshooting

"API key not found"

cost-katana init
# or
export COST_KATANA_API_KEY="dak_your_key"

"Model not available"

cost-katana models                    # See available models
cost-katana chat --model gpt-3.5-turbo  # Try alternative

"Rate limit exceeded"

# CLI auto-retries. If persistent:
cost-katana chat --model claude-3-haiku  # Switch provider

📊 Dashboard Integration

All CLI usage syncs to costkatana.com/dashboard:

Real-time cost tracking
Usage breakdown by model
Daily/weekly/monthly stats
Budget alerts
Optimization recommendations

📚 More Examples

Explore 45+ complete examples:

🔗 github.com/Hypothesize-Tech/costkatana-examples

Section	Description
CLI Examples	Complete CLI guides
Cost Tracking	Track costs across providers
Semantic Caching	30-40% cost reduction
Workflows	Multi-step AI orchestration

🆚 Why Cost Katana CLI?

Feature	Traditional CLIs	Cost Katana CLI
Setup	Multiple API keys	One command: `cost-katana init`
Providers	One per tool	All providers, one command
Cost tracking	❌	✅ Real-time
Optimization	❌	✅ 40-75% savings
Dashboard	❌	✅ Full analytics

📞 Support

Channel	Link
Documentation	docs.costkatana.com/cli
Dashboard	costkatana.com
GitHub	github.com/Hypothesize-Tech/costkatana-cli
Discord	discord.gg/D8nDArmKbY
Email	support@costkatana.com

📄 License

Start chatting with AI in your terminal 🥷

npm install -g cost-katana-cli
cost-katana init
cost-katana chat

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.github/workflows		.github/workflows
.husky		.husky
bin		bin
src		src
tests		tests
.eslintrc.js		.eslintrc.js
.gitignore		.gitignore
.lintstagedrc.json		.lintstagedrc.json
.npmignore		.npmignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
README.md		README.md
jest.config.js		jest.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

Cost Katana CLI 🥷

🚀 Get Started in 30 Seconds

Step 1: Install

Step 2: Initialize

Step 3: Chat

📖 Tutorial: Master the CLI

Part 1: Interactive Chat

Part 2: Quick Questions

Part 3: Choose Your Model

Part 4: Add a System Prompt

Part 5: Enable Cost Optimization

🎯 Essential Commands

Chat

Ask

Analyze

Models

Compare

Budget

Config

🤖 Supported Models

⚙️ Configuration

Environment Variables

💡 Real-World Examples

Code Review Assistant

Content Writer with Optimization

Quick Research

Reasoning-heavy questions (Claude extended thinking)

Model Cost Comparison

💰 Cost Optimization Tips

🔧 Troubleshooting

"API key not found"

"Model not available"

"Rate limit exceeded"

📊 Dashboard Integration

📚 More Examples

🆚 Why Cost Katana CLI?

📞 Support

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages