tover0314-w/opentypeless

English | 中文 | 日本語 | 한국어 | Español | Français | Deutsch | Português | Русский | العربية | हिन्दी | Italiano | Türkçe | Tiếng Việt | ภาษาไทย | Bahasa Indonesia | Polski | Nederlands

OpenTypeless

Open-source AI voice input for desktop. Speak naturally, get polished text in any app.

Whether you're writing emails, coding, chatting, or taking notes — just press a hotkey,
speak your mind, and OpenTypeless transcribes and polishes your words with AI,
then types them directly into whatever app you're using.

Why OpenTypeless?

OpenTypeless macOS Dictation Windows Voice Typing Whisper Desktop
AI text polishing ✅ Multiple LLMs
STT provider choice ✅ 6+ providers ❌ Apple only ❌ Microsoft only ❌ Whisper only
Works in any app ❌ Copy-paste
Translation mode
Open source ✅ MIT
Cross-platform ✅ Win/Mac/Linux ❌ Mac only ❌ Windows only
Custom dictionary
Self-hostable ✅ BYOK

Features

  • 🎙️ Global hotkey recording — hold-to-record or toggle mode
  • 💊 Floating capsule widget that stays on top
  • 🗣️ 6+ STT providers: Deepgram, AssemblyAI, Whisper, Groq, GLM-ASR, SiliconFlow
  • 🤖 Text polishing via multiple LLMs: OpenAI, DeepSeek, Claude, Gemini, Ollama, and more
  • ⚡ Streaming output — text appears as the LLM generates it
  • ⌨️ Keyboard simulation or clipboard output
  • 📝 Highlight text before recording to give the LLM context
  • 🌐 Translation mode: speak in one language, output in another (20+ languages)
  • 📖 Custom dictionary for domain-specific terms
  • 🔍 Per-app detection to adapt formatting
  • 📜 Local history with full-text search
  • 🌗 Dark / light / system theme
  • 🚀 Auto-start on login

Tip

Recommended Configuration for Best Experience

Stage | Provider | Model
🗣️ STT | Groq | whisper-large-v3-turbo
🤖 AI Polish | Google | gemini-2.5-flash

This combo delivers fast, accurate transcription with high-quality text polishing — and both offer generous free tiers.

Download

Download the latest version for your platform:

Download from Releases

Platform | File
Windows | .msi installer
macOS (Apple Silicon) | .dmg
macOS (Intel) | .dmg
Linux | .AppImage / .deb

Prerequisites

Getting Started

# Install dependencies
npm install

# Run in development mode
npm run tauri dev

# Build for production
npm run tauri build

The built application will be in src-tauri/target/release/bundle/.

Configuration

All settings are accessible from the in-app Settings panel:

  • Speech Recognition — choose STT provider and enter your API key
  • AI Polish — choose LLM provider, model, and API key
  • General — hotkey, output mode, theme, auto-start
  • Dictionary — add custom terms for better transcription accuracy
  • Scenes — prompt templates for different use cases

API keys are stored locally via tauri-plugin-store. No keys are sent to OpenTypeless servers — all STT/LLM requests go directly to the provider you configure.

Cloud (Pro) Option

OpenTypeless also offers an optional Pro subscription that provides managed STT and LLM quota so you don't need your own API keys. This is entirely optional — the app is fully functional with your own keys.

Learn more about Pro

BYOK (Bring Your Own Key) vs Cloud

Aspect | BYOK Mode | Cloud (Pro) Mode
STT | Your own API key (Deepgram, AssemblyAI, etc.) | Managed quota (10h/month)
LLM | Your own API key (OpenAI, DeepSeek, etc.) | Managed quota (~5M tokens/month)
Cloud dependency | None; all requests go directly to your provider | Requires connection to www.opentypeless.com
Cost | Pay your provider directly | $4.99/month subscription

All core features (recording, transcription, AI polish, keyboard/clipboard output, dictionary, history) work in BYOK mode without any connection to OpenTypeless servers.
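
The routing difference between the two modes can be sketched in a few lines of TypeScript. This is an illustration under assumptions, not the project's implementation: `Mode` and `requestBaseUrl` are hypothetical names, and the provider URL is whatever the user configured.

```typescript
// Hypothetical names; not taken from the OpenTypeless source.
type Mode = "byok" | "cloud";

// Cloud base URL as listed in the comparison above.
const CLOUD_BASE_URL = "https://www.opentypeless.com";

// Returns the base URL that an STT or LLM request would be sent to.
function requestBaseUrl(mode: Mode, providerBaseUrl: string): string {
  // BYOK: straight to the user's provider. Cloud: via the managed service.
  return mode === "byok" ? providerBaseUrl : CLOUD_BASE_URL;
}
```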

Self-Hosting / No Cloud

To run OpenTypeless without any cloud dependency:

  1. Choose any non-Cloud STT and LLM provider in Settings
  2. Enter your own API keys
  3. That's it — no account or internet connection to www.opentypeless.com is needed

If you want to point the optional cloud features at your own backend, set these environment variables before building:

Variable | Default | Description
VITE_API_BASE_URL | https://www.opentypeless.com | Frontend cloud API base URL
API_BASE_URL | https://www.opentypeless.com | Rust backend cloud API base URL

# Example: build with a custom backend
VITE_API_BASE_URL=https://my-server.example.com API_BASE_URL=https://my-server.example.com npm run tauri build
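
On the frontend side, a build-time variable like this is usually read with a fallback to the default. The sketch below shows one way that could look; `resolveApiBaseUrl` is a hypothetical helper, not a function from the OpenTypeless code, and the trailing-slash handling is an added assumption.

```typescript
// Default value as documented for VITE_API_BASE_URL.
const DEFAULT_API_BASE_URL = "https://www.opentypeless.com";

// Hypothetical helper: picks the configured base URL or falls back to the default.
// In a Vite frontend, `env` would typically come from `import.meta.env`.
function resolveApiBaseUrl(env: Record<string, string | undefined>): string {
  const configured = env["VITE_API_BASE_URL"]?.trim();
  // Strip trailing slashes so request paths can be appended consistently.
  return (configured || DEFAULT_API_BASE_URL).replace(/\/+$/, "");
}
```

Because Vite substitutes `VITE_`-prefixed variables at build time, changing the value requires a rebuild rather than a restart.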

Architecture

Data Flow Pipeline:

Microphone → Audio Capture → STT Provider → Raw Transcript → LLM Polish → Keyboard/Clipboard Output

src/                  # React frontend (TypeScript)
├── components/       # UI components (Settings, History, Capsule, etc.)
├── hooks/            # React hooks (recording, theme, Tauri events)
├── lib/              # Utilities (API client, router, constants)
└── stores/           # Zustand state management

src-tauri/src/        # Rust backend
├── audio/            # Audio capture via cpal
├── stt/              # STT providers (Deepgram, AssemblyAI, Whisper-compat, Cloud)
├── llm/              # LLM providers (OpenAI-compat, Cloud)
├── output/           # Text output (keyboard simulation, clipboard paste)
├── storage/          # Config (tauri-plugin-store) + history/dictionary (SQLite)
├── app_detector/     # Detect active application for context
├── pipeline.rs       # Recording → STT → LLM → Output orchestration
└── lib.rs            # Tauri app setup, commands, hotkey handling
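
The orchestration that `pipeline.rs` is described as performing can be illustrated with a short TypeScript sketch. Everything here is an assumption made for illustration: the `PipelineStages` interface and `runPipeline` function are hypothetical, the stages are modeled as synchronous functions to keep the example short (real STT and LLM calls would be asynchronous), and skipping the polish step for an empty transcript is an assumed behavior.

```typescript
// Hypothetical stage signatures; the real stages live in the Rust backend.
interface PipelineStages {
  transcribe: (audio: Float32Array) => string; // STT provider
  polish: (rawTranscript: string) => string; // LLM polish
  output: (text: string) => void; // keyboard simulation or clipboard
}

// Runs the stages in order: Recording -> STT -> LLM -> Output.
function runPipeline(audio: Float32Array, stages: PipelineStages): string {
  const raw = stages.transcribe(audio);
  // Assumed behavior: skip the LLM call when nothing was recognized.
  const polished = raw.trim() === "" ? "" : stages.polish(raw);
  if (polished !== "") {
    stages.output(polished);
  }
  return polished;
}
```

Passing the stages in as functions keeps the orchestration independent of any particular provider, which mirrors the separate `stt/`, `llm/`, and `output/` modules in the tree.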

Roadmap

  • Plugin system for custom STT/LLM integrations
  • Improved multi-language STT accuracy and dialect support
  • Voice commands (e.g. "delete last sentence")
  • Customizable hotkey combinations
  • Improved onboarding experience
  • Mobile companion app

FAQ

Is my audio sent to the cloud? In BYOK mode, audio goes directly to your chosen STT provider (e.g., Groq, Deepgram). Nothing passes through OpenTypeless servers. In Cloud (Pro) mode, audio is sent to our managed proxy for transcription.

Can I use it offline? With a local STT provider (Whisper via Ollama) and a local LLM (Ollama), the app works entirely offline. No internet connection needed.

Which languages are supported? STT supports 99+ languages depending on the provider. AI polish and translation support 20+ target languages.

Is the app free? Yes. The app is fully functional with your own API keys (BYOK). The Cloud Pro subscription ($4.99/month) is optional.

Community

Contributing

Contributions are welcome! See CONTRIBUTING.md for development setup and guidelines.

Looking for a place to start? Check out issues labeled good first issue.

Built with Claude Code

This entire project was built in a single day using Claude Code — from architecture design to full implementation, including the Tauri backend, React frontend, CI/CD pipeline, and this README.

License

MIT

About

Talk more with OpenTypeless. Type with your voice. Anywhere. Talk - Record - Polish - Done!
