Build software better, together

mahimairaja / voiceai

Set of 📝 with 🔗 to help those building Voice AI agents 🎙️🤖

Updated May 25, 2026

proj-airi / webai-example-realtime-voice-chat

🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one file!

realtime-chat chatgpt-voice realtime-ai proj-airi project-airi

Updated Oct 29, 2025
TypeScript

randombet / bodhi_realtime_agent

Star

Real-time voice agents with parallel async background sub-agents — conversations continue naturally while tasks run • Join the builders → https://discord.gg/mqxKaN3UKC

nodejs typescript ai multi-agent gemini background-tasks conversational-ai voice-ai speech-to-speech voice-agent subagent vercel-ai-sdk agentic-framework realtime-ai gemini-live-api realtime-ai-agent

Updated May 18, 2026
Shell

streamcoreai / streamcore-server

Star

Open-source realtime voice agent server in Go with WebRTC (WHIP), barge-in, streaming STT/LLM/TTS pipelines, plugin system, multi-language SDKs, SIP telephony, ESP32 support & fully local mode.

Updated Apr 15, 2026
Go

EfficientTools / graffiti-detection-ai-model

Star

An AI-powered object detection system using YOLOv8 to identify and locate graffiti across various contexts including walls, buildings, over-bridges, vehicles, and other surfaces.

ai model yolov8 prevention-management realtime-ai graffiti-detection graffiti-detector realtime-model realtime-ai-model vandalism-prevention

Updated May 9, 2026
Python

visaoenhance / livekit-debug-playground

Star

LiveKit voice app validation skill. Use when building, debugging, or declaring working any LiveKit voice agent, Agents UI app, or React/Next.js LiveKit project. Enforces evidence-based validation before reporting a session, token endpoint, worker, transcript, or end-to-end voice interaction as complete.

webrtc developer-tools debugging-tools ai-agents voice-ai livekit voice-agents realtime-ai livekit-agents livekit-voice

Updated Mar 22, 2026
Python

thegauravmahto / sparks-ai-math-tutor

Star

Gemini Live API voice tutor for K-12 NCERT math — Hindi/English, hand-drawn whiteboard, open source

websocket edtech webaudio-api katex ncert gemini-api hindi-english ai-education fastapi voice-ai ai-agent generative-ai ai-tutor realtime-ai indian-education k12-education gemini-live-api math-tutor

Updated May 24, 2026
JavaScript

livepeer / dashboard

Star

Developer-facing interface for discovering and calling the Livepeer network.

developer-tools livepeer realtime-ai

Updated May 29, 2026
TypeScript

rajveer100704 / EdgepulseAI

Star

Bounded-latency browser edge inference pipeline for real-time voice interview summarization using ONNX Runtime Web + WASM. Features Web Worker isolation, semantic ring buffers, latest-only concurrency control, observability dashboard, offline-first architecture and production-ready whisper.cpp upgrade path.

webassembly wasm onnx web-workers edge-ai voice-ai transformer-js offline-ai realtime-ai browser-ai browser-ai-summarizer

Updated May 9, 2026
HTML

liu-dongfang / clinical-interview-voice-agent

Star

Voice agent prototype for structured clinical interviewing, with VAD-based interruption handling, modular ASR/LLM/TTS backends, and dialogue workflow control.

python tts vad asr dialogue-system ai-product clinical-ai voice-agent llm speech-ai realtime-ai structured-interview interruptible-ui

Updated Mar 14, 2026
Python

NDDimension / RealTime_HandSign_Detection_LSTM

Star

Real-time hand sign recognition using LSTM-based models for sequence detection from video frames.

python computer-vision gesture-detection lstm-neural-networks hand-sign-recognition realtime-ai

Updated Jul 1, 2025
Jupyter Notebook

m15-ai / Open-Claw-Realtime-Voice

Star

Real-time voice interface for OpenClaw. Stream speech-to-text, LLM reasoning, and text-to-speech into a low-latency conversational agent you can talk to—locally or in the cloud.

raspberry-pi text-to-speech streaming websockets speech-to-text conversational-ai voice-ai deepgram ai-agent llm realtime-ai openclaw low-patency ausio-processing

Updated May 3, 2026
Python

visaoenhance / livekit-agents-ui-demo

Star

LiveKit Agents UI demo showing a voice AI assistant that schedules roof inspections using real-time voice interaction, visualizers, and booking workflow.

react demo ai-agents voice-ai ai-assistant voice-agent livekit realtime-ai livekit-agents

Updated May 29, 2026
TypeScript

DevanshMistry890 / hotel-voice-agent

Star

A real-time (<500ms) voice AI concierge built with Next.js, FastAPI, and Gemini 2.5 Flash Lite. Features local RAG (ChromaDB) for policy retrieval, Tool Calling for live booking, and event-driven CRM logging to Google Sheets.

nextjs gemini-api crm-integration rag fastapi voice-agent generative-ai chromadb edgetts realtime-ai

Updated Jan 9, 2026
TypeScript

however-yir / pipecat-engine

Star

howeverpipecat: engineering-focused Pipecat distribution

python conversational-ai voice-ai ai-agent pipecat realtime-ai

Updated May 22, 2026
Python

Aashutosh31 / arc-ai-project

Star

Realtime multimodal AI agent with voice streaming, RAG memory, and autonomous workflows

react nodejs websocket socket-io autonomous-agents voice-assistant pinecone conversational-ai rag vector-database ai-assistant ai-agent llm realtime-streaming mistral-ai agentic-ai multimodal-ai realtime-ai

Updated May 25, 2026
JavaScript

VisionExpo / traffyx-ai

Star

Traffyx-AI — Traffic Forecasting & Urban Mobility Intelligence System Applied machine learning system for traffic prediction, congestion analysis, and real-world spatiotemporal data modeling.

docker opencv computer-vision pytorch object-detection multi-object-tracking smart-city urban-analytics faiss traffic-surveillance edge-ai ai-ml bytetrack yolov8 realtime-ai

Updated Feb 5, 2026
Python

ysocrius / ai-websocket-backend

Star

High-performance async Python backend for real-time AI conversations with Quart, Supabase, and OpenAI.

python websockets openai quart supabase llm realtime-ai

Updated Dec 21, 2025
Python

lavanya1402 / realtime-voice-ai-orchestrator

Star

Production-ready real-time voice AI pipeline integrating Twilio Media Streams, streaming ASR (Deepgram), LLM reasoning, and live analytics dashboard. Designed for ultra-low latency conversational intelligence in call center and healthcare environments.

twilio websocket cloud-deployment conversational-ai fastapi voice-ai deepgram llm realtime-ai call-center-automation streaming-ai

Updated Mar 1, 2026
Python

itsrobmack / realtime-voice-agent-gateway

Star

Realtime voice AI gateway with turn state, interruption handling, provider fallback, degraded state, audit events, runtime evals, Bun, and TypeScript.

typescript tts stt streaming-audio ai-agents bun voice-ai voice-agents audit-events realtime-ai agent-infrastructure barge-in provider-fallback operational-evals degraded-state

Updated May 18, 2026
TypeScript

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

realtime-ai

Here are 34 public repositories matching this topic...

mahimairaja / voiceai

proj-airi / webai-example-realtime-voice-chat

randombet / bodhi_realtime_agent

streamcoreai / streamcore-server

EfficientTools / graffiti-detection-ai-model

visaoenhance / livekit-debug-playground

thegauravmahto / sparks-ai-math-tutor

livepeer / dashboard

rajveer100704 / EdgepulseAI

liu-dongfang / clinical-interview-voice-agent

NDDimension / RealTime_HandSign_Detection_LSTM

m15-ai / Open-Claw-Realtime-Voice

visaoenhance / livekit-agents-ui-demo

DevanshMistry890 / hotel-voice-agent

however-yir / pipecat-engine

Aashutosh31 / arc-ai-project

VisionExpo / traffyx-ai

ysocrius / ai-websocket-backend

lavanya1402 / realtime-voice-ai-orchestrator

itsrobmack / realtime-voice-agent-gateway

Improve this page

Add this topic to your repo