pyannote-audio

Here are 12 public repositories matching this topic...

alperensumeroglu / ai-clips-maker

AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.

Updated Apr 2, 2025
Python

CrispStrobe / Susurrus

Star

speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, ctranslate2; applies pyannote for diarization

speech-to-text stt whisper pyannote-audio diarization pyannote whisper-cpp whisper-ai whispercpp ctranslate2 voxtral

Updated Dec 7, 2025
Python

samson6460 / pyannote-onnx-extended

Star

ONNX implementation of Pyannote Speaker Diarization 3.1 pipeline.

speaker-recognition speaker-diarization voice-activity-detection onnx speaker-segmentation pyannote-audio pyannote-onnx onnx-pyannote

Updated Jan 23, 2026
Python

Nidurshan / ai-clips-maker

Star

🎥 Transform long videos into short, shareable clips effortlessly using AI-driven tools for creators and educators.

audio-analysis automatic-speech-recognition face-tracking speaker-diarization media-processing pyannote-audio temporal-segmentation ml-pipeline ffmpeg-python deep-learning-pipelines video-scene-detection video-transcription openai-whisper huggingface-pipelines multimodal-ai video-resizing ai-video-summarization video-clip-generation

Updated Mar 11, 2026
Python

Global-Health-Engineering / ghe_transcribe

Star

A Tool to Transcribe Audio Files with Speaker Diarization

multilingual speech-to-text transcription speaker-recognition pyannote-audio diarization faster-whisper

Updated Nov 12, 2025
Python

Anny405 / DREAM-video-audio-speaker-diarization

Star

A multimodal speaker diarization system using audio, video, and dialogue cues 🗣️💬

python docker machine-learning transcript ffmpeg-wrapper pyannote-audio llm whisper-ai pythonwhisper

Updated Jan 21, 2026
Python

d-kavinraja / Multilingual-Speaker-Diarization-Role-Labeling

Star

An intelligent Streamlit application to transcribe and analyze multi-speaker medical consultations. This tool automatically identifies who spoke when (diarization), transcribes their speech (ASR), and assigns their role (Clinician or Patient), even in conversations that mix English and other languages like Hindi or Tamil.

pytorch whisper nlp-machine-learning langdetect pyannote-audio asr-model streamlit-webapp

Updated Aug 11, 2025
Python

Jibril14 / Deepgram_live_audio_transcriber_diarizer

Star

WebSocket based Python implementation that streams live audio to the Deepgram API for real-time transcription and speaker diarization.

python deep-learning livestream pytorch asynchronous-programming transcription audio-processing pyannote-audio diarization deepgram

Updated Nov 9, 2025
Python

AliDmrcIo / speech_recognition

Star

AI-Powered Speech Recognition & Diarization: A robust Streamlit application leveraging WhisperX and Faster-Whisper for accurate transcription and speaker separation. Features dual-mode processing (Fast/Pro), automatic speaker identification, color-coded Word (.docx) export, and CPU-optimized Docker deployment on AWS EC2.