Skip to content
#

pyannote-audio

Here are 12 public repositories matching this topic...

An intelligent Streamlit application to transcribe and analyze multi-speaker medical consultations. This tool automatically identifies who spoke when (diarization), transcribes their speech (ASR), and assigns their role (Clinician or Patient), even in conversations that mix English and other languages like Hindi or Tamil.

  • Updated Aug 11, 2025
  • Python

AI-Powered Speech Recognition & Diarization: A robust Streamlit application leveraging WhisperX and Faster-Whisper for accurate transcription and speaker separation. Features dual-mode processing (Fast/Pro), automatic speaker identification, color-coded Word (.docx) export, and CPU-optimized Docker deployment on AWS EC2.

  • Updated Jan 5, 2026
  • Python

Improve this page

Add a description, image, and links to the pyannote-audio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pyannote-audio topic, visit your repo's landing page and select "manage topics."

Learn more