Fully Open-Source Voice Activity Detection (VAD) for Real-Time Speech Applications

Voice Activity Detection (VAD) is a critical first step in any application involving speech recognition. However, while exploring real-time voice chat agents, I found that many state-of-the-art (SoTA) models are not truly open-source: they provide only open weights, which limits transparency and hinders research and development.
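For readers new to the task, the sketch below illustrates what a VAD produces: one speech/non-speech decision per short audio frame. It is a simple energy-threshold baseline, not the Silero VAD model and not code from this repository; the function name `frame_energy_vad` and all parameter values are assumptions chosen for illustration.

```python
import math


def frame_energy_vad(samples, sample_rate=16000, frame_ms=30, threshold_db=-35.0):
    """Label each frame True (speech-like) or False by its RMS energy.

    `samples` is a sequence of floats in [-1.0, 1.0]. The defaults are
    illustrative only and are not settings taken from this repository.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    decisions = []
    # Walk over non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        # Clamp to avoid log10(0) on digital silence.
        level_db = 20.0 * math.log10(max(rms, 1e-10))
        decisions.append(level_db > threshold_db)
    return decisions


# 0.3 s of silence followed by 0.3 s of a 220 Hz tone, both at 16 kHz.
silence = [0.0] * 4800
tone = [0.5 * math.sin(2 * math.pi * 220 * n / 16000) for n in range(4800)]
flags = frame_energy_vad(silence + tone)
```

A neural model such as Silero VAD replaces the fixed energy threshold with a learned per-frame speech probability, which is what makes it robust to background noise that a threshold like this one would misclassify.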

This repository aims to change that by providing a fully open and research-friendly implementation of the Silero VAD model. The goal is to advance the state of the art in VAD through open experimentation, training, and integration.

Status

As of May 27, 2025, this repository includes:

✅ A complete implementation of the Silero VAD model for research use

Roadmap

In the near future, I plan to add the following:

🧠 Code to train Silero VAD from scratch on custom datasets

📊 Evaluation scripts for standard VAD benchmarks

🔧 Support for LoRA fine-tuning to extend or adapt Silero VAD

🔌 Example integrations with Python, client-side web applications, and Unity

Instructions

From the repository root, install the package in editable mode:

pip install --editable .
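To confirm that the install succeeded, a quick check along these lines may help. It uses only the standard library; the import name `open_vad` is an assumption based on the `src/open_vad` directory in this repository and may differ from what `pyproject.toml` actually declares.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be located on the current Python path."""
    return importlib.util.find_spec(module_name) is not None


# "open_vad" is an assumed import name (taken from the src/open_vad directory).
if is_importable("open_vad"):
    print("open_vad is installed and importable")
else:
    print("open_vad not found; re-run the install from the repository root")
```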

License

This project is released under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0), which permits both academic research and commercial use, provided that credit is given and derivative works are shared under the same license.

stefanwebb/open-voice-activity-detection
