Skip to content
Change the repository type filter

All

    Repositories list

    • LavaSR

      Public
      🌋LavaSR: Fast Speech restoration and enhancement
      Python
      Apache License 2.0
      40000Updated Feb 26, 2026Feb 26, 2026
    • Dictation demo application
      JavaScript
      0110Updated Feb 9, 2026Feb 9, 2026
    • Twake Drive Web App
      JavaScript
      GNU Affero General Public License v3.0
      126000Updated Jan 13, 2026Jan 13, 2026
    • Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
      Python
      Apache License 2.0
      252000Updated Nov 4, 2025Nov 4, 2025
    • A LibreOffice server wrapper that is exposed over HTTP to allow easy conversions from supported documents to PDF.
      Kotlin
      31100Updated Sep 18, 2025Sep 18, 2025
    • kokoro

      Public
      https://hf.co/hexgrad/Kokoro-82M
      JavaScript
      Apache License 2.0
      674020Updated May 3, 2025May 3, 2025
    • Open Source AI Automation ✨ All our 280+ pieces are now available as MCP to use with LLMs
      TypeScript
      Other
      3.4k000Updated Apr 5, 2025Apr 5, 2025
    • Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
      Python
      Apache License 2.0
      96000Updated Mar 26, 2025Mar 26, 2025
    • directus

      Public
      The Modern Data Stack 🐰 — Directus is an instant REST+GraphQL API and intuitive no-code data collaboration app for any SQL database.
      TypeScript
      Other
      4.6k000Updated Mar 21, 2025Mar 21, 2025
    • KBLaM

      Public
      Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
      Jupyter Notebook
      MIT License
      124000Updated Mar 5, 2025Mar 5, 2025
    • An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
      Go
      Apache License 2.0
      42000Updated Feb 11, 2025Feb 11, 2025
    • open-r1

      Public
      Fully open reproduction of DeepSeek-R1
      Python
      Apache License 2.0
      2.4k000Updated Feb 10, 2025Feb 10, 2025
    • audino

      Public
      Open source audio annotation tool for humans
      JavaScript
      MIT License
      141020Updated Feb 10, 2025Feb 10, 2025
    • 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
      Python
      Apache License 2.0
      2.4k000Updated Jan 5, 2025Jan 5, 2025
    • Kickstart your LLMOps initiative with a flexible, robust, and productive Python package.
      Python
      MIT License
      124000Updated Dec 10, 2024Dec 10, 2024
    • A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
      Python
      MIT License
      211000Updated Dec 9, 2024Dec 9, 2024
    • Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
      Python
      Apache License 2.0
      43010Updated Nov 12, 2024Nov 12, 2024
    • Theraxus AI: A modular conversational AI platform ⚙️ blending STT 🎙️, TTS 🗣️, and RAG 📚 for seamless, context-aware dialogues and human-like interactions 🤖💬
      Python
      Apache License 2.0
      3000Updated Nov 9, 2024Nov 9, 2024
    • OuteTTS

      Public
      Python
      Apache License 2.0
      113000Updated Nov 5, 2024Nov 5, 2024
    • Medical Graph RAG: Graph RAG for the Medical Data
      Python
      MIT License
      127000Updated Oct 25, 2024Oct 25, 2024
    • Official inference framework for 1-bit LLMs
      C++
      MIT License
      2.4k000Updated Oct 18, 2024Oct 18, 2024
    • Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
      Python
      MIT License
      2.2k000Updated Oct 12, 2024Oct 12, 2024
    • fstalign

      Public
      An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
      C++
      Apache License 2.0
      11000Updated Sep 24, 2024Sep 24, 2024
    • moshi

      Public
      Python
      Apache License 2.0
      906000Updated Sep 18, 2024Sep 18, 2024
    • Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
      Python
      701000Updated Sep 11, 2024Sep 11, 2024
    • Nvidia-NeMo

      Public template
      A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognitio…
      Python
      Apache License 2.0
      3.4k000Updated Sep 5, 2024Sep 5, 2024
    • Directus custom extension to disable listing
      TypeScript
      GNU General Public License v3.0
      3000Updated Aug 31, 2024Aug 31, 2024
    • graphrag

      Public
      A modular graph-based Retrieval-Augmented Generation (RAG) system
      Python
      MIT License
      3.3k000Updated Aug 26, 2024Aug 26, 2024
    • speech-to-speech

      Public template
      Speech To Speech: an effort for an open-sourced and modular GPT4-o
      Python
      Apache License 2.0
      508000Updated Aug 26, 2024Aug 26, 2024
    • A directus custom module extension for managing directus flow includes backup/restore, duplication and grouping the flow.
      Vue
      GNU General Public License v3.0
      12000Updated Aug 25, 2024Aug 25, 2024