    Repositories list

    • masader

      Public
      The largest public catalogue of Arabic NLP and speech datasets. It lists 500+ datasets, each annotated with more than 25 attributes.
      JavaScript
      3519311 Updated Jan 30, 2026
    • Python
      7521 Updated Oct 13, 2025
    • CIDAR

      Public
      Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
      Jupyter Notebook
      84300 Updated Apr 3, 2025
    • Calliar

      Public
      A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.
      Jupyter Notebook
      2015320 Updated Jun 24, 2024
    • dar

      Public
      A simple semi-supervised approach for creating Hugging Face dataset loader scripts and uploading them to the Hub.
      Python
      21110 Updated Jun 23, 2024
    • HTML
      2010 Updated May 10, 2024
    • .github

      Public
      1100 Updated Apr 13, 2024
    • CIDAR-v2

      Public
      Jupyter Notebook
      2630 Updated Mar 30, 2024
    • Python
      1110 Updated Mar 3, 2024
    • ARBML

      Public
      Implementations of many Arabic NLP and CV projects, with a real-time experience through several interfaces: web, command line, and notebooks.
      JavaScript
      48422100 Updated Mar 1, 2024
    • Taqyim

      Public
      Python interface for evaluating ChatGPT models.
      Jupyter Notebook
      41910 Updated Feb 13, 2024
    • evals

      Public
      Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
      Jupyter Notebook
      2.9k201 Updated Feb 13, 2024
    • nmatheg

      Public
      A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the…
      Jupyter Notebook
      52210 Updated Jan 27, 2024
    • tkseem

      Public
      Arabic Tokenization Library. It provides many tokenization algorithms.
      Jupyter Notebook
      2111043 Updated Jan 4, 2024
    • klaam

      Public
      Arabic speech recognition, classification and text-to-speech.
      Jupyter Notebook
      85422141 Updated Sep 30, 2023
    • tnkeeh

      Public
      Arabic cleaning, normalization and segmentation library.
      Python
      97320 Updated Sep 28, 2023
    • mat-bpe

      Public
      Jupyter Notebook
      0000 Updated Aug 6, 2023
    • Ashaar

      Public
      Arabic poetry analysis and generation.
      Jupyter Notebook
      42400 Updated Jul 23, 2023
    • Jupyter Notebook
      0200 Updated Jun 4, 2023
    • Python
      0300 Updated Apr 3, 2023
    • atmatah

      Public
      A repository of scripts for automating processes, such as configuring web apps on remote machines.
      Jinja
      0040 Updated Jan 25, 2023
    • qawafi

      Public
      Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.
      Jupyter Notebook
      103550 Updated Jan 3, 2023
    • 0000 Updated Dec 31, 2022
    • whisperar

      Public
      Python
      34110 Updated Dec 25, 2022
    • Bohour

      Public
      Bohour, a package that abstracts the science of Arabic poetry (Aroud).
      Python
      0220 Updated Dec 2, 2022
    • adawat

      Public
      Jupyter Notebook
      0600 Updated Nov 17, 2022
    • rasm

      Public
      Arabic Art using GANs
      Python
      31700 Updated Aug 3, 2022
    • bayanat

      Public
      Explore the content of Arabic text datasets.
      Jupyter Notebook
      31920 Updated May 23, 2022
    • Research

      Public
      Support Arabic people working on research by creating an environment for ideas in NLP and speech.
      01130 Updated Apr 25, 2021
    • MetRec

      Public
      Arabic Poetry Metric Classification Using Bidirectional Gated Recurrent Neural Networks
      Jupyter Notebook
      11100 Updated Jun 3, 2020