Skip to content
@NanoNets

Nanonets

Popular repositories Loading

  1. docext docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    Python 1.8k 132

  2. docstrange docstrange Public

    Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

    Python 688 51

  3. nanonets-ocr-sample-python nanonets-ocr-sample-python Public

    NanoNets OCR API Example for Python

    Python 202 53

  4. RaspberryPi-ObjectDetection-TensorFlow RaspberryPi-ObjectDetection-TensorFlow Public

    Object Detection using TensorFlow on a Raspberry Pi

    Python 171 38

  5. ocr-with-tesseract ocr-with-tesseract Public

    A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV

    Jupyter Notebook 124 72

  6. ocr-python ocr-python Public

    OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

    Jupyter Notebook 116 17

Repositories

Showing 10 of 56 repositories
  • docstrange Public

    Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

    NanoNets/docstrange’s past year of commit activity
    Python 688 MIT 51 8 0 Updated Sep 11, 2025
  • docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    NanoNets/docext’s past year of commit activity
    Python 1,755 Apache-2.0 132 17 (1 issue needs help) 3 Updated Aug 25, 2025
  • llm-data-converter Public

    Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

    NanoNets/llm-data-converter’s past year of commit activity
    Python 5 MIT 0 0 0 Updated Aug 14, 2025
  • nanonets-go Public

    Code samples in golang for nanonets API

    NanoNets/nanonets-go’s past year of commit activity
    Go 1 MIT 1 0 0 Updated May 29, 2025
  • DocAIAgent Public

    This code is part of a workshop conducted on how to build your own Document AI Agent using Open Source LLMs

    NanoNets/DocAIAgent’s past year of commit activity
    Jupyter Notebook 15 6 0 0 Updated May 8, 2025
  • table-metrics Public

    A repo with all metrics related to table extraction accuracy computation

    NanoNets/table-metrics’s past year of commit activity
    0 MIT 0 0 0 Updated Apr 24, 2025
  • nn-auto-bench Public

    AutoBench: Benchmarking Automation for Intelligent Document Processing (IDP) with confidence

    NanoNets/nn-auto-bench’s past year of commit activity
    Python 10 4 0 0 Updated Mar 18, 2025
  • search-kb Public
    NanoNets/search-kb’s past year of commit activity
    Python 0 0 0 0 Updated Feb 2, 2025
  • NanoNets/hands-on-vision-language-models’s past year of commit activity
    Jupyter Notebook 7 2 0 0 Updated Nov 15, 2024
  • Nanonets Public
    NanoNets/Nanonets’s past year of commit activity
    Python 3 0 0 0 Updated Oct 1, 2024