Demo: RAG/docling/llama-index service with an Instructlab frontend #287

nerdalert · 2024-10-21T20:45:18Z

RAG Demo:

Adds RAG APIs to docling-serve to create a RAG service.
llama-index for categorical document collections.
Instructlab quantized model is used for generations.
Two collections are created, one via URL and one via file
uploading to via the Instructlab UI.
Collections are then queried and returned with the Answer
to the query along with the sources and metadata from the
vector DB.
Do a negative test to ensure if the document does not
contain information matching the query it does not
hallucinate an answer.

** All components are running locally on a MAC M1.
The PDF ingestions are fast forwarded as they take about 60s each.**

Backend RAG service code will be posted to the docling repo and link here.

Demo video:

rag-demo-v2-oct21.mp4

Signed-off-by: Brent Salisbury <[email protected]>

nerdalert · 2024-10-30T16:28:06Z

The backend code is posted here docling-project/docling-serve#9

Demo: RAG/Docling/llama-index

a2b56a0

Signed-off-by: Brent Salisbury <[email protected]>

nerdalert force-pushed the rag-demo-v2 branch from 7937b32 to a2b56a0 Compare October 21, 2024 20:50

nerdalert mentioned this pull request Oct 30, 2024

Demo: RAG service docling-project/docling-serve#9

Draft

vishnoianil added the demo PR that contains Demo related changes label Nov 6, 2024

Provide feedback