Skip to content

[Feature] Extract text from documents #108

@mawandm

Description

@mawandm

Description
As a user, I'd like to extract text from the document.

Detail
Text extraction is useful to allow for intermediary steps to document ingestion. This will be allow for other processes such as;

  1. Data cleansing
  2. Data exclusion based on an exclusion list.
  3. Approval workflows

Acceptance Criteria

  1. An API /v1/extractions/text in the RAG microservice.
  2. Extraction path added to the API microservice during document processing.
  3. Persisting the extracted text to an external SQL datasource.

Metadata

Metadata

Assignees

Labels

APIBackend APIenhancementNew feature or requestragRag Engine

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions