[Feature]: Integrations with other backends via hOCR (naive implementation of EasyOCR backend inside) #1250

### Describe the proposed feature


Hi, I see there are a few issues on the board proposing integrations with new backends.

I wondered how difficult a naive integration would be. It turns out to be doable: here is the result of a quick-and-dirty plugin I wrote in a couple of hours. I converted a non-readable sample PDF using OCRmyPDF with an EasyOCR backend:

![image](https://github.com/ocrmypdf/OCRmyPDF/assets/27767588/d85dffe9-1a4c-4e2d-a7c5-f1f7672449db)

I basically built hOCR output from the EasyOCR result object. However, I am not sure whether this is a suitable approach, or whether it has fundamental limitations that would prevent this kind of integration from succeeding.

I expect any OCR engine to provide at least bounding boxes and text, which can then be expressed in hOCR format.
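To make the idea concrete, here is a minimal sketch of that conversion. It is not the plugin's actual code. It assumes the results are `(quad, text, confidence)` tuples, which is the shape EasyOCR's `readtext` returns, with `quad` being four `[x, y]` corner points. The function name `results_to_hocr` and the choice to emit each detection as one `ocr_line` holding a single `ocrx_word` are assumptions made for illustration.

```python
from html import escape


def results_to_hocr(results, page_width, page_height):
    """Build a minimal hOCR page from (quad, text, confidence) tuples.

    Hypothetical helper: each detection becomes one ocr_line containing a
    single ocrx_word, which is a simplification of real page layout.
    """
    lines = []
    for i, (quad, text, conf) in enumerate(results, start=1):
        # Collapse the four corner points into an axis-aligned bounding box.
        xs = [int(round(p[0])) for p in quad]
        ys = [int(round(p[1])) for p in quad]
        bbox = f"bbox {min(xs)} {min(ys)} {max(xs)} {max(ys)}"
        # hOCR word confidence (x_wconf) is expressed on a 0-100 scale.
        wconf = round(conf * 100)
        lines.append(
            f"<span class='ocr_line' id='line_{i}' title='{bbox}'>"
            f"<span class='ocrx_word' id='word_{i}' "
            f"title='{bbox}; x_wconf {wconf}'>{escape(text)}</span></span>"
        )
    body = "\n".join(lines)
    return (
        "<?xml version='1.0' encoding='UTF-8'?>\n"
        "<html xmlns='http://www.w3.org/1999/xhtml'>\n"
        "<head><meta name='ocr-system' content='easyocr-sketch'/></head>\n"
        "<body>\n"
        f"<div class='ocr_page' id='page_1' "
        f"title='bbox 0 0 {page_width} {page_height}'>\n"
        f"{body}\n"
        "</div>\n</body>\n</html>\n"
    )


# Example with one made-up detection on a 200x100 pixel page.
sample = [([[10, 20], [110, 20], [110, 50], [10, 50]], "A & B", 0.98)]
hocr = results_to_hocr(sample, 200, 100)
```

One caveat with this sketch: a detection from the engine may span a phrase rather than a single word, and rotated quads lose their angle when flattened to a box, so the resulting text layer may not align word by word. I have not verified whether such minimal hOCR is enough for OCRmyPDF's renderer in all cases.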

Are there any fundamental or semantic limitations I am unaware of that make reconstructing the hOCR format difficult?
