Skip to content

sandwich pdf-rederer is not working. Getting gibberish text from output PDF. #17

@sanjay-nit

Description

@sanjay-nit

Hi, Even after using --pdf-renderer=sandwich option; getting gibberish text when I select the text from output pdf and paste somewhere.

FYI: I'm using MacOS(M1) 15.0.1

Below are the steps I took.

  1. pip install git+https://github.com/ocrmypdf/OCRmyPDF-EasyOCR.git
  2. command: ocrmypdf --pdf-renderer sandwich image.pdf test.pdf

Version info:

easyocr==1.7.2
ocrmypdf==16.5.0
ocrmypdf-easyocr==0.2.1

Below are logs:
image

I'm attaching the PDF file I tested with.
image.pdf
ouput.pdf

I also tried processing image version of this PDF, this isn't working too.
command: ocrmypdf --pdf-renderer=sandwich --force-ocr --image-dpi 300 image.jpg test.pdf
image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions