Skip to content

Error in single page exits the ocrmypdf command - add option to continue with next page? #1576

@a1ch3mist

Description

@a1ch3mist

I'm processing a 60-page PDF. One of the pages keeps getting an error with tesseract (supposedly deskew, when the image is mostly blank with a few dots/marks).

My ocrpdf command exits, when I'd rather it just continue to the next page and leave no OCR-text on the bad page. I don't see an option for doing that though.

Is there a way already to proceed when a single-page gives an error (error comes via tesseract it seems from the verbose logs)? If not, please add this as an option.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions