Description
We agreed to include a short © statement in the footer of the PDF pages (possibly with some opacity): "© Parzival-Projekt, Universität Bern, $year"
I think we can prepare a template document with that info, say copyright-template.pdf, and then merge it into the page using
pdftk monopsen-pdftk-updated-links.pdf background copyright-template.pdf output monopsen-pdftk-updated-copyrighted.pdf
or
pdftk monopsen-pdftk-updated-links.pdf stamp copyright-template.pdf output monopsen-pdftk-updated-copyrighted.pdf
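On choosing between the two: pdftk's background operation places the template underneath the existing page content, while stamp places it on top. If the pages are opaque (for example, built from scans), a background would not be visible, so stamp is probably the safer choice here. A minimal sketch of a helper that accepts either mode; the function name apply_copyright and the PDFTK override variable are hypothetical and only there to make it easy to try out:

```shell
# Hypothetical helper: apply the copyright template in the chosen mode.
# mode "stamp" draws the template over each page,
# mode "background" draws it under the page content.
# PDFTK is an assumed override for the binary (e.g. PDFTK=echo for a dry run).
apply_copyright() {
  mode="$1"; input="$2"; template="$3"; output="$4"
  case "$mode" in
    stamp|background) ;;
    *) echo "unknown mode: $mode (expected stamp or background)" >&2
       return 1 ;;
  esac
  "${PDFTK:-pdftk}" "$input" "$mode" "$template" output "$output"
}
```

For example: apply_copyright stamp monopsen-pdftk-updated-links.pdf copyright-template.pdf monopsen-pdftk-updated-copyrighted.pdf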
Should be easy to chain this with what we did previously:
Merging the single files using
pdfunite $(ls -v *.pdf) monopsen.pdf
yields monopsen.pdf.
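The -v flag matters for page order: with GNU ls it sorts numbers inside file names naturally, so page 10 comes after page 2. A small illustration with made-up file names (1.pdf, 2.pdf, 10.pdf are placeholders, and list_pages is a hypothetical helper; this assumes GNU coreutils):

```shell
# Hypothetical helper: print the PDFs in directory $1 on one line,
# in version-sort ("natural") order, as used for the pdfunite call.
list_pages() {
  (cd "$1" && echo $(ls -v -- *.pdf))
}

demo_dir=$(mktemp -d)
touch "$demo_dir/1.pdf" "$demo_dir/2.pdf" "$demo_dir/10.pdf"

# Plain byte-wise sorting puts 10 before 2:
(cd "$demo_dir" && echo $(LC_ALL=C ls -- *.pdf))   # prints: 1.pdf 10.pdf 2.pdf

# Version sort keeps the pages in numeric order:
list_pages "$demo_dir"                             # prints: 1.pdf 2.pdf 10.pdf

rm -r "$demo_dir"
```

Note that the unquoted $(ls -v *.pdf) splits on whitespace, so this only works as long as the file names contain no spaces.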
FWIW, it isn't hard to replace the links:
pdftk monopsen-pdftk.pdf output foo.tmp
parallel sed -i -e 's/\(https\?\)\(.*\)parzdb.*dreissiger=\([0-9]\+\).*\()\)/https\2fassungen\/\3\4/' -- foo.tmp
pdftk foo.tmp output monopsen-pdftk-updated-links.pdf compress
This yields monopsen-pdftk-updated-links.pdf.
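To see what the substitution does, it can be run on a single line outside the PDF. The sample line below is made up (example.org is a placeholder host and the query string is an assumption about what the old links look like); rewrite_links is a hypothetical wrapper name. The expression relies on GNU sed for \? and \+.

```shell
# The substitution from the command above, wrapped as a stdin filter.
rewrite_links() {
  sed -e 's/\(https\?\)\(.*\)parzdb.*dreissiger=\([0-9]\+\).*\()\)/https\2fassungen\/\3\4/'
}

# Hypothetical link line as it might appear in the PDF source.
printf '%s\n' '/URI (http://www.example.org/parzdb/index.php?dreissiger=249&x=1)' | rewrite_links
# prints: /URI (https://www.example.org/fassungen/249)
```

The scheme is forced to https, everything from parzdb up to the dreissiger number is replaced by fassungen/, and the closing parenthesis is kept. Because .* is greedy, a line holding more than one link would likely need a tighter pattern.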
Using pdftk, we get pages of varying size, but it might be good enough.
Edit: no changes to the text according to
diff <(pdftotext -layout monopsen-pdftk.pdf /dev/stdout) <(pdftotext -layout monopsen-pdftk-updated-links.pdf /dev/stdout)
Edit 2: no visual changes according to diff-pdf-visually monopsen-pdftk.pdf monopsen-pdftk-updated-links.pdf:
Temporary directory: /tmp/diffpdf-4kqqhns4
Converting each page of the PDFs to an image...
PDFs have same number of pages.
Checking each pair of converted images...
Min sig = inf, significant?=False.
The PDFs are the same.
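If the text check is to be repeated after each conversion, it could be turned into a small function whose exit status says whether the extracted text is unchanged. This is only a sketch: same_text is a hypothetical name, and it uses command substitution instead of the process substitution shown above so that it also runs in a plain POSIX shell.

```shell
# Hypothetical helper: succeed (exit 0) only if both PDFs yield the same
# text with pdftotext -layout. Returns 2 if the extraction itself fails,
# so a missing tool or file is not mistaken for "no changes".
same_text() {
  text_a=$(pdftotext -layout "$1" -) || return 2
  text_b=$(pdftotext -layout "$2" -) || return 2
  [ "$text_a" = "$text_b" ]
}
```

For example: same_text monopsen-pdftk.pdf monopsen-pdftk-updated-links.pdf && echo "text unchanged"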
Originally posted by @pdaengeli in #17
We can wrap this in a script and run it in the pipeline to facilitate later updates.
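A rough sketch of what such a script could look like, starting from the already merged PDF. The function names run and build_pdf, the DRY_RUN switch, and the derived file names are assumptions for illustration; the pdftk and sed calls are the ones quoted above, with stamp picked as the copyright mode and parallel dropped since only one file is edited.

```shell
# Substitution quoted earlier in this issue (needs GNU sed).
LINK_SED='s/\(https\?\)\(.*\)parzdb.*dreissiger=\([0-9]\+\).*\()\)/https\2fassungen\/\3\4/'

# Hypothetical wrapper: with DRY_RUN=1 print the command instead of running it.
run() {
  if [ "${DRY_RUN:-0}" = 1 ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Hypothetical entry point.
# $1: merged input PDF, $2: one-page copyright template PDF.
build_pdf() {
  base="${1%.pdf}"
  tmp="$base.tmp"
  links="$base-updated-links.pdf"
  final="$base-updated-copyrighted.pdf"

  run pdftk "$1" output "$tmp" &&                    # rewrite PDF for editing
  run sed -i -e "$LINK_SED" "$tmp" &&                # replace the links in place
  run pdftk "$tmp" output "$links" compress &&       # write the updated PDF
  run pdftk "$links" stamp "$2" output "$final"      # add the copyright footer
}
```

Running DRY_RUN=1 build_pdf monopsen-pdftk.pdf copyright-template.pdf prints the four commands without touching any files, which may be handy for checking the pipeline configuration before the real conversion.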
For now, however, the plan is to do a "final" conversion in mid-December. We might need to reach out to SA to share the new PDFs then.
(@domsteinbach, I think you were interested in the PDF workflow, so tagging you here)