New blog post for #fileformatfriday - #PDF Quality assessment for #digitisation batches with #Python, #PyMuPDF and #Pillow. This introduces the new #Pdfquad tool, which might be useful for others as well:
https://www.bitsgalore.org/2024/12/13/pdf-quality-assessment-for-digitisation-batches-with-python-pymupdf-and-pillow
=> More informations about this toot | More toots from bitsgalore@digipres.club
@bitsgalore
I cringe over their "best practice" of a color 300dpi JPEG for a text document.
=> More informations about this toot | More toots from bitsavers@oldbytes.space
@bitsavers I know, there are historical reasons behind this, which are mostly related to having a production workflow with processing steps that involve multiple vendors.
Personally I'd be in favor of scanning to an image format like JP2, and then use that as a basis for all derivatives. But this would require a major re-design of the processing workflow and all technical documentation. We've had some discussions about this, but this hasn't led to any action so far.
=> More informations about this toot | More toots from bitsgalore@digipres.club This content has been proxied by September (ba2dc).Proxy Information
text/gemini