[Feature request] Utilize PDF Mix Tool as a front end for OCRmyPDF back-end
Hi. There is a very powerful command line program which can perform "Optimize scanned PDF" as following:
- Deskew (correction of slopping)
- Descreen (remove black dots noises)
- Remove background
- Text sharpening (text enhancement: make text more clear)
- centralize (make content image at middle of page so that left white margin will be equal to right white margin of the page)
- OCR PDF
OCRmyPDF depend on QPDF + a python variant of it called pikepdf.
Sites of OCRmyPDF: https://pypi.org/project/ocrmypdf/ & https://github.com/jbarlow83/OCRmyPDF
Sites of pikepdf: https://pikepdf.readthedocs.io/en/latest/ & https://github.com/pikepdf/pikepdf
Note: pikepdf has abilities to re-size the pages of PDF, also, so it could be helpful for #24 (closed)
Bellow are screenshots from Windows programs have optimize scanned PDF functionalities (centralize contents not included):