DietPDF aims at reducing PDF file size while not degrading quality nor losing metadata

Last update: Jul 27, 2022

Overview

dietpdf

Description

DietPDF aims to reduce PDF file size without degrading quality.

Here are some tricks used to achieve this goal:

Use Zopfli instead of Zlib to get a better compression ratio while remaining compatible with Zlib.
Use jpegtran to optimize embedded JPEGs and remove unnecessary data from them.
Use Run-Length Encoding to help Zopfli achieve better compression.
Use Zopfli on embedded JPEGs, which sometimes helps.
Remove unnecessary spaces in the PDF.
Convert end-of-line characters to spaces in Form Objects or Contents (this helps compression).
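
The whitespace and compression tricks can be illustrated with a minimal sketch. This is not DietPDF's actual code: the function names are hypothetical, the standard zlib module stands in for Zopfli (which writes the same stream format, only smaller), and the whitespace handling ignores string literals and inline images, where whitespace is significant.

```python
import re
import zlib


def normalize_whitespace(content: bytes) -> bytes:
    """Replace end-of-line characters with spaces and collapse runs of spaces.

    Simplification (assumption): string literals, comments and inline
    image data are not handled, although whitespace matters there.
    """
    return re.sub(rb"[ \t\r\n]+", b" ", content).strip()


def deflate(data: bytes) -> bytes:
    """Stand-in for Zopfli: produce a zlib-format stream at the maximum level."""
    return zlib.compress(data, 9)


# A small, made-up content stream with Windows line endings and indentation.
sample = b"q\r\n  1 0 0 1 72 720 cm\r\n  BT\r\n    /F1 12 Tf\r\n    0 0 Td\r\n  ET\r\nQ\r\n"

compact = normalize_whitespace(sample)
packed = deflate(compact)

# Any zlib-compatible decoder, such as the one in a PDF reader, restores the data.
assert zlib.decompress(packed) == compact
print(len(sample), len(compact), len(packed))
```

On a stream this short the compressed form may not be smaller than the input. The point is only that the normalized data round-trips through a standard zlib decoder.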

It also comes with extractpdf, which extracts all the streams contained in a PDF file.
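
To give an idea of what extracting streams involves, here is a rough sketch. It is not how extractpdf is implemented: the function name and the regular expression are assumptions, and a real tool would use each stream's /Length entry and /Filter list rather than searching for keywords.

```python
import re
import zlib
from typing import List

# Match the bytes between the "stream" and "endstream" keywords.
# The lookbehind avoids treating the tail of "endstream" as a new "stream".
STREAM_RE = re.compile(rb"(?<!end)stream\r?\n(.*?)\r?\nendstream", re.DOTALL)


def extract_streams(pdf_bytes: bytes) -> List[bytes]:
    """Return the payload of every stream, inflated when it is zlib data."""
    streams = []
    for match in STREAM_RE.finditer(pdf_bytes):
        raw = match.group(1)
        try:
            streams.append(zlib.decompress(raw))
        except zlib.error:
            # Not Flate-compressed (or another filter is used): keep raw bytes.
            streams.append(raw)
    return streams


# A hand-made fragment with one compressed and one uncompressed stream.
fragment = (
    b"1 0 obj\n<< /Filter /FlateDecode >>\nstream\n"
    + zlib.compress(b"BT ET")
    + b"\nendstream\nendobj\n"
    + b"2 0 obj\n<< >>\nstream\nplain\nendstream\nendobj\n"
)
print(extract_streams(fragment))
```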

Notes

This program is not ready for production!

It does not yet support cross-reference objects.

This project has been set up using PyScaffold 3.3.1. For details and usage information on PyScaffold see https://pyscaffold.org/.

Requirements

This is plain Python 3 that relies almost entirely on the standard library.

It uses the following external programs:

zopfli (apt install zopfli)
jpegtran (apt install libjpeg-turbo-progs)
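
Before running the tool, it can be useful to check that both external programs are on the PATH. The helper below is a hypothetical sketch and not part of DietPDF. It relies only on shutil.which from the standard library.

```python
import shutil
from typing import List, Sequence


def missing_programs(names: Sequence[str] = ("zopfli", "jpegtran")) -> List[str]:
    """Return the names of the programs that cannot be found on the PATH."""
    return [name for name in names if shutil.which(name) is None]


missing = missing_programs()
if missing:
    print("Missing external programs:", ", ".join(missing))
else:
    print("zopfli and jpegtran are available")
```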

Installation

In the dietpdf directory:

pip3 install .

python3 setup.py install --home=~

Owner

Frédéric BISSON

A bulk pdf generator. This application can generate PDFs in bulk by using just one click.

A simple Python script to convert multiple images (well technically also a single image) into a pdf.

Convert PDF to AudioBook and Audio Speech to PDF

A simple PDF size-compressing Telegram robot written in Python.

Simple pdf editor while preserving structure and format.

Compare-pdf - A Flask driven restful API for comparing two PDF files

This is a simple program that converts a PDF file to PNG.

Generate a preview image for a PDF.

Program that locks/unlocks pdf files🐍

pikepdf is a Python library for reading and writing PDF files.

Zen-Knit is a formal (PDF), informal (HTML) report generator for data analysts and data scientists who want to use Python.

Svg2pdfgen - Svg To PDF gen with python

minipdf is a package for creating simple, single-page PDF documents.

Split given PDF document into 4 page groups and convert them to booklet format

Merge multiple PDF files into one.

PDFSanitizer - Renders possibly unsafe PDF files and outputs harmless PDF files

A tool for certificate PDF generation.

Python PDF Parser (Not actively maintained). Check out pdfminer.six.

Auto Convert PDFs to png files in python

Telegram bot that can do a lot of things related to PDF files.