Differences between revisions 1 and 10 (spanning 9 versions)
Revision 1 as of 2020-08-26 01:37:41
Size: 1301
Comment:
Revision 10 as of 2023-04-08 17:25:59
Size: 1923
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
= Poppler Utils = = Poppler Utilities =
Line 3: Line 3:
A set of PDF manipulation scripts are distributed as the `poppler-utils` package. The scripts are all based on the xpdf project and codebase. '''Poppler Utilities''' are a set of PDF manipulation scripts. The scripts are all based on the xpdf project and codebase.

See also the close alternatives, [[LaTeX/Pdfjam|pdfjam]] or [[PostScriptUtilities|psutils]].
Line 11: Line 13:
== PDF Manipulation == == Installation ==

Most [[Linux]] distributions offer a `poppler-utils` package.

----



== Usage ==



=== Joining Documents ===
Line 21: Line 35:


=== Splitting Documents ===
Line 29: Line 47:
----
Line 32: Line 49:

== PDF into Image ==
=== Extracting Images ===
Line 50: Line 66:


=== Converting to Images ===
Line 62: Line 82:

----



== See also ==

[[https://man.archlinux.org/man/extra/poppler/pdfimages.1.en|pdfimages(1)]]

[[https://man.archlinux.org/man/extra/poppler/pdfseparate.1.en|pdfseparate(1)]]

[[https://man.archlinux.org/man/extra/poppler/pdftoppm.1.en|pdftoppm(1)]]

[[https://man.archlinux.org/man/extra/poppler/pdfunite.1.en|pdfunite(1)]]


Poppler Utilities

Poppler Utilities are a set of PDF manipulation scripts. The scripts are all based on the xpdf project and codebase.

See also the close alternatives, pdfjam or psutils.


Installation

Most Linux distributions offer a poppler-utils package.


Usage

Joining Documents

To join PDF documents, try:

pdfunite INPUT1 [INPUT2 ..] OUTPUT

Note that all input files must be unencrypted.

Splitting Documents

To separate a PDF document into one for each page, try:

pdfseparate INPUT OUTPUT-PATTERN

Output files will be named as OUTPUT-PATTERN-001.pdf.

Extracting Images

To extract images from a PDF file, try:

pdfimages -png INPUT OUTPUT-PATTERN

Output files will be named as OUTPUT-PATTERN-001.png. Alternate formats available include:

Format

Option

JPEG

-j

JPEG2000

-jp2

TIFF

-tiff

JBIG2

-jbig2

CCITT

-ccitt

Converting to Images

To render each page of a PDF into an image, try:

pdftoppm -png INPUT OUTPUT-PATTERN

Format

Option

JPEG

-jpeg

CMYK JPEG

-jpegcmyk

TIFF

-tiff

Note that resolution is 150x150 by default. This can be set using -r N or -rx N -ry N.


See also

pdfimages(1)

pdfseparate(1)

pdftoppm(1)

pdfunite(1)


CategoryRicottone

PopplerUtilities (last edited 2023-04-08 17:25:59 by DominicRicottone)