PDF

Apache PDFBox – library for working with PDF documents

The Apache PDFBox library is an open source Java tool for working with PDF documents.

This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents.

PDFBox also includes several command line utilities.

Key Features

  • Extract text – extract Unicode test from PDF files.
  • Split & Merge – split a single PDF into many files or merge multiple PDF files.
  • Fill Forms – extract data from PDF forms or fill a PDF form.
  • Preflight – validate PDF files against the PDF/A-1b standard.
  • Print – print a PDF file using the standard Java printing API.
  • Save as Image – save PDFs as image files such as PNG or JPEG.
  • Create PDFs – create a PDF from scratch, with embedded fonts and images.
  • Signing – digitally sign PDF files.
  • Uses the Java Cryptography Architecture (JCA) and the Bouncy Castle libraries for handling encryption in PDF documents.

Website: pdfbox.apache.org
Support: Mailing Lists, GitHub Code Repository
Developer: Apache Software Foundation
License: Apache License 2.0

PDFBox is written in Java. Learn Java with our recommended free books and free tutorials.


Related Software

PDF Development Libraries
PDFBoxCreate, render, print, split, merge, alter, verify and extract text and metadata
TCPDFPHP class for generating PDF documents
PopplerLibrary for rendering PDF files, and examining or modifying their structure
PDFKitPDF document generation library for Node and the browser
pdfcpuPDF processing library
Apache FOPPrint formatter driven by XSL formatting objects
QPDFLibrary and programs that inspect and manipulate the structure of PDF files
PoDoFoParse PDF files and modify their contents into memory
OpenPDFLibrary for creating and editing PDF files; fork of iText
xhtml2pdfHTML to PDF converter using Python
libHaruLibrary for generating PDFs
CapyPDFFully color managed PDF generation library
pdf-libCreate and modify PDF documents in a JavaScript environment
PDFioPDF read/write library
PDFsharp.NET library for processing PDF files
JasperReportsReporting engine written in Java
CamlPDFOCaml library for reading, writing and modifying PDF files

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Know a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments