The Apache PDFBox library is an open source Java tool for working with PDF documents.
This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents.
PDFBox also includes several command line utilities.
Key Features
- Extract text – extract Unicode test from PDF files.
- Split & Merge – split a single PDF into many files or merge multiple PDF files.
- Fill Forms – extract data from PDF forms or fill a PDF form.
- Preflight – validate PDF files against the PDF/A-1b standard.
- Print – print a PDF file using the standard Java printing API.
- Save as Image – save PDFs as image files such as PNG or JPEG.
- Create PDFs – create a PDF from scratch, with embedded fonts and images.
- Signing – digitally sign PDF files.
- Uses the Java Cryptography Architecture (JCA) and the Bouncy Castle libraries for handling encryption in PDF documents.
Website: pdfbox.apache.org
Support: Mailing Lists, GitHub Code Repository
Developer: Apache Software Foundation
License: Apache License 2.0
PDFBox is written in Java. Learn Java with our recommended free books and free tutorials.
Related Software
| PDF Development Libraries | |
|---|---|
| PDFBox | Create, render, print, split, merge, alter, verify and extract text and metadata |
| TCPDF | PHP class for generating PDF documents |
| Poppler | Library for rendering PDF files, and examining or modifying their structure |
| PDFKit | PDF document generation library for Node and the browser |
| pdfcpu | PDF processing library |
| Apache FOP | Print formatter driven by XSL formatting objects |
| QPDF | Library and programs that inspect and manipulate the structure of PDF files |
| PoDoFo | Parse PDF files and modify their contents into memory |
| OpenPDF | Library for creating and editing PDF files; fork of iText |
| xhtml2pdf | HTML to PDF converter using Python |
| libHaru | Library for generating PDFs |
| CapyPDF | Fully color managed PDF generation library |
| pdf-lib | Create and modify PDF documents in a JavaScript environment |
| PDFio | PDF read/write library |
| PDFsharp | .NET library for processing PDF files |
| JasperReports | Reporting engine written in Java |
| CamlPDF | OCaml library for reading, writing and modifying PDF files |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Know a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

