Primary language: Java. License: MIT.

Exploring-PDFBOX

The Apache PDFBox® library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command line utilities. Apache PDFBox is published under the Apache License v2.0.

How to Use with an IDE:

Prerequisites:
1. Download the Eclipse or NetBeans IDE.
2. Download the PDFBox JAR from the official website.
3. Create a new Java project and add the JAR to its build path: right-click the project -> Properties -> Build Path -> Add JARs -> select the PDFBox JAR.
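
Once the JAR has been added, a quick check can confirm that the IDE actually sees it. The following is a minimal sketch, not part of this repository: the class name `PdfBoxClasspathCheck` and the helper `isOnClasspath` are hypothetical, and the check assumes that `org.apache.pdfbox.pdmodel.PDDocument` (the main document class shipped in the PDFBox JAR) is a reasonable class to probe for.

```java
// Hypothetical helper (not a file in this repository): reports whether
// the PDFBox JAR is visible on the project's classpath.
final class PdfBoxClasspathCheck {

    // Assumption: PDDocument is used as the probe class for the PDFBox JAR.
    static final String PDFBOX_CLASS = "org.apache.pdfbox.pdmodel.PDDocument";

    // Returns true if the named class can be located, without initializing it.
    static boolean isOnClasspath(String className) {
        try {
            Class.forName(className, false, PdfBoxClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        if (isOnClasspath(PDFBOX_CLASS)) {
            System.out.println("PDFBox found on the classpath.");
        } else {
            System.out.println("PDFBox not found; re-check the build path settings.");
        }
    }
}
```

Running `main` from the IDE after step 3 should print the "found" message; if it does not, the JAR was probably added to a different project or not saved in the build path dialog.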

Available Examples:

1. Extracting phone numbers from a PDF: see extractPhoneNumber.java
2. Splitting PDFs: see PDFSplitters.java
3. Counting the file size of a PDF or other document: see FileSizeCounter.java
4. Fetching the metadata of a PDF: see GetMetadata.java
5. Creating a PDF file with images: see CreatePDFWithImage.java
6. Checking whether a PDF is rotated, whether its pages are landscape or portrait, extracting images from a PDF, and finding the font styles used in a document: see RotatedPDFChecker.java
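
To give a feel for the first example, here is a minimal sketch of the phone number matching step. It is not the code in extractPhoneNumber.java: the class `PhoneNumberSketch`, the method `extractPhoneNumbers`, and the regular expression are all assumptions made for illustration. The pattern only covers North American style numbers such as `555-123-4567` or `(555) 987-6543`. The sketch works on a plain string, on the assumption that the text has already been pulled out of the PDF (for instance with PDFBox's `PDFTextStripper`).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch (not the repository's extractPhoneNumber.java):
// finds phone-number-like substrings in text already extracted from a PDF.
final class PhoneNumberSketch {

    // Assumed format: optional "+1"/"1" prefix, a 3-digit area code
    // (optionally in parentheses), then 3 and 4 digits, separated by
    // an optional space, dot, or hyphen. Longer digit runs are rejected.
    private static final Pattern PHONE = Pattern.compile(
            "(?<!\\d)(?:\\+?1[\\s.-]?)?(?:\\(\\d{3}\\)|\\d{3})[\\s.-]?\\d{3}[\\s.-]?\\d{4}(?!\\d)");

    // Returns every match in the order it appears in the text.
    static List<String> extractPhoneNumbers(String text) {
        List<String> numbers = new ArrayList<>();
        Matcher matcher = PHONE.matcher(text);
        while (matcher.find()) {
            numbers.add(matcher.group());
        }
        return numbers;
    }

    public static void main(String[] args) {
        String sample = "Call 555-123-4567 or (555) 987-6543 today.";
        for (String number : extractPhoneNumbers(sample)) {
            System.out.println(number);
        }
    }
}
```

Keeping the matching logic separate from the PDF reading makes it easy to try the pattern on sample strings first; other national formats would need a different pattern.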