Reads metadata from Microsoft Office documents and PDFs

Primary language: Java. License: Apache License 2.0 (Apache-2.0).

Document Metadata Reader

Archive Notice

This project is being archived as of July 2024.

Introduction

The document metadata reader extracts metadata from documents. The following document types are supported:

  • .docx - Microsoft Word Document
  • .pdf - PDF Document
  • .pptx - Microsoft PowerPoint
  • .xlsx - Microsoft Excel
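
As an illustration of the list above, here is a minimal sketch of how a file name could be matched against the four supported extensions. The class name SupportedDocuments and the method describe are hypothetical and are not taken from the project. Treating extensions as case-insensitive is also an assumption.

```java
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

// Hypothetical helper, not part of the project: maps a file name to one of
// the four supported document types based on its extension.
final class SupportedDocuments {

    // Extension -> description, mirroring the list of supported documents.
    private static final Map<String, String> TYPES = Map.of(
            "docx", "Microsoft Word Document",
            "pdf", "PDF Document",
            "pptx", "Microsoft PowerPoint",
            "xlsx", "Microsoft Excel");

    private SupportedDocuments() {}

    // Returns the document type, or an empty Optional if the extension is
    // missing or unsupported. Case-insensitive matching is an assumption.
    static Optional<String> describe(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return Optional.empty();
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        return Optional.ofNullable(TYPES.get(ext));
    }

    public static void main(String[] args) {
        for (String name : args) {
            System.out.println(name + " -> " + describe(name).orElse("unsupported"));
        }
    }
}
```

Returning an Optional lets the caller skip unsupported files without a null check.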

Execution

The main class is com.smuralee.documents.ReadProperties. Run the executable JAR, passing the path of the top-level folder that contains the documents as the argument.

java -jar target/document-metadata-reader-1.0-jar-with-dependencies.jar documents/
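
The folder argument suggests a scan of the directory tree for supported files. The sketch below shows one way such a scan might look using only the JDK. It does not reproduce ReadProperties: the class name DocumentScanner is hypothetical, and both the descent into subfolders and the case-insensitive extension matching are assumptions.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical sketch, not the project's code: collects every supported
// document found under a top-level folder.
final class DocumentScanner {

    private static final Set<String> EXTENSIONS = Set.of("docx", "pdf", "pptx", "xlsx");

    private DocumentScanner() {}

    // Walks the tree below root (assumption: subfolders are included) and
    // returns the regular files with a supported extension, sorted by path.
    static List<Path> findDocuments(Path root) throws IOException {
        try (Stream<Path> paths = Files.walk(root)) {
            return paths.filter(Files::isRegularFile)
                    .filter(DocumentScanner::hasSupportedExtension)
                    .sorted()
                    .collect(Collectors.toList());
        }
    }

    static boolean hasSupportedExtension(Path path) {
        String name = path.getFileName().toString().toLowerCase(Locale.ROOT);
        int dot = name.lastIndexOf('.');
        return dot >= 0 && EXTENSIONS.contains(name.substring(dot + 1));
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("Usage: java DocumentScanner <folder>");
            System.exit(1);
        }
        findDocuments(Paths.get(args[0])).forEach(System.out::println);
    }
}
```

The try-with-resources block matters here because the stream returned by Files.walk holds open directory handles until it is closed.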

Output

A successful run creates the Excel file Document_Inventory.xlsx in the directory.