/t3ext-extractor

TYPO3 Extension extractor. This extension detects and extracts metadata (EXIF / IPTC / XMP / ...) from potentially thousand different file types (such as MS Word/Powerpoint/Excel documents, PDF and images) and bring them automatically and natively to TYPO3 when uploading assets. Works with built-in PHP functions but takes advantage of Apache Tika and other external tools for enhanced metadata extraction.

Primary LanguagePHPGNU General Public License v2.0GPL-2.0

Metadata and content analysis service

This extension detects and extracts metadata (EXIF / IPTC / XMP / ...) from potentially thousand different file types (such as MS Word/Powerpoint/Excel documents, PDF and images) and bring them automatically and natively to TYPO3 when uploading assets. Works with built-in PHP functions but takes advantage of Apache Tika and other external tools for enhanced metadata extraction.

Metadata for a document

Requirements

For best results, Apache Tika is required (either as standalone JAR or running as server).

Extraction of metadata from common image files (jpg, tiff, ...) is often quicker using external tool exiftool and if not available, it will fall back to PHP's built-in EXIF and IPTC library.

For PDF, external tool pdfinfo will be used.

Read more in the manual.