filext
A Python library that identifies a file's type from its file signature.
Installation
pip install filext
Usage
If the category of the file is not known, you may use whatfile, although it is a little slower.
from filext import whatfile

file_path = "./tests/files/document.pdf"

# Pass the file as a path string
file_type = whatfile(file_path)

# Pass the file contents as bytes
with open(file_path, "rb") as file:
    file_type = whatfile(file.read())
If the category of the file is known, you may use the function for that category instead.
from filext import whatdoc

file_path = "./tests/files/document.pdf"

# Pass the file as a path string
file_type = whatdoc(file_path)

# Pass the file contents as bytes
with open(file_path, "rb") as file:
    file_type = whatdoc(file.read())
Supported File Types
Documents
- DOC
- PPT
- XLS
- DOCX
- PPTX
- XLSX
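To illustrate what signature-based detection involves for the document types above, here is a minimal sketch. It is not filext's implementation; the names sniff_document, OLE2_MAGIC, and OOXML_MARKERS are hypothetical. It relies on two general facts: legacy DOC, PPT, and XLS files share the OLE2 compound file header, and DOCX, PPTX, and XLSX files are ZIP archives that can be told apart by a characteristic member name.

```python
import io
import zipfile
from typing import Optional

# Header shared by legacy OLE2 compound files (DOC, PPT, XLS).
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")

# Hypothetical lookup table: a characteristic archive member for each OOXML type.
OOXML_MARKERS = {
    "word/document.xml": "docx",
    "ppt/presentation.xml": "pptx",
    "xl/workbook.xml": "xlsx",
}


def sniff_document(data: bytes) -> Optional[str]:
    """Return a coarse document type for raw bytes, or None if unrecognised."""
    if data.startswith(OLE2_MAGIC):
        # DOC, PPT and XLS all share this header; telling them apart
        # requires parsing the compound file, which this sketch skips.
        return "ole2"
    if data.startswith(b"PK\x03\x04"):
        # ZIP local file header: inspect member names to pick the OOXML type.
        try:
            with zipfile.ZipFile(io.BytesIO(data)) as archive:
                names = set(archive.namelist())
        except zipfile.BadZipFile:
            return None
        for marker, kind in OOXML_MARKERS.items():
            if marker in names:
                return kind
    return None
```

Because the ZIP branch has to open the archive, a check like this needs the whole file rather than only its first few bytes.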
Images
- BMP
- GIF
- HEIC
- JPG
- PNG
- TIF
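Most of the image types above can be recognised from their leading bytes alone. The following sketch shows the general idea with well-known signatures. It is an illustration under stated assumptions, not filext's code: sniff_image and IMAGE_SIGNATURES are hypothetical names. HEIC is left out because its marker is an "ftyp" box inside an ISO base media container rather than a fixed prefix.

```python
from typing import Optional

# Leading byte signatures for several image formats.
IMAGE_SIGNATURES = [
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"\xff\xd8\xff", "jpg"),
    (b"BM", "bmp"),
    (b"II*\x00", "tif"),  # little-endian TIFF
    (b"MM\x00*", "tif"),  # big-endian TIFF
]


def sniff_image(data: bytes) -> Optional[str]:
    """Return the image type whose signature prefixes data, or None."""
    for magic, kind in IMAGE_SIGNATURES:
        if data.startswith(magic):
            return kind
    return None
```

Short signatures such as the two-byte BMP prefix can match unrelated data, so a production detector would usually validate more of the header.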