/filext

Python library to identify file type based on its file signature

Primary LanguagePythonMIT LicenseMIT

filext

Python library to identify file type based on its file signature

Installation

pip install filext

Usage

If the category of the file is not certain, you may use whatfile but is a little slower

from filext import whatfile

file_path = "./tests/files/document.pdf"

# pass file as path str
file_type = whatfile(file_path)

with open(filepath, "rb") as file:
    # pass file as bytes
    file_type = whatfile(file.read())

If the category of the file is known, you may use the function for that category instead.

from filext import whatdoc

file_path = "./tests/files/document.pdf"

# pass file as path str
file_type = whatdoc(file_path)

with open(filepath, "rb") as file:
    # pass file as bytes
    file_type = whatdoc(file.read())

Supported File Types

Documents

  • PDF
  • DOC
  • PPT
  • XLS
  • DOCX
  • PPTX
  • XLSX

Images

  • BMP
  • GIF
  • HEIC
  • JPG
  • PNG
  • TIF