extract
There are 927 repositories under extract topic.
YaoFANGUK/video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
scinfu/SwiftSoup
SwiftSoup: Pure Swift HTML Parser, with best of DOM, CSS, and jquery (Supports Linux, iOS, Mac, tvOS, watchOS)
mholt/archiver
DEPRECATED. Please use mholt/archives instead.
dlt-hub/dlt
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
torakiki/pdfsam
PDFsam, a desktop application to split, merge, mix, rotate PDF files and extract pages
atlanhq/camelot
Camelot: PDF Table Extraction for Humans
Wisser/Jailer
Database Subsetting and Relational Data Browsing Tool.
mafaca/UtinyRipper
GUI and API library to work with Engine assets, serialized and bundle files
CatchTheTornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
retroplasma/earth-reverse-engineering
Reversing Google's 3D satellite mode
DonJayamanne/pythonVSCode
This extension is now maintained in the Microsoft fork.
j4k0xb/webcrack
Deobfuscate obfuscator.io, unminify and unpack bundled javascript
dompdf/php-font-lib
A library to read, parse, export and make subsets of different types of font files.
extractus/article-extractor
To extract main article from given URL with Node.js
camelot-dev/excalibur
A web interface to extract tabular data from PDFs
JonathanLink/PDFLayoutTextStripper
Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class (from the Apache PDFBox library).
activescott/lessmsi
A tool to view and extract the contents of an Windows Installer (.msi) file.
wix-incubator/vscode-glean
The extension provides refactoring tools for your React codebase
kevva/download
Download and extract files
laktak/extrakto
extrakto for tmux - quickly select, copy/insert/complete text without a mouse
MasterScrat/Chatistics
💬 Python scripts to parse Messenger, Hangouts, WhatsApp and Telegram chat logs into DataFrames.
OmkarPathak/pyresparser
A simple resume parser used for extracting information from resumes
XboxDev/extract-xiso
Xbox ISO Creation/Extraction utility. Imported from SourceForge.
exyte/ReadabilityKit
Preview extractor for news, articles and full-texts in Swift
OP-Engineering/link-preview-js
⛓ Extract web links information: title, description, images, videos, etc. [via OpenGraph], runs on mobiles and node.
003random/getJS
A tool to fastly get all javascript sources/files
pgilad/leasot
Parse and output TODOs and FIXMEs from comments in your files
paillave/Etl.Net
Mass processing data with a complete ETL for .net developers
slingdata-io/sling-cli
Sling is a CLI tool that extracts data from a source storage/database and loads it in a target storage/database.
ICIJ/datashare
A self‑hosted search engine for documents. Help us improve Datashare by answering a survey on structured content: https://forms.gle/PYgusFsoBaMyzUec9
Linzaer/Face-Track-Detect-Extract
💎 Detect , track and extract the optimal face in multi-target faces (exclude side face and select the optimal face).
retroplasma/flyover-reverse-engineering
Reversing Apple's 3D satellite mode
Ne-Lexa/php-zip
PhpZip is a php-library for extended work with ZIP-archives.
xvoland/Extract
Bash/Zsh function for extract: .zip, .rar, .bz2, .gz, .zlib, .tar, .tbz2, .tgz, .Z, .7z, .xz, .exe, .tar.bz2, .tar.gz, .tar.xz, etc.
kevva/decompress
Extracting archives made easy
luukdv/color.js
Extract colors from an image (0.75 KB) 🎨