extractor
There are 693 repositories under the extractor topic.
peazip/PeaZip
Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.
Bioruebe/UniExtract2
Universal Extractor 2 is a tool to extract files from any type of archive or installer.
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
TeamNewPipe/NewPipeExtractor
NewPipe's core library for extracting data from streaming sites
zelon88/HRConvert2
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
tatuylonen/wiktextract
Wiktionary dump file parser and multilingual data extractor
StanGirard/seo-audits-toolkit
SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...
pcjbird/AssetsExtractor
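Assets Extractor is a developer tool for OS X that extracts packaged resources such as PNG images and PDFs from Assets.car or xxx.app. Assets.car is commonly used for resource packaging in iOS, Mac, and Unity development.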
AlexMathew/scrapple
A framework for creating semi-automatic web content extractors
WirelessAlien/ZipXtract
A fully open source app to extract rar, zip, tar, bz2, gz, 7z, xz, jar, z, and other archives (encrypted .zip and .7z supported)
alexrintt/kanade
Android app to extract apks from installed apps.
MikeMeliz/TorCrawl.py
Crawl and extract (regular or onion) webpages through the Tor network
tobyxdd/android-ota-payload-extractor
A fast & natively cross-platform Android OTA payload extractor written in Go
adoconnection/SevenZipExtractor
C# wrapper for 7z.dll
zhupingqi/RuiJi.Net
crawler framework, distributed crawler extractor
opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
dwisiswant0/galer
A fast tool to fetch URLs from HTML attributes by crawling.
lipoja/URLExtract
URLExtract is a Python class for collecting (extracting) URLs from a given text based on locating TLDs.
hossein-mohseni/CF-Web
🌥 Iran cloudflare Domain list 🌍
alexeichhorn/YouTubeKit
YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS
suosi-inc/go-pkg-spider
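A relatively intelligent, general-purpose data extraction library for news websites, implemented in Golang, that requires no rule maintenance. It includes components for domain probing, web page encoding and language detection, link classification and extraction, news element extraction, and news body text extraction.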
microsoft/RecursiveExtractor
RecursiveExtractor is a .NET Standard 2.0 archive extraction library and command-line tool that can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives, as well as any nested combination of the supported formats.
crazy-max/undock
Extract contents of a container image in a local folder
Bioruebe/godotdec
An unpacker for Godot Engine package files (.pck)
lifenjoiner/ISx
ISx is an InstallShield installer extractor
hxz393/BrutalityExtractor
Multiprocess decompression software for high-performance systems
UE-Explorer/UE-Explorer
UnrealScript decompiler and explorer tool for Unreal Engine packages.
LittleBigBug/QuickBMS
QuickBMS by aluigi - Github Mirror
vgmoose/OpenBackupExtractor
A free program for extracting data (like voicemails) from iPhone and iPad backups.
gilbsgilbs/babel-plugin-i18next-extract
Babel plugin that statically extracts i18next and react-i18next translation keys.
CTR-tools/CTR-tools
Crash Team Racing (PS1) tools - a C# framework and a set of tools by DCxDemo to parse files found in the original kart racing game by Naughty Dog (and a bit of Crash Bash too).
morungos/node-word-extractor
Read data from a Word document using node.js
astahmer/box-extractor
Static code extraction. Zero-runtime CSS-in-TS `<Box />` -> became a part of Panda CSS
schollz/ingredients
Extract recipe ingredients from any recipe website on the internet.
bpolaszek/bentools-etl
PHP ETL (Extract / Transform / Load) library with SOLID principles and almost no dependencies.
cdimascio/essence
Automatically extract the main text content (and more) from an HTML document