extractor

There are 726 repositories under extractor topic.

  • peazip/PeaZip

    Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.

    Language:Pascal5.9k730323
  • Bioruebe/UniExtract2

    Universal Extractor 2 is a tool to extract files from any type of archive or installer.

    Language:AutoIt4k142313365
  • news-please

    fhamborg/news-please

    news-please - an integrated web crawler and information extractor for news that just works

    Language:Python2.3k52180443
  • TeamNewPipe/NewPipeExtractor

    NewPipe's core library for extracting data from streaming sites

    Language:Java1.6k68544488
  • zelon88/HRConvert2

    A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.

    Language:PHP1.3k165278
  • tatuylonen/wiktextract

    Wiktionary dump file parser and multilingual data extractor

    Language:Python9971638499
  • seo-audits-toolkit

    StanGirard/seo-audits-toolkit

    SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...

    Language:Python7341541136
  • WirelessAlien/ZipXtract

    A fully open source app to extract rar, zip, tar, bz2, gz, 7z, xz, jar and z etc (encrypted .zip & .7z supported)

    Language:Kotlin72784430
  • pcjbird/AssetsExtractor

    『Assets提取工具』是一款OSX平台上用于将Assets.car或xxx.app中打包的png图片、pdf等资源重新提取出来的开发者工具。Assets.car常见于iOS/Mac/Unity等开发中的资源打包。

    Language:Objective-C5989782
  • AlexMathew/scrapple

    A framework for creating semi-automatic web content extractors

    Language:Python501231741
  • TorCrawl.py

    MikeMeliz/TorCrawl.py

    Crawl and extract (regular or onion) webpages through TOR network

    Language:Python43261978
  • kanade

    alexcmgit/kanade

    Android app to extract apks from installed apps.

    Language:Dart40163419
  • tobyxdd/android-ota-payload-extractor

    A fast & natively cross-platform Android OTA payload extractor written in Go

    Language:Go3782721
  • adoconnection/SevenZipExtractor

    C# wrapper for 7z.dll

    Language:C#326175584
  • alexeichhorn/YouTubeKit

    YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS

    Language:Swift293217766
  • opensemanticsearch/open-semantic-etl

    Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

    Language:Python2712613772
  • zhupingqi/RuiJi.Net

    crawler framework, distributed crawler extractor

    Language:C#2669639
  • galer

    dwisiswant0/galer

    A fast tool to fetch URLs from HTML attributes by crawl-in.

    Language:Go25861539
  • lipoja/URLExtract

    URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.

    Language:Python25799663
  • CF-Web

    hossein-mohseni/CF-Web

    🌥 Iran cloudflare Domain list 🌍

  • UE-Explorer/UE-Explorer

    UnrealScript decompiler and explorer tool for Unreal Engine packages.

    Language:C#24976938
  • undock

    crazy-max/undock

    Extract contents of a container image in a local folder

    Language:Go2122615
  • suosi-inc/go-pkg-spider

    一个 Golang 实现的相对智能、无需规则维护的通用新闻网站数据提取工具库。含域名探测、网页编码语种识别、网页链接分类提取、网页新闻要素抽取以及新闻正文抽取等组件。

    Language:Go2124010
  • LittleBigBug/QuickBMS

    QuickBMS by aluigi - Github Mirror

    Language:C21091224
  • microsoft/RecursiveExtractor

    RecursiveExtractor is a .NET Standard 2.0 archive extraction Library, and Command Line Tool which can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives and any nested combination of the supported formats.

    Language:C#203115833
  • Bioruebe/godotdec

    An unpacker for Godot Engine package files (.pck)

    Language:C#1935417
  • hxz393/BrutalityExtractor

    适用于高性能系统的多进程解压缩软件(A multiprocess decompression software for high-performance system)

    Language:Python1913212
  • lifenjoiner/ISx

    ISx is an InstallShield installer extractor

    Language:C189131323
  • CTR-tools

    CTR-tools/CTR-tools

    Crash Team Racing (PS1) tools - a C# framework and a set of tools by DCxDemo to parse files found in the original kart racing game by Naughty Dog (and a bit of Crash Bash too).

    Language:C#1721419814
  • gilbsgilbs/babel-plugin-i18next-extract

    Babel plugin that statically extracts i18next and react-i18next translation keys.

    Language:TypeScript169511839
  • vgmoose/OpenBackupExtractor

    A free program for extracting data (like voicemails) from iPhone and iPad backups.

    Language:Swift1689925
  • gengteng/axum-valid

    axum-valid is a library that provides data validation extractors for the Axum web framework. It integrates validator, garde and validify, three popular validation crates in the Rust ecosystem, to offer convenient validation and data handling extractors for Axum applications.

    Language:Rust1532278
  • morungos/node-word-extractor

    Read data from a Word document using node.js

    Language:JavaScript14264530
  • schollz/ingredients

    Extract recipe ingredients from any recipe website on the internet.

    Language:HTML12941127
  • astahmer/box-extractor

    Static code extraction. Zero-runtime CSS-in-TS `<Box />` -> became a part of Panda CSS

    Language:TypeScript127473
  • bpolaszek/bentools-etl

    PHP ETL (Extract / Transform / Load) library with SOLID principles + almost no dependency.

    Language:PHP126534