extract-data
There are 255 repositories under extract-data topic.
Agenty/scrapingai
Build web scraping agents using AI to auto-extract the data from websites, capture screenshot, generate pdf from URL and web crawling with Agenty
DapengFeng/waymo-toolkit
A toolkit for extracting elements and visualization for Waymo Open Dataset
guillaC/SQLiteDiskExplorer
SQLiteDiskExplorer enables you to explore, catalog, and batch extract SQLite files from disks and removable media.
dmryutov/parsers
Collection of parsers written in PHP, Python
laur89/docker-seedbox-rclone-fetch-extract
Dockerised service pulling data from remote seedbox & extracting archives
pdfix/pdfix_sdk_example_dotnet
Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...
BlockBuilder57/XBC2ModelDecomp
Extracts Xenoblade 2 models into XNALara and glTF format
floriancochard/extract-data-from-paper
A tool designed to extract numerical data from scanned historical weather documents.
MeltanoLabs/tap-dbt
Singer Tap for dbt API v2 built with the Meltano SDK
CatherineFramework/mercy
Mercy is an open-source Rust crate and CLI designed for building cybersecurity utilities and projects.
darkskygit/ChatImporter
import chat records from your im and store into single sqlite database
aidayang/MinerU-OneClick
MinerU免安装部署一键启动整合包
dewshr/NCBI-GenBank-file-parser
This program can be used to parse the NCBI GenBank file to create a tabulated csv file.
KEZIMAdynamics/DokuExtractor
Easily extract data from PDF documents
jehad-halahla/linux_project
a linux lab bash project that focuses on automation and text extraction
ocampor/pivot-table-to-csv
This repository takes a *.xslx that contains a Pivot Table with hidden external source data and converts the pivot cache into CSV. It takes into account files that are too big to be in memory and handles this situation by dividing the original data into n batches.
Agenta-AI/job_extractor_template
Template for an AI application that extracts the job information from a job description using openAI functions and langchain
Genone22/emails_phones_scraping
Extract emails and phone numbers from the list of url addresses
PatrykBala/DocumentNER
Extract text data from documents using OCR (optical character recognition) technology and NER (named entity recognition).
kormanowsky/jextract
Allows extracting data from DOM
zSynctic/Img2Txt
Img2Txt - Extract Text From Images using AI
JdeJabali/JXLDataTableExtractor
Extract data as tables from Excel. Search columns by their header or index number. Sets conditions for extracting the rows.
jeffersonsalvador/cnpj-extractor
🇺🇸 Solution for importing and analyzing public Brazilian business data (CNPJ). 🇧🇷 Processamento de Dados CNPJ: Uma solução robusta e conteinerizada para importação e análise de dados empresariais brasileiros (CNPJ).
sypht-team/sypht-elixir-client
An Elixir client for the Sypht API https://sypht.com
TIMESTICKING/image_graph_line_to_digital_convert
read digital numbers(points) from a image with plots. image to plot. image2digital. line2digital. image2digitalline. graph2digitalline. imageline2digital. graphline2digital. pictureline2digital. readimageline.
alexey-savchenko-am/Excel.DataTable
Allows to extract data from excel table or write some data to table.
bjorn3/goodgame_empire_import
A importer for goodgame empire
diannejardinez/ETL-Project
Extraction of data from websites and available APIs. Transformation of datasets. Loading datasets in pgAdmin with PostgreSQL
malakhovks/doc-docx-extract-api
Atomic Web Service (AWS, REST API) for converting DOC/DOCX files to plain/text, powered by catdoc, docx2txt and Node.js
mheriyanto/EDFI
:earth_asia: EDFI is an open-source script to extract data from a 2D or 3D image.
astemiracle/d2
extracting dota 2 stats
mohamedhaddi/recursive-extractor
Extract a recursively compressed single file (multiple archive formats).
orvill-as/extract-email
This program prompts the user for input and output file paths, extracts email addresses from the input file using a regular expression, and writes the email addresses to the output file. It also measures and prints the elapsed time taken to run the program.
OxideDevX/info_you_windows
Script for extracting data about the computer with the record of the latter in the text log file
palwesh/Resume-Parser
Extract the data from resume using djnago rest api
pratik149/pdf-table-extractor
Extract tables from searchable as well as non-searchable pdf files