data-collection
There are 1043 repositories under data-collection topic.
NaiboWang/EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
airbytehq/airbyte
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
snowplow/snowplow
The leader in Customer Data Infrastructure
cloudquery/cloudquery
The open source ELT framework powered by Apache Arrow
firecrawl/firecrawl-mcp-server
🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
jitsucom/jitsu
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
pyper-dev/pyper
Concurrent Python made simple
brightdata/brightdata-mcp
A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.
Decodo/Decodo
HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.
plan-player-analytics/Plan
Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar:
getodk/collect
ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨
chaoss/augur
Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/
pnoker/iot-dc3
IoT DC3 is a fully open-source distributed Internet of Things (IoT) platform built on Spring Cloud. It accelerates IoT project development and simplifies IoT device management, offering a comprehensive solution for building robust IoT systems.
zhaoyachao/zdh_web
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
chapmanjacobd/library
99+ CLI tools to build, browse, and blend your media library
K3V1991/Disable-Firefox-Telemetry-and-Data-Collection
How to disable Firefox Telemetry and Data Collection
ScriptSmith/reaper
Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
graphlit/graphlit-mcp-server
Model Context Protocol (MCP) Server for Graphlit Platform
elbwalker/walkerOS
Open source tag management and event data collection
wq/wq
📱🌐📋 wq: a modular framework supporting web / native apps for mobile surveys and geospatial data collection. Powered by Django REST Framework, Redux, React, and Material UI.
ProjectNeura/LEADS
Enable your racing car with powerful, data-driven instrumentation, control, and analysis systems, all wrapped up in a gorgeous look.
alibaba/android_viewtracker
A data collection library for click and exposure event with the UI.
MobileRoboticsSkoltech/OpenCamera-Sensors
Android app for synchronized recording of video and IMU data with advanced camera options, useful for 3D reconstruction, SLAM, AR, video stabilization. Supports remote control over network.
silverton-io/buz
Serverless multi-protocol + multi-destination event collection system.
madhavmk/Noise2Noise-audio_denoising_without_clean_training_data
Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.
notmarek/BeFake
BeReal Python API wrapper
CertifaiAI/classifai
:fire: One of the most comprehensive open-source data annotation platform.
FFTAI/teleoperation
A.K.A. Fourier Advanced Robot Teleoperation System (F.A.R.T.S.) 💨
WuKongSecurity/SpiderBOX
SpiderBox - 虫盒 - 爬虫逆向资源导航站
wq/wq.app
💻📱 wq's app library: a JavaScript framework powering offline-first web & native apps for geospatial data collection, mobile surveys, and citizen science. Powered by Redux, React, Material UI and Maplibre GL.
wq/wq.db
☁🌐 wq's db library, extending Django REST framework to support apps for geospatial field data collection, citizen science, and crowdsourcing.
Minipada/ros2_data_collection
Collect, validate and send data reliably from ROS 2 to create APIs and dashboards.
DouglasNeuroInformatics/OpenDataCapture
An electronic data capture platform for administering remote and in-person clinical instruments
bps-statistics/form-gear
FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.
getodk/build
ODK Build is a drag-and-drop form designer for ODK XForms. Thousands of users around the world depend on it for their data collection campaigns. Contribute and make the world a better place! ✨📝✨
hiDaDeng/shreport
上海证券交易所上市公司定期报告下载,项目地址