data-collection

There are 1043 repositories under data-collection topic.

  • NaiboWang/EasySpider

    A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

    Language:JavaScript42.5k2487215.2k
  • airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Language:Python19.5k19215.4k4.8k
  • snowplow

    snowplow/snowplow

    The leader in Customer Data Infrastructure

    Language:Scala7k2674k1.2k
  • cloudquery

    cloudquery/cloudquery

    The open source ELT framework powered by Apache Arrow

    Language:Go6.2k662.2k542
  • firecrawl/firecrawl-mcp-server

    🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.

    Language:JavaScript4.5k465
  • jitsu

    jitsucom/jitsu

    Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

    Language:TypeScript4.4k46594315
  • pyper-dev/pyper

    Concurrent Python made simple

    Language:Python1.5k2730
  • brightdata/brightdata-mcp

    A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

    Language:JavaScript1.3k
  • Decodo/Decodo

    HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

    Language:Java1.1k47
  • Plan

    plan-player-analytics/Plan

    Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar:

    Language:Java956161.9k169
  • getodk/collect

    ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨

    Language:Kotlin745593.2k1.4k
  • augur

    chaoss/augur

    Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/

    Language:Python65421725902
  • pnoker/iot-dc3

    IoT DC3 is a fully open-source distributed Internet of Things (IoT) platform built on Spring Cloud. It accelerates IoT project development and simplifies IoT device management, offering a comprehensive solution for building robust IoT systems.

    Language:Java596120201
  • zhaoyachao/zdh_web

    大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块

    Language:Java5252010176
  • library

    chapmanjacobd/library

    99+ CLI tools to build, browse, and blend your media library

    Language:Python44383814
  • K3V1991/Disable-Firefox-Telemetry-and-Data-Collection

    How to disable Firefox Telemetry and Data Collection

  • ScriptSmith/reaper

    Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

    Language:Python389271367
  • graphlit/graphlit-mcp-server

    Model Context Protocol (MCP) Server for Graphlit Platform

    Language:TypeScript3591021
  • elbwalker/walkerOS

    Open source tag management and event data collection

    Language:TypeScript303823015
  • wq/wq

    📱🌐📋 wq: a modular framework supporting web / native apps for mobile surveys and geospatial data collection. Powered by Django REST Framework, Redux, React, and Material UI.

    Language:JavaScript259215447
  • ProjectNeura/LEADS

    Enable your racing car with powerful, data-driven instrumentation, control, and analysis systems, all wrapped up in a gorgeous look.

    Language:Python2584130337
  • alibaba/android_viewtracker

    A data collection library for click and exposure event with the UI.

    Language:Java23817244
  • MobileRoboticsSkoltech/OpenCamera-Sensors

    Android app for synchronized recording of video and IMU data with advanced camera options, useful for 3D reconstruction, SLAM, AR, video stabilization. Supports remote control over network.

    Language:Java21084820
  • silverton-io/buz

    Serverless multi-protocol + multi-destination event collection system.

    Language:Go207731226
  • madhavmk/Noise2Noise-audio_denoising_without_clean_training_data

    Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.

    Language:Jupyter Notebook1976745
  • notmarek/BeFake

    BeReal Python API wrapper

  • classifai

    CertifaiAI/classifai

    :fire: One of the most comprehensive open-source data annotation platform.

    Language:Java124414426
  • FFTAI/teleoperation

    A.K.A. Fourier Advanced Robot Teleoperation System (F.A.R.T.S.) 💨

    Language:Python124305
  • SpiderBOX

    WuKongSecurity/SpiderBOX

    SpiderBox - 虫盒 - 爬虫逆向资源导航站

    Language:CSS1213220
  • wq/wq.app

    💻📱 wq's app library: a JavaScript framework powering offline-first web & native apps for geospatial data collection, mobile surveys, and citizen science. Powered by Redux, React, Material UI and Maplibre GL.

    Language:JavaScript1201410832
  • wq/wq.db

    ☁🌐 wq's db library, extending Django REST framework to support apps for geospatial field data collection, citizen science, and crowdsourcing.

    Language:Python11675618
  • Minipada/ros2_data_collection

    Collect, validate and send data reliably from ROS 2 to create APIs and dashboards.

    Language:C++11551399
  • DouglasNeuroInformatics/OpenDataCapture

    An electronic data capture platform for administering remote and in-person clinical instruments

    Language:TypeScript114436213
  • bps-statistics/form-gear

    FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.

    Language:TypeScript1124611
  • build

    getodk/build

    ODK Build is a drag-and-drop form designer for ODK XForms. Thousands of users around the world depend on it for their data collection campaigns. Contribute and make the world a better place! ✨📝✨

    Language:JavaScript1102419581
  • hiDaDeng/shreport

    上海证券交易所上市公司定期报告下载,项目地址

    Language:Python1092433