data-collection

There are 821 repositories under data-collection topic.

  • NaiboWang/EasySpider

    A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

    Language:JavaScript36.2k2265314.4k
  • airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Language:Python16.3k18814.7k4.2k
  • snowplow

    snowplow/snowplow

    The leader in Next-Generation Customer Data Infrastructure

    Language:Scala6.8k2694k1.2k
  • cloudquery

    cloudquery/cloudquery

    The open source high performance ELT framework powered by Apache Arrow

    Language:Go5.9k632.2k513
  • jitsu

    jitsucom/jitsu

    Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

    Language:TypeScript4.1k43562295
  • Smartproxy/Smartproxy

    HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

    Language:Java1.1k22838
  • Plan

    plan-player-analytics/Plan

    Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar:

    Language:Java874171.9k169
  • getodk/collect

    ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨

    Language:Java718603k1.4k
  • augur

    chaoss/augur

    Python library and web service for Open Source Software Health and Sustainability metrics & data collection. You can find our documentation and new contributor information easily here: https://oss-augur.readthedocs.io/en/main/ and learn more about Augur at our website https://augurlabs.io

    Language:Python59222685845
  • pnoker/iot-dc3

    IoT DC3 is an open-source distributed Internet of Things (IoT) platform based on Spring Cloud. It is used for rapid development of IoT projects and management of IoT devices, providing a comprehensive solution for IoT system development.

    Language:Java537140187
  • zhaoyachao/zdh_web

    大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块

    Language:Java4682010171
  • library

    chapmanjacobd/library

    90+ CLI tools to build, browse, and blend your media library: an index for your archive.

    Language:Python37493410
  • ScriptSmith/reaper

    Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

    Language:Python368281370
  • elbwalker/walkerOS

    Open-source event collection and tag management (gtag.js/GTM alternative)

    Language:TypeScript294718015
  • K3V1991/Disable-Firefox-Telemetry-and-Data-Collection

    How to disable Firefox Telemetry and Data Collection

  • wq/wq

    📱🌐📋 wq: a modular framework supporting web / native apps for mobile surveys and geospatial data collection. Powered by Django REST Framework, Redux, React, and Material UI.

    Language:JavaScript259225446
  • alibaba/android_viewtracker

    A data collection library for click and exposure event with the UI.

    Language:Java24018246
  • silverton-io/buz

    Serverless multi-protocol + multi-destination event collection system.

    Language:Go198731123
  • madhavmk/Noise2Noise-audio_denoising_without_clean_training_data

    Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoising networks using only noisy speech samples.

    Language:Jupyter Notebook1787742
  • MobileRoboticsSkoltech/OpenCamera-Sensors

    Android app for synchronized recording of video and IMU data with advanced camera options, useful for 3D reconstruction, SLAM, AR, video stabilization. Supports remote control over network.

    Language:Java17894819
  • notmarek/BeFake

    BeReal Python API wrapper

  • wq/wq.app

    💻📱 wq's app library: a JavaScript framework powering offline-first web & native apps for geospatial data collection, mobile surveys, and citizen science. Powered by Redux, React, Material UI and Maplibre GL.

    Language:JavaScript1201610832
  • classifai

    CertifaiAI/classifai

    :fire: One of the most comprehensive open-source data annotation platform.

    Language:Java119514525
  • wq/wq.db

    ☁🌐 wq's db library, extending Django REST framework to support apps for geospatial field data collection, citizen science, and crowdsourcing.

    Language:Python11795618
  • build

    getodk/build

    ODK Build is a drag-and-drop form designer for ODK XForms. Thousands of users around the world depend on it for their data collection campaigns. Contribute and make the world a better place! ✨📝✨

    Language:JavaScript1102619582
  • bps-statistics/form-gear

    FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.

    Language:TypeScript1084611
  • DouglasNeuroInformatics/OpenDataCapture

    An electronic data capture platform for administering remote and in-person clinical instruments

    Language:TypeScript103431812
  • fabianoriccardi/ESPLogger

    An Arduino library providing a minimal interface to log data on flash memory and SD cards with ESP8266 and ESP32.

    Language:C++84101515
  • Minipada/ros2_data_collection

    Collect, validate and send data reliably from ROS 2 to create APIs and dashboards.

    Language:C++8151387
  • ineffyble/genders.wtf

    Language:Nunjucks743835
  • Goblyn

    loseys/Goblyn

    Goblyn is a Python tool focused to enumeration and capture of website files metadata.

    Language:Python70319
  • mxdldev/android-amap-track-collect

    这阵子由于项目需要,需要从手机上采集用户的运动轨迹数据,这样的功能大家都见到的很多了,比如咕咚、悦动圈,对跑步运动轨迹数据进行采集,再如,微信运动、钉钉运动,对于每一天你走步进行计数,如果要记录轨迹就离不开的手机定位,如果要记录步数那就离不开陀螺仪(角速度传感器),花了一天多的时间实现了一个定位数据实时采集的功能。

    Language:Java706023
  • OpenCOVID19CoughCheck/CoughCheckApp

    Development of AI audio app to compare the cough of a Coronavirus (COVID-19) infected individual with the cough of an uninfected individual.

    Language:JavaScript6691117
  • pantunes/xtcryptosignals

    Cryptocurrencies price data collection, price tickers, signals notifications, charts, Telegram bot and more.

    Language:Python66310421
  • akvo/akvo-flow

    A data collection and monitoring tool that works anywhere.

    Language:Java65332.3k31
  • alttch/pptop

    Open, extensible Python injector/profiler/analyzer

    Language:Python63231