data-collection

There are 727 repositories under the data-collection topic.

  • NaiboWang/EasySpider

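    A visual no-code/code-free web crawler/spider. EasySpider (易采集) is a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service wrapping system for web applications.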

    Language:JavaScript27.4k1903723.2k
  • airbytehq/airbyte

    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

    Language:Python14.5k17913.7k3.7k
  • snowplow/snowplow

    The leader in Next-Generation Customer Data Infrastructure

    Language:Scala6.8k2674k1.2k
  • cloudquery/cloudquery

    The open source high performance ELT framework powered by Apache Arrow

    Language:Go5.6k582.2k499
  • jitsucom/jitsu

    Jitsu is an open-source Segment alternative. Fully scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days.

    Language:TypeScript3.9k41543272
  • plan-player-analytics/Plan

    Player Analytics plugin for Minecraft Server platforms - View player activity of your server with ease. :calendar:

    Language:Java813171.9k166
  • Smartproxy/Smartproxy

    HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

    Language:C#73119638
  • getodk/collect

    ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨

    Language:Java700612.8k1.3k
  • chaoss/augur

    Python library and web service for Open Source Software Health and Sustainability metrics & data collection. Documentation and new contributor information are available at https://oss-augur.readthedocs.io/en/main/ and more about Augur can be found at https://augurlabs.io

    Language:Python57623631844
  • pnoker/iot-dc3

    IoT DC3 is an open source, distributed Internet of Things (IoT) platform based on Spring Cloud. It is used for rapid development of IoT projects and management of IoT devices, and provides a complete set of solutions for IoT systems.

    Language:Java475110169
  • zhaoyachao/zdh_web

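    A big data collection and extraction platform. zdh_web is the visual management platform for the zdh family of services, with modules for data collection, scheduling, permissions, approval workflows, private-domain marketing, and more.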

    Language:Java4262010160
  • ScriptSmith/reaper

    Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

    Language:Python356291370
  • elbwalker/walkerOS

    Unified and privacy-centric event data collection for digital analytics

    Language:TypeScript282715116
  • wq/wq

    📱🌐📋 wq: a modular framework supporting web / native apps for mobile surveys and geospatial data collection. Powered by Django REST Framework, Redux, React, and Material UI.

    Language:JavaScript257235440
  • alibaba/android_viewtracker

    A data collection library for click and exposure events in the UI.

    Language:Java23818246
  • K3V1991/Disable-Firefox-Telemetry-and-Data-Collection

    How to disable Firefox Telemetry and Data Collection

  • chapmanjacobd/library

    70+ CLI tools to build, browse, and blend your media library. An index for your archive.

    Language:Python1855326
  • silverton-io/buz

    Serverless multi-protocol + multi-destination event collection system.

    Language:Go178531020
  • madhavmk/Noise2Noise-audio_denoising_without_clean_training_data

    Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach", accepted at the INTERSPEECH 2021 conference. The paper addresses the heavy dependence of deep learning based audio denoising methods on clean speech data by showing that deep speech denoising networks can be trained using only noisy speech samples.

    Language:Jupyter Notebook1627641
  • MobileRoboticsSkoltech/OpenCamera-Sensors

    Android app for synchronized recording of video and IMU data with advanced camera options, useful for 3D reconstruction, SLAM, AR, video stabilization. Supports remote control over network.

    Language:Java16094818
  • notmarek/BeFake

    BeReal Python API wrapper

  • wq/wq.app

    💻📱 wq's app library: a JavaScript framework powering offline-first web & native apps for geospatial data collection, mobile surveys, and citizen science. Powered by Redux, React, Material UI and Maplibre GL.

    Language:JavaScript1171710732
  • CertifaiAI/classifai

    :fire: One of the most comprehensive open-source data annotation platforms.

    Language:Java116514425
  • wq/wq.db

    ☁🌐 wq's db library, extending Django REST framework to support apps for geospatial field data collection, citizen science, and crowdsourcing.

    Language:Python116105618
  • getodk/build

    ODK Build is a drag-and-drop form designer for ODK XForms. Thousands of users around the world depend on it for their data collection campaigns. Contribute and make the world a better place! ✨📝✨

    Language:JavaScript1102619582
  • bps-statistics/form-gear

    FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.

    Language:TypeScript1094610
  • fabianoriccardi/ESPLogger

    An Arduino library providing a minimal interface to log data on flash memory and SD cards with ESP8266 and ESP32.

    Language:C++81101515
  • Minipada/ros2_data_collection

    Collect, validate and send data reliably from ROS 2 to create APIs and dashboards.

    Language:C++7641386
  • ineffyble/genders.wtf

    Language:Nunjucks703534
  • akvo/akvo-flow

    A data collection and monitoring tool that works anywhere.

    Language:Java65332.3k31
  • OpenCOVID19CoughCheck/CoughCheckApp

    Development of an AI audio app to compare the cough of a Coronavirus (COVID-19) infected individual with the cough of an uninfected individual.

    Language:JavaScript6591117
  • mxdldev/android-amap-track-collect

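    A recent project required collecting users' movement track data from their phones. This kind of feature is common: apps such as 咕咚 and 悦动圈 collect running track data, while 微信运动 and 钉钉运动 count the steps you take each day. Recording a track depends on phone positioning, and counting steps depends on the gyroscope (angular velocity sensor). I spent a little over a day implementing real-time collection of location data.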

    Language:Java636022
  • pantunes/xtcryptosignals

    Cryptocurrencies price data collection, price tickers, signals notifications, charts, Telegram bot and more.

    Language:Python63310421
  • alttch/pptop

    Open, extensible Python injector/profiler/analyzer

    Language:Python61231
  • loseys/Goblyn

    Goblyn is a Python tool focused on enumerating and capturing website file metadata.

    Language:Python61318
  • getodk/briefcase

    ODK Briefcase is a Java application for fetching and pushing forms and their contents. It helps make billions of data points from ODK portable. Contribute and make the world a better place! ✨💼✨

    Language:Java6024456156