data-verification

There are 25 repositories under data-verification topic.

  • unionai-oss/pandera

    A light-weight, flexible, and expressive statistical data testing library

    Language:Python4k201k357
  • pointblank

    rstudio/pointblank

    Data quality assessment and metadata reporting for data frames and database tables

    Language:R9903034159
  • sparkdq-community/sparkdq

    A declarative PySpark framework for row- and aggregate-level data quality validation.

    Language:Python561196
  • jeffo777/input-right

    An open-source AI voice agent platform that turns conversations into 100% accurate, user-verified data via a visual form.

    Language:TypeScript21117
  • verifalia/verifalia-js-sdk

    Verifalia REST API - Javascript SDK and helper library, for Node.js and the browser: verify email addresses in real-time and check whether they are deliverable, invalid, or otherwise risky.

    Language:JavaScript13163
  • yusangeng/io-validate

    Javascript data validator.

    Language:JavaScript6000
  • CramBL/fastPASTA

    Mirror of the repository on CERN's Gitlab. CLI for viewing and verifying data integrity on the raw binary data read out from the ALICE detector and its subdetectors.

    Language:Rust4111
  • verifalia/verifalia-node-sdk

    Verifalia SDK for Node.js - OBSOLETE, please use https://github.com/verifalia/verifalia-js-sdk

    Language:JavaScript4005
  • darsan-in/Nexa-Bot

    Nexa Auto automates the process of verifying the authenticity of addresses for room service eligibility and retrieving detailed specifications across multiple websites. Utilizing Selenium for web automation and GPT for handling missing data, Nexa Auto significantly reduces manual effort in data entry tasks.

    Language:Python310
  • rsgalloway/hashio

    Custom file and directory checksum and verification tool

    Language:Python31381
  • IQTLabs/VennData

    One of the biggest barriers to widespread machine learning adoption is the difficulty in collecting a 'good' dataset. There is an overall consensus that a 'good' dataset is a big dataset, but we believe that we can do better. As such the VennData project was created to develop tools to guide in the collection, curation, augmentation and validation of data.

    Language:Jupyter Notebook2501
  • noob-ethereum

    markodayan/noob-ethereum

    Minimalist Ethereum library for JavaScript/TypeScript developers

    Language:TypeScript2200
  • ajitsing/data_verifier

    Ruby gem to verify data

    Language:Ruby110
  • SPUR-2020-Topical-Analysis-Toolkit

    GrahamJamesKeane/SPUR-2020-Topical-Analysis-Toolkit

    Deliver insights into the topical content of undergraduate degree programmes.

    Language:Python1100
  • YannisPap/Wrangle-OpenStreetMap-Data

    Chose a region and used data munging techniques to assess the quality of the data for validity, accuracy, completeness, consistency and uniformity.

    Language:HTML1100
  • Elevated-Standards/DataDemise

    DataDemise is an application for certifying and verifying the destruction of data stored across various cloud providers. It ensures secure and verifiable destruction of data, providing certificates as proof of destruction.

    Language:Go0210
  • Predominio/Public-Webcam-Surveillance-Research

    🌐 Explore verified public webcam networks for traffic, tourism, and environmental monitoring, distinct from unauthorized surveillance practices.

  • Ramy-Badr-Ahmed/Merkle-DAG-Matlab

    Merkle-Directed Acyclic Graph (DAG) in MATLAB - https://doi.org/10.5281/zenodo.12808889

    Language:MATLAB0100
  • renlabs-dev/prediction-swarm

    First swarm formed through Torus abstractions, over the problem space of finding the internet's prophets.

    Language:Python0000
  • v1k1nghawk/Signa

    File Fingerprinting

    Language:C++0100
  • Yamada-North-America/LotCom-scanner-configs

    JavaScript-based configurations for Cognex Scanners in the LotCom Distributed System.

    Language:JavaScript01141
  • abramisola/dangerous

    It's a dangerous world out there.

    Language:Go10
  • Jeyso215/Public-Webcam-Surveillance-Research

    Verified active public camera networks (2025) with ethical context, legal frameworks, and verification methodology for academic study of surveillance infrastructure. All links manually validated on 2025-09-03.

  • rrwen/recovr-infracycle

    Pedalling Forward: The Evolution of Dedicated Cycling Infrastructure in Canadian Cities from 2010 to 2022

    Language:R10
  • Sshubam/Setu-URL-verification-AI4Bharat

    Simple streamlit application to verify content of websites before scraping them via authenticated users and storing them in Firebase Firestore.

    Language:Python10