Pinned Repositories
cambria-project
Schema evolution with bi-directional lenses.
dataunitylab-site
distributed-dependency-discovery
fuzzy-sets
jsonoid-discovery
Distributed JSON schema discovery
NADEEF
A Generalized Data Cleaning System
paper-gender-analysis
relational-playground
An exploration of relational algebra :mag:
schema-guru
JSONs -> JSON Schema (and Avro coming soon)
semantic-regex
Data Unity Lab's Repositories
dataunitylab/jsonoid-discovery
Distributed JSON schema discovery
dataunitylab/relational-playground
An exploration of relational algebra :mag:
dataunitylab/semantic-regex
dataunitylab/cambria-project
Schema evolution with bi-directional lenses.
dataunitylab/dataunitylab-site
dataunitylab/schema-guru
JSONs -> JSON Schema (and Avro coming soon)
dataunitylab/ExplainDaV
dataunitylab/fuzzy-sets
dataunitylab/json-schema-profile
dataunitylab/JSON-Schema-Test-Suite
A language agnostic test suite for the JSON Schema specifications
dataunitylab/jsonoid-bowtie
dataunitylab/jsonoid-web
dataunitylab/paper-gender-analysis
dataunitylab/RegexGenerator
This project contains the source code of a tool for generating regular expressions for text extraction: 1. automatically, 2. based only on examples of the desired behavior, 3. without any external hint about how the target regex should look like
dataunitylab/sherlock-project
This repository provides data and scripts to use Sherlock, a neural-network based model to detect semantic data types. https://sherlock.media.mit.edu
dataunitylab/.github
dataunitylab/cambria-automerge
dataunitylab/cFinder
Code release of our paper `Protecting Data Integrity of Web Applications with Database Constraints Inferred from Application Code.` in ASPLOS 2023
dataunitylab/genderComputer
Tool that tries to guess a person's gender based on their name and location
dataunitylab/holoclean
A Machine Learning System for Data Enrichment.
dataunitylab/jsonoid-server
dataunitylab/jsonsubschema
Tool for checking whether a JSON schema is a subschema of another JSON schema.
dataunitylab/modevo
Ongoing migration to GitHub
dataunitylab/oas-db
A tool to generate annotated OpenAPI specs (and their implementations) containing known anti-patterns and issues.
dataunitylab/OSSRH-96874
dataunitylab/sato
Code and data for Sato https://arxiv.org/abs/1911.06311.
dataunitylab/schemastore-analysis
dataunitylab/skinfer
Skinfer is a tool for inferring and merging JSON schemas
dataunitylab/sqlcheck
Automatically identify anti-patterns in SQL queries
dataunitylab/ui-equity-tool