Pinned Repositories
hetu-core
abstractions
SQL views for Dune Analytics
AutoML-EM
benchmarks
A place in which we publish scripts for reproducible benchmarks.
BrightID-AntiSybil
Sybil detection package for BrightID
cmpt884-fall16
SFU's Graduate Seminar on "Human-in-the-loop Data Management"
JOS
Codes for building an AI-native database
APIConnectors
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
dataprep
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
deeperlib
Deep Web Crawler for Data Enrichment
peiwangdb's Repositories
peiwangdb/JOS
Codes for building an AI-native database
peiwangdb/abstractions
SQL views for Dune Analytics
peiwangdb/AutoML-EM
peiwangdb/benchmarks
A place in which we publish scripts for reproducible benchmarks.
peiwangdb/BrightID-AntiSybil
Sybil detection package for BrightID
peiwangdb/cmpt884-fall16
SFU's Graduate Seminar on "Human-in-the-loop Data Management"
peiwangdb/contracts
smart contracts of codefordao
peiwangdb/dbt
dbt (data build tool) enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
peiwangdb/dbt-ml-preprocessing
A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.
peiwangdb/dedupe
:id: A python library for accurate and scaleable fuzzy matching, record deduplication and entity-resolution.
peiwangdb/deeper
deep entity resolution
peiwangdb/docs
Documentation for Dune Analytics
peiwangdb/docs.getdbt.com
The code behind docs.getdbt.com
peiwangdb/ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
peiwangdb/graph-node
Graph Node indexes data from blockchains such as Ethereum and serves it over GraphQL
peiwangdb/hetu-core
peiwangdb/jina
An easier way to build neural search on the cloud
peiwangdb/pachyderm
Reproducible Data Science at Scale!
peiwangdb/peiwangdb.github.io
peiwangdb/pinot
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
peiwangdb/psycopg2
PostgreSQL database adapter for the Python programming language
peiwangdb/remix-ide
Documentation for Remix IDE
peiwangdb/Spoon-Knife
This repo is for demonstration purposes only.
peiwangdb/sqlfluff
A SQL linter and auto-formatter for Humans
peiwangdb/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
peiwangdb/Uni-detect
peiwangdb/waveportal-starter-project
peiwangdb/Web3SecCheck
peiwangdb/winter
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
peiwangdb/zksync
zkSync: trustless scaling and privacy engine for Ethereum