Pinned Repositories
APIConnectors
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
AreCELearnedYet
bigdata-cmpt733
Big Data Programming II
connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
covid19-datasets
A list of high-quality open datasets for COVID-19 data analysis
dataprep
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
dbt-lineagex
deeperlib
Deep Web Crawler for Data Enrichment
lineagex
reprowd
Crowdsourced Data Processing Made Reproducible
SFU Database Group's Repositories
sfu-db/connector-x
Fastest library to load data from DB to DataFrames in Rust and Python
sfu-db/dataprep
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
sfu-db/AreCELearnedYet
sfu-db/lineagex
sfu-db/covid19-datasets
A list of high-quality open datasets for COVID-19 data analysis
sfu-db/bigdata-cmpt733
Big Data Programming II
sfu-db/APIConnectors
A curated list of example code to collect data from Web APIs using DataPrep.Connector.
sfu-db/dbt-lineagex
sfu-db/deeperlib
Deep Web Crawler for Data Enrichment
sfu-db/CleanAgent
An experimental demo repository for an agent that performs data cleaning tasks
sfu-db/BOExplain
Explaining Inference Queries with Bayesian Optimization
sfu-db/cmpt354
CMPT354: Database System I
sfu-db/dataprep-website
Website for DataPrep
sfu-db/SQLGen
An Automated SQL Query Generation Framework for Scalable Feature Discovery
sfu-db/accio
sfu-db/FeatAug
sfu-db/naru
Neural Relation Understanding: neural cardinality estimators for tabular data
sfu-db/Auto-FP-Final
Code repository for the paper "Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data"
sfu-db/connectorx-bench
sfu-db/dataprep-data
Data repository for dataprep
sfu-db/EZHacks-tutorial
sfu-db/FedRain-and-Frog
Code for FedRain and Frog (VLDB 2022)
sfu-db/feedback-kde
Fork of https://bitbucket.org/mheimel/feedback-kde/src/default/
sfu-db/incubator-wayang
Apache Wayang (incubating) is the first cross-platform data processing system.
sfu-db/learnedcardinalities
Code and workloads from the Learned Cardinalities paper (https://arxiv.org/abs/1809.00677)
sfu-db/postgres_scanner
sfu-db/public_bi_benchmark
BI benchmark with user-generated data and queries
sfu-db/quicksel
Mirror of quicksel
sfu-db/staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
sfu-db/WebConnectorSurvey