Pinned Repositories
accel-brain-code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
AggreCol
Algorithm to detect aggregations in verbose CSV files.
bioperl-db
BioPerl BioSQL ORM
csv-annotation-tool
GUI tool to annotate the type of lines/cells and aggregations.
data-binning
A data bucketization library.
data-knoller
data-knoller is a library to provide data preparation on user-specified dataset.
line-type-classification
Classify the types of lines in two dimensional tabular data.
splendor-gan
Train a GAN to learn how to play Splendor (suspended)
strudel
Strudel: Detecting structure in verbose CSV files via classifying lines and cells.
table-normalizer
Extract normalized relational tables from verbose CSV files via reinforcement learning (q-learning).
lanchiang's Repositories
lanchiang/data-knoller
data-knoller is a library to provide data preparation on user-specified dataset.
lanchiang/strudel
Strudel: Detecting structure in verbose CSV files via classifying lines and cells.
lanchiang/accel-brain-code
The purpose of this repository is to make prototypes as case study in the context of proof of concept(PoC) and research and development(R&D) that I have written in my website. The main research topics are Auto-Encoders in relation to the representation learning, the statistical machine learning for energy-based models, adversarial generation networks(GANs), Deep Reinforcement Learning such as Deep Q-Networks, semi-supervised learning, and neural network language model for natural language processing.
lanchiang/AggreCol
Algorithm to detect aggregations in verbose CSV files.
lanchiang/csv-annotation-tool
GUI tool to annotate the type of lines/cells and aggregations.
lanchiang/data-binning
A data bucketization library.
lanchiang/line-type-classification
Classify the types of lines in two dimensional tabular data.
lanchiang/splendor-gan
Train a GAN to learn how to play Splendor (suspended)
lanchiang/table-normalizer
Extract normalized relational tables from verbose CSV files via reinforcement learning (q-learning).
lanchiang/data-downloader
lanchiang/data-file-boundary-detection
This project is to detect the boundaries between structured data and comments in data files.
lanchiang/data-preparation
Data preparation project
lanchiang/DataCleaner
The premier open source Data Quality solution
lanchiang/datasets-unioner
This repository is a tool to union different single table datasets.
lanchiang/etf_hist
lanchiang/excel-transformer
Transform excel files to the plain text format.
lanchiang/hopf
Holistic primary key and foreign key detection
lanchiang/Little-Big-Data
Data describing topics ranging from Cars and Air Travel to Billionaires and Celebrities
lanchiang/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
lanchiang/matrix-calculation-in-hadoop
lanchiang/metadata-ms
A library to store metadata of relational databases including the schema, statistics, and integrity constraints.
lanchiang/musicbrainz-server
The official musicbrainz-server codebase
lanchiang/naru
Neural Relation Understanding: neural cardinality estimators for tabular data
lanchiang/nmf-matrix-decomposition-in-distributed-environment
lanchiang/pads-haskell
Haskell binding for PADS
lanchiang/roadmap
A public roadmap for Streamlit
lanchiang/schema-generator
This is the tool to generate schema for the seminar dpfs
lanchiang/Spoon-Knife
This repo is for demonstration purposes only.
lanchiang/sqlParser
A parsing tool to extract primary keys and foreign keys from sql files.
lanchiang/univocity-parsers
uniVocity-parsers is a suite of extremely fast and reliable parsers for Java. It provides a consistent interface for handling different file formats, and a solid framework for the development of new parsers.