Data Integration for Fast-evolving Data

Data Integration Demo

Data Integration for Data with Schema Changes

  1. Attribute Renaming
  2. Cell Value Reformatting
  3. Key Expansion
  4. Key Order Changing
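The four kinds of schema change listed above can be made concrete with a small sketch on toy records. This is only an illustration, not the repository's code: the records, field names, and helper functions are hypothetical, and the readings of "Key Expansion" (the key gains an extra component) and "Key Order Changing" (rows are sorted by a different key order) are assumptions, since the terms are not defined here.

```python
from datetime import datetime

# Hypothetical toy records; field names and values are illustrative only.
old_rows = [
    {"country": "US", "date": "2020-03-01", "cases": 30},
    {"country": "IT", "date": "2020-03-01", "cases": 1694},
]

def rename_attribute(rows, old, new):
    """Attribute renaming: same values, different column name."""
    return [{(new if k == old else k): v for k, v in r.items()} for r in rows]

def reformat_cells(rows, column, fn):
    """Cell value reformatting: same column, values rewritten by fn."""
    return [{**r, column: fn(r[column])} for r in rows]

def expand_key(rows, new_key, default):
    """Key expansion (assumed meaning): the key gains an extra component."""
    return [{new_key: default, **r} for r in rows]

def reorder_keys(rows, order):
    """Key order changing (assumed meaning): rows sorted by a new key order."""
    return sorted(rows, key=lambda r: tuple(r[k] for k in order))

def iso_to_us(date_text):
    """Rewrite an ISO date (YYYY-MM-DD) as MM/DD/YYYY."""
    return datetime.strptime(date_text, "%Y-%m-%d").strftime("%m/%d/%Y")

# Apply the four changes in sequence to produce the "evolved" version.
new_rows = rename_attribute(old_rows, "cases", "confirmed")
new_rows = reformat_cells(new_rows, "date", iso_to_us)
new_rows = expand_key(new_rows, "province", "")
new_rows = reorder_keys(new_rows, ["country", "province"])

for row in new_rows:
    print(row)
```

Integrating the old and new versions then amounts to recovering the mapping between `old_rows` and `new_rows` without being told which of these changes were applied.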

Table Cell Position Prediction Using Sequence/Language Model

  1. Column (attribute) prediction
  2. Key index prediction
  3. Aggregation mode prediction (applies only to the Covid-19 dataset)
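To illustrate the first two prediction targets, the sketch below derives labeled examples from a toy table: each cell is paired with its column name and the index of its key (row). This is an assumption about how the task could be framed, not the repository's actual data pipeline or model input format. The table and the `cell_position_examples` helper are hypothetical, and aggregation mode is left out because it is not described here.

```python
# Hypothetical toy table: a header plus data rows keyed by the first column.
header = ["country", "date", "confirmed"]
rows = [
    ["US", "2020-03-01", "30"],
    ["IT", "2020-03-01", "1694"],
]

def cell_position_examples(header, rows):
    """Return (cell_text, column_name, key_index) for every cell in the table."""
    examples = []
    for key_index, row in enumerate(rows):
        for column_name, cell in zip(header, row):
            examples.append((cell, column_name, key_index))
    return examples

examples = cell_position_examples(header, rows)
for cell, column_name, key_index in examples:
    print(f"{cell!r} -> column={column_name}, key_index={key_index}")
```

A sequence or language model would then be trained to predict the column and key index from the cell text and its context.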

Dataset

Covid-19 scenario

Machine log scenario

  • Linux log data
  • macOS log data
  • Android log data

Model

Usage

Dataset:

GitHub Repository

Google Drive

Download dataset:

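import gdown

# The link points to a Google Drive folder, so use download_folder;
# gdown.download only handles single files.
gdown.download_folder(url='https://drive.google.com/drive/folders/19oLAKktjI0uk8v4lcdBTnRBTyqN-tGeR', output=None, quiet=False)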

Run the notebooks in Google Colab or a local Jupyter Notebook.