- Attribute Renaming
- Cell Value Reformatting
- Key Expansion
- Key Order Changing
- Column (attribute) prediction
- Key index prediction
- Aggregation mode prediction (applies only to the COVID-19 dataset)
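
The list above does not define the table transformations, so the following is only a minimal sketch of one plausible reading of three of them (attribute renaming, cell value reformatting, key order changing) on a toy table. The helper names, the column names, and the cell values are all made up for illustration and are not taken from the datasets. Key expansion is left out because its definition is not given here.

```python
from datetime import datetime

# Toy table with made-up values: each row is a dict, and
# ("Country", "Date") are assumed to act as the key columns.
rows = [
    {"Country": "A", "Date": "3/1/20", "Confirmed": 10},
    {"Country": "B", "Date": "3/1/20", "Confirmed": 20},
]


def rename_attribute(rows, old, new):
    """Attribute renaming: give one column a new name, keeping column order."""
    return [{(new if k == old else k): v for k, v in r.items()} for r in rows]


def reformat_cells(rows, column, fn):
    """Cell value reformatting: apply fn to every value in one column."""
    return [{k: (fn(v) if k == column else v) for k, v in r.items()} for r in rows]


def reorder_columns(rows, order):
    """Key order changing: emit the columns in the given order."""
    return [{k: r[k] for k in order} for r in rows]


def to_iso(date_str):
    # Assumed input format: month/day/two-digit year, e.g. "3/1/20".
    return datetime.strptime(date_str, "%m/%d/%y").date().isoformat()


# Each helper returns new rows, so the original table is left unchanged.
out = rename_attribute(rows, "Confirmed", "Cases")
out = reformat_cells(out, "Date", to_iso)
out = reorder_columns(out, ["Date", "Country", "Cases"])
print(out[0])
```

Because every helper builds new dictionaries, the original and the transformed table can be kept side by side, which is convenient when a model has to be given both versions.
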
- COVID-19 dataset: https://github.com/CSSEGISandData/COVID-19
- Google Mobility Report dataset: https://www.google.com/covid19/mobility/
- Linux log data
- macOS log data
- Android log data
- Pretrained Word Embedding (fastText) + Bi-LSTM
- Pretrained Word Tokenizer (BERT Tokenizer) + Encoder-only Transformer
Dataset:
Download dataset:
import gdown
# The URL points to a Google Drive folder, so use download_folder rather than download.
gdown.download_folder('https://drive.google.com/drive/folders/19oLAKktjI0uk8v4lcdBTnRBTyqN-tGeR', output=None, quiet=False)
Run in Google Colab or a local Jupyter Notebook.