Skip to content

Latest commit

 

History

History
78 lines (31 loc) · 1.43 KB

readme.md

File metadata and controls

78 lines (31 loc) · 1.43 KB

Data Integration Demo

Data Integration for Data with Schema Changes

  1. Attribute Renaming
  2. Cell Value Reformatting
  3. Key Expansion
  4. Key Order Changing

Table Cell Position Prediction Using Sequence/Language Model

  1. Column (attribute) prediction
  2. Key index prediction
  3. Aggregation mode prediction (only apply to covid-19 dataset)

Dataset

Covid-19 scenario

Machine log scenario

  • Linux log data
  • macOS log data
  • Android log data

Model

Usage

Dataset:

Github Repository

Google Drive

Download dataset:

import gdown
gdown.download('https://drive.google.com/drive/folders/19oLAKktjI0uk8v4lcdBTnRBTyqN-tGeR', output=None, quiet=False)

Run with Google Colab or local Jupyter Notebook.