/data-cleaning

A tool to clean various datasets from Turing AI

Primary LanguageJavaScriptMIT LicenseMIT

Data-cleaning

A tool for cleaning datasets from turing ai

Usage guide

  1. Clone the repository
git clone https://github.com/TuringAI-Team/Data-cleaning.git
  1. Install the requirements
npm install
  1. Create folders, where you will store your datasets and results Create a folder in root called data Create a folder in root called steps
  2. Export your dataset from turing ai to the data folder and name it data.csv
  3. Run the CLI
npm run dev
  1. Follow the instructions in the CLI
  2. The results will be stored in the steps folder

To do

  • Fix bug where sometimes it answer with invalid json
  • Improve cleaning to not remove so much data