/openai-finetuning-dataprepare-sanbox

openai-finetuning-data-prepare-sanbox

Primary LanguagePython

OpenAI Finetuning dataprepare Sandbox

Introduction

This repository uses the python language to convert data in json format to jsonl file format. And it provides a simple example code to help prepare data using openai tool.

Setup

  1. If you don’t have Python installed, install it from here

  2. Clone this repository

  3. Navigate into the project directory

    $ cd openai-finetuning-dataprepare-sanbox
  4. Move the prepared json file to the dataset folder.

  5. Run jsonToJsonl.py

    $ python jsonToJsonl.py
  6. Check if the jsonl file is created correctly in the converted folder.

  7. Run dataPrepareTool.py

    $ python dataPrepareTool.py

Features