A Simple Dataset Collector

utility

  • Data_Format_Validation.py is the script that validates the format of the openai fine-tuning dataset.

Pokemon_Crawler

  • This is a simple web crawler that crawls images of pokemon from pinterest and uses GPT-4o to label the images.

  • The labeled images are then saved in the Pictures_test folder.

  • Pokemon_fine_tune.py is the script that generate the dataset for the fine-tuning.

  • Pokemon_Crawler.py is the main script that crawls the images and labels them.