llama3_finetune_inference: A Python repository from MDGrey33

This project intends to share an example of how to do the following inpython:

load Llama3 and run inference in python poetry run python ./llm_inference/llama-3-8b-instruct.py
fine tune llama3 model based on raw text file poetry run python ./llm_finetune/finetune-llama3.py
run inference on newly fintuned model with your data poetry run python ./llm_inference/llama-nur-inference.py This project was run on a macbook pro M3 max with 128 GB ram overnight.

Getting started

Clone repository git clone https://github.com/MDGrey33/llama3_finetune_inference.git cd to local folder cd llama3_finetune_inference setup poetry project poetry install create a hugging face token and request permission to the model you want to use run pwd in the folder you cloned to get your project rootcreate .env file and add the following to it

HF_TOKEN=yourhuggingfacetoken
PROJECT_ROOT=yourprojectroot

load Llama3 and run inference on poetry run python ./llm_inference/llama-3-8b-instruct.py
fine tune llama3 model based on raw text file poetry run python ./llm_finetune/finetune-llama3.py
run inference on newly fintuned model with your data poetry run python ./llm_inference/llama-nur-inference.py