Pinned Repositories
dpo-rlaif
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
DPO-fail
This project demonstrates a failure to fine-tune Llama 3.1 with DPO (Direct Preference Optimization) on a custom dataset.
EBH-book
Evidence-Based Hiring book, in markdown, with pandoc build scripts for HTML, EPUB and PDF.
educational-transformer
Easy-to-follow, educational implementation of the transformer model in PyTorch.
reddit-filter
Python scripts to filter the "humor-chains" dataset from Hugging Face.
ShortGPT
WebBS-Calculator
Source code for the Web Bloat Score Calculator.
ZSvedic's Repositories
ZSvedic/WebBS-Calculator
Source code for the Web Bloat Score Calculator.
ZSvedic/EBH-book
Evidence-Based Hiring book, in markdown, with pandoc build scripts for HTML, EPUB and PDF.
ZSvedic/educational-transformer
Easy-to-follow, educational implementation of the transformer model in PyTorch.
ZSvedic/DPO-fail
This project demonstrates a failure to fine-tune Llama 3.1 with DPO (Direct Preference Optimization) on a custom dataset.
ZSvedic/reddit-filter
Python scripts to filter the "humor-chains" dataset from Hugging Face.
ZSvedic/ShortGPT