patrickfleith's Stars
github/gitignore
A collection of useful .gitignore templates
DovAmir/awesome-design-patterns
A curated list of software and architecture related design patterns.
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device, so it stays 100% private.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
camenduru/stable-diffusion-webui-colab
Stable Diffusion web UI Colab
zhanymkanov/fastapi-best-practices
FastAPI Best Practices and Conventions we used at our startup
explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
opendatalab/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
MilesCranmer/PySR
High-Performance Symbolic Regression in Python and Julia
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
samwit/langchain-tutorials
A set of LangChain tutorials from my YouTube channel
ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
zenml-io/awesome-open-data-annotation
Open Source Data Annotation & Labeling Tools
facebookresearch/atlas
Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)
VikParuchuri/textbook_quality
Generate textbook-quality synthetic LLM pretraining data
amanchadha/coursera-machine-learning-engineering-for-prod-mlops-specialization
Programming assignments and quizzes from all courses within the Machine Learning Engineering for Production (MLOps) specialization offered by deeplearning.ai
open-spaced-repetition/free-spaced-repetition-scheduler
A spaced repetition algorithm based on the DSR model
davanstrien/awesome-synthetic-datasets
Awesome synthetic (text) datasets
ESA-PhiLab/iris
Semi-automatic tool for manual segmentation of multi-spectral and geo-spatial imagery.
zzndream/ShipRSImageNet
ShipRSImageNet is the largest ship detection dataset in the Computer Vision and Earth Vision communities.
ARCLab-MIT/splid-devkit
Development toolkit for the Satellite Pattern-of-Life Identification Dataset (SPLID)
expertailab/SPACE-IDEAS
Source code and datasets for the paper SPACE-IDEAS: A Dataset for Salient Information Detection in Space Innovation
philschmid/langchain-samples-and-experiments