johnnygreco's Stars
ggerganov/llama.cpp
LLM inference in C/C++
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
guidance-ai/guidance
A guidance language for controlling large language models.
joke2k/faker
Faker is a Python package that generates fake data for you.
karpathy/char-rnn
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
mshumer/gpt-prompt-engineer
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
huggingface/chat-ui
Open source codebase powering the HuggingChat app
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
microsoft/presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
karpathy/ng-video-lecture
tlkh/asitop
Perf monitoring CLI tool for Apple Silicon
jaymody/picoGPT
An unnecessarily tiny implementation of GPT-2 in NumPy.
samim23/polymath
Convert any music library into a music production sample-library with ML
keras-team/keras-core
A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.
ykdojo/kaguya
A ChatGPT plugin that allows you to load and edit your local files in a controlled way, as well as run any Python, JavaScript, and bash script.
skrub-data/skrub
Prepping tables for machine learning
Liuhong99/Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
Natooz/MidiTok
MIDI / symbolic music tokenizers for Deep Learning models 🎶
Tsingularity/dift
[NeurIPS'23] Emergent Correspondence from Image Diffusion
gretelai/gretel-synthetics
Synthetic data generators for structured and unstructured text, featuring differentially private learning.
huggingface/large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
yt-project/unyt
Handle, manipulate, and convert data with units in Python
weaviate/healthsearch-demo
Discover Healthsearch: Unlocking Health with Semantic Search ✨
yasyf/summ
GPT-based Conversation Summarizer
scientific-python/lazy_loader
Populate library namespace without incurring immediate import costs
gretelai/gretel-python-client
The Gretel Python Client allows you to interact with the Gretel REST API.
koaning/prodigy-tui
A textual TUI for Prodigy
johnnygreco/nerb
🏗️ Named Entity Regex Builder (NERB)