florescl's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
nomic-ai/gpt4all
gpt4all: run open-source LLMs anywhere
ggerganov/llama.cpp
LLM inference in C/C++
facebookresearch/llama
Inference code for LLaMA models
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
kingoflolz/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
rustformers/llm
An ecosystem of Rust libraries for working with large language models
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
tensorchord/Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
google-research/t5x
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
IST-DASLab/sparsegpt
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
bsilverthorn/maccarone
AI-managed code blocks in Python ⏪⏩
pengbaolin/LLM-Augmenter
ml6team/fondant
Production-ready data processing made easy and shareable
loubnabnl/santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
Zyq-scut/RLTF
Accepted by Transactions on Machine Learning Research (TMLR)
antimatter15/AlpacaChat
A Swift library that runs Alpaca prediction locally to implement ChatGPT like app on Apple platform devices.
ypapanik/t5-for-code-generation
Semantic Parsing with text-to-text Transformers