srush's Stars
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
jart/cosmopolitan
build-once run-anywhere C library
huggingface/candle
Minimalist ML framework for Rust
state-spaces/mamba
Vaibhavs10/insanely-fast-whisper
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
sgl-project/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
jxmorris12/vec2text
utilities for decoding deep representations (like sentence embeddings) back to text
freddyaboulton/gradio-tools
kyang6/llmparser
Classify and extract structured data with LLMs
aedocw/epub2tts
Turn an epub or text file into an audiobook
patrick-kidger/mkposters
Make posters from Markdown files.
anishathalye/auriga
Auriga is a minimalist LaTeX beamer presentation theme 📽
ofirpress/self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
fpgaminer/GPTQ-triton
GPTQ inference Triton kernel
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
google-research/jaxpruner
odashi/davinci-functions
Library to ask OpenAI GPT for generating objects on the Python runtime.
XzwHan/CARD
Official PyTorch implementation for the paper "CARD: Classification and Regression Diffusion Models"
tianlinxu312/Everything-about-LLMs
A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.
michaelfarrell76/End-To-End-Generative-Dialogue
A neural conversation model
jxiw/BiGS
Official Repository of Pretraining Without Attention (BiGS). BiGS is the first model to achieve BERT-level transfer learning on the GLUE benchmark with subquadratic complexity in length (or without attention).
da03/markup2im
Diffusion-based markup-to-image generation
vvvm23/mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
eschluntz/PytorchBridge
Designing bridge trusses with PyTorch autograd
varun-ml/diffusion-models-tutorial
Experiment with diffusion models that you can run on your local Jupyter instances
yashbonde/rasp
Implementing the RASP transformer programming language (https://arxiv.org/pdf/2106.06981.pdf).
sustcsonglin/gated_linear_attention_layer
herrmann/rustorch
"PyTorch in Rust"