matthewmackay's Stars
TylerYep/torchinfo
View model summaries in PyTorch!
gpu-mode/awesomeMLSys
An ML Systems Onboarding list
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
cloneofsimo/insightful-nn-papers
These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning
arpitingle/gpu-alpha
High Quality Resources on GPU Programming/Architecture
rwitten/HighPerfLLMs2024
karpathy/LLM101n
LLM101n: Let's build a Storyteller
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
stanford-cs336/spring2024-lectures
KellerJordan/modded-nanogpt
NanoGPT (124M) in 3.4 minutes
jax-ml/coix
Inference Combinators in JAX
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
gpu-mode/resource-stream
GPU programming related news and material links
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
patrick-kidger/jaxtyping
Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/
justinchiu/openlogprobs
Extract full next-token probabilities via language model APIs
Kinyugo/consistency_models
A mini-library for training consistency models.
guidance-ai/guidance
A guidance language for controlling large language models.
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
dottxt-ai/outlines
Structured Text Generation
blackjax-devs/blackjax
BlackJAX is a Bayesian Inference library designed for ease of use, speed and modularity.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
dataflowr/notebooks
code for deep learning courses
unit8co/darts
A python library for user-friendly forecasting and anomaly detection on time series.
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
state-spaces/mamba
Mamba SSM architecture