yxzwayne's Stars
mlfoundations/dclm
DataComp for Language Models
rebuy-de/aws-nuke
Nuke a whole AWS account and delete all its resources.
openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
facebookresearch/chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
okarthikb/state-space-models
willccbb/mlx_parallm
Fast parallel LLM inference for MLX
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
xhluca/bm25s
Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
EleutherAI/sae
Sparse autoencoders
openai/sparse_autoencoder
devflowinc/trieve
All-in-one infrastructure for building search, recommendations, and RAG. Trieve combines search language models with tools for tuning ranking and relevance.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
cohere-ai/cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
apple/ml-ane-transformers
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
KellerJordan/modded-nanogpt
GPT-2 (124M) quality in 5B tokens
google-deepmind/nanodo
MatX-inc/seqax
seqax = sequence modeling + JAX
skywalker023/fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
KhoomeiK/complexity-scaling
gzip Predicts Data-dependent Scaling Laws
microsoft/MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
togethercomputer/stripedhyena
Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
evo-design/evo
Biological foundation modeling from molecular to genome scale