jen-pan's Stars
stas00/ml-engineering
Machine Learning Engineering Open Book
cs231n/cs231n.github.io
Public-facing notes page
gpu-mode/lectures
Material for gpu-mode lectures
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
isaac-sim/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
facebookresearch/diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
gpu-mode/resource-stream
GPU programming related news and material links
alxndrTL/mamba.py
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Denys88/rl_games
RL implementations
isaac-sim/OmniIsaacGymEnvs
Reinforcement Learning Environments for Omniverse Isaac Gym
sublee/trueskill
An implementation of the TrueSkill rating system for Python
lucidrains/ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
apoorvumang/prompt-lookup-decoding
srush/annotated-mamba
Annotated version of the Mamba paper
linjames0/Transformer-CUDA
An implementation of the transformer architecture as an NVIDIA CUDA kernel
shreyansh26/FlashAttention-PyTorch
Implementation of FlashAttention in PyTorch
PeaBrane/mamba-tiny
Simple, minimal implementation of the Mamba SSM in one PyTorch file, using logcumsumexp (Heisen sequence).
tlc-pack/libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
kotoba-tech/kotomamba
Mamba training library developed by Kotoba Technologies
NVIDIA/online-softmax
Benchmark code for the "Online normalizer calculation for softmax" paper
tanaymeh/mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
johnma2006/candle
Deep learning library implemented from scratch in NumPy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.
google-deepmind/diplomacy
andyzoujm/breaking-llama-guard
Code to break Llama Guard
sgiraz/CUDA-Training
Some CUDA projects and utilities
debowin/cuda-parallel-scan-prefix-sum
An implementation of a work-efficient Parallel Prefix-Sum (Scan) algorithm on the GPU.
TasnimAK/BPE-Vocabulary-Builder
An implementation of Byte Pair Encoding (BPE), a data compression technique that can also be used for efficient subword tokenization in natural language processing tasks