chufanchen's Stars
dibyaghosh/jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
KaiYan289/RLpapersnote
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
keraJLi/rejax
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
kevmo314/scuda
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
pytorch/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
RussWong/CUDATutorial
A CUDA tutorial to make people learn CUDA program from 0
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
natolambert/rlhf-book
Textbook on reinforcement learning from human feedback
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
huggingface/trl
Train transformer language models with reinforcement learning.
triton-lang/triton
Development repository for the Triton language and compiler
masa-ue/RLfinetuning_Diffusion_Bioseq
Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences like DNA (enhancers) and RNA (UTRs) design.
tatsu-lab/gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
EdanToledo/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
BVLC/caffe
Caffe: a fast open framework for deep learning.
mcinglis/c-style
My favorite C programming practices.
BBuf/how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
EmptyJackson/policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
tracel-ai/burn
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
sun-hailong/LAMDA-PILOT
🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
state-spaces/s4
Structured state space sequence models