chufanchen

I'm a master's student at Zhejiang University. Teach me anything!

Zhejiang, China

chufanchen's Stars

dibyaghosh/jaxrl_m
Skeleton for scalable and flexible Jax RL implementations
Language:Python66
lafmdp/Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.
61155
KaiYan289/RLpapersnote
392
mit-han-lab/nunchaku
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Language:Cuda54528
keraJLi/rejax
Language:Python1597
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
Language:Python26710
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python6.7k605
kevmo314/scuda
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
Language:C++58121
pytorch/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
Language:HTML754170
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
Language:C++22.4k5.6k
inspirai/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
Language:Python32032
RussWong/CUDATutorial
A CUDA tutorial to make people learn CUDA program from 0
Language:Cuda19953
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Language:Cuda1.8k181
natolambert/rlhf-book
Textbook on reinforcement learning from human feedback
Language:TeX8711
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Language:Python1.5k71
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.4k1.3k
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.8k1.7k
masa-ue/RLfinetuning_Diffusion_Bioseq
Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences like DNA (enhancers) and RNA (UTRs) design.
Language:Jupyter Notebook984
tatsu-lab/gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
Language:Python495123
EdanToledo/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
Language:Python25226
CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Language:Jupyter Notebook43939
BVLC/caffe
Caffe: a fast open framework for deep learning.
Language:C++34.2k18.7k
mcinglis/c-style
My favorite C programming practices.
2k99
BBuf/how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
36923
EmptyJackson/policy-guided-diffusion
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
Language:Python1247
tracel-ai/burn
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
Language:Rust9.2k459
sun-hailong/LAMDA-PILOT
🎉 PILOT: A Pre-trained Model-Based Continual Learning Toolbox
Language:Python32935
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python29125
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.3k312
state-spaces/s4
Structured state space sequence models
Language:Jupyter Notebook2.5k300

chufanchen

chufanchen's Stars

dibyaghosh/jaxrl_m

lafmdp/Awesome-Papers-Autonomous-Agent

KaiYan289/RLpapersnote

mit-han-lab/nunchaku

keraJLi/rejax

zhaochenyang20/Awesome-ML-SYS-Tutorial

sgl-project/sglang

kevmo314/scuda

pytorch/kineto

PaddlePaddle/Paddle

inspirai/TimeChamber

RussWong/CUDATutorial

DefTruth/CUDA-Learn-Notes

natolambert/rlhf-book

sustcsonglin/flash-linear-attention

huggingface/trl

triton-lang/triton

masa-ue/RLfinetuning_Diffusion_Bioseq

tatsu-lab/gpt_paper_assistant

EdanToledo/Stoix

CleanDiffuserTeam/CleanDiffuser

BVLC/caffe

mcinglis/c-style

BBuf/how-to-learn-deep-learning-framework

EmptyJackson/policy-guided-diffusion

tracel-ai/burn

sun-hailong/LAMDA-PILOT

SalesforceAIResearch/DiffusionDPO

OpenRLHF/OpenRLHF

state-spaces/s4