HengLuRepos

HengLuRepos's Stars

luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python78667
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.5k4.6k
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
Language:Jupyter Notebook6.2k660
Farama-Foundation/Minari
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
Language:Python32948
Farama-Foundation/Gymnasium-Robotics
A collection of robotics simulation environments for reinforcement learning
Language:Python60493
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language:Python8.1k1.1k
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python1.4k288
FZJ-JSC/tutorial-multi-gpu
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
Language:Cuda20354
udacity/cs344
Introduction to Parallel Programming class code
Language:Cuda1.3k1.1k
gpu-mode/ring-attention
ring-attention experiments
Language:Python11311
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda25k2.8k
Allenpandas/Reinforcement-Learning-Papers
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL)，including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
31235
Breakend/experiment-impact-tracker
Language:Python27931
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Language:Python2k162
Stability-AI/StableCascade
Official Code for Stable Cascade
Language:Jupyter Notebook6.6k534
NVIDIA/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
Language:C6.7k1.9k
stepjam/RLBench
A large-scale benchmark and learning environment.
Language:Python1.2k241
google-research/ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Language:Python58597
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language:HTML11.3k952
VainF/Torch-Pruning
[CVPR 2023] DepGraph: Towards Any Structural Pruning
Language:Python2.8k339
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程
Language:Jupyter Notebook11.2k1.3k
RUCAIBox/RecSysDatasets
This is a repository of public data sources for Recommender Systems (RS).
Language:Python901133
gusye1234/LightGCN-PyTorch
The PyTorch implementation of LightGCN
Language:Python900238
clvrai/awesome-rl-envs
1.1k85
intel/intel-extension-for-pytorch
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
Language:Python1.7k256
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python9.5k1.7k
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language:Python2.2k526
scikit-image/scikit-image
Image processing in Python
Language:Python6.1k2.2k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.6k1.4k
openai/safety-gym
Tools for accelerating safe exploration research.
Language:Python511141