BetuBin18070
PhD student at ICT, CAS. I am interested in reinforcement learning, diffusion models, and large language models.
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
BetuBin18070's Stars
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
zotero-chinese/styles
Chinese CSL styles - Zotero Chinese community
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
huawei-noah/trustworthyAI
Trustworthy AI related projects
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
pranz24/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
jvpoulos/causal-ml
Must-read papers and resources related to causal inference and machine (deep) learning
denisyarats/pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
jannerm/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
lucidrains/classifier-free-guidance-pytorch
Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models
AJLoveChina/LoveTree
🌴 Love Tree: treasure your moments of love forever (displays perfectly in WeChat and QQ) https://ajlovechina.github.io/LoveTree/
songshangru/BIT-CS-Learning
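A collection of study materials I compiled for the Computer Science program at Beijing Institute of Technology (BIT); shared resources are welcome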
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
shibhansh/loss-of-plasticity
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
spitis/mrl
sail-sg/edp
[NeurIPS 2023] Efficient Diffusion Policy
BellmanTimeHut/DIPO
jarridrb/DEM
Code for the paper Iterated Denoising Energy Matching for Sampling from Boltzmann Densities.
sumedh7/CausalCuriosity
Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML 2021.
GilgameshD/GRADER
This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
swyoon/Diffusion-by-MaxEntIRL
The official repository for NeurIPS 2024 Oral <Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models>
zhushy/trustworthyAI-1
trustworthy AI related projects
HeyuanMingong/DiffusionQL