BetuBin18070

PhD student of ICT,CAS. I am interested in reinforcement learning, diffusion models, and large language models.

Institute of Computing Technology Chinese Academy of SciencesBeijing, China

BetuBin18070's Stars

Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Language:Python24.2k 233 5192.1k
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook9.1k 118 1291.3k
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language:Python8.9k 37 3001.1k
zotero-chinese/styles
中文 CSL 样式 - Zotero 中文社区
Language:XML5.4k 18 382856
gerdm/prml
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
Language:Jupyter Notebook2.3k 36 0512
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language:Python1.9k 7 61367
opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
1k 40 157
huawei-noah/trustworthyAI
Trustworthy AI related projects
Language:Python1k 21 109227
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
1k 18 153
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Language:Python980 12 64162
pranz24/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python854 9 37182
jvpoulos/causal-ml
Must-read papers and resources related to causal inference and machine (deep) learning
698 28 0129
denisyarats/pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
Language:Jupyter Notebook527 6 7104
jannerm/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Language:Python484 10 2384
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language:Python483 6 2065
TianhongDai/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python417 6 2974
lucidrains/classifier-free-guidance-pytorch
Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models
Language:Python411 9 529
AJLoveChina/LoveTree
:palm_tree:爱情树，将相爱的时刻永远珍藏（微信，QQ可完美查看）https://ajlovechina.github.io/LoveTree/
Language:JavaScript399 2 0903
songshangru/BIT-CS-Learning
保存一下我自己整理的北理工计科的学习资料，欢迎分享资源
Language:VHDL342 6 270
Zhendong-Wang/Diffusion-Policies-for-Offline-RL
Language:Python306 3 2239
shibhansh/loss-of-plasticity
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
Language:Python272 6 752
spitis/mrl
Language:Python112 5 524
sail-sg/edp
[NeurIPS 2023] Efficient Diffusion Policy
Language:Python93 8 26
BellmanTimeHut/DIPO
Language:Python91 1 08
jarridrb/DEM
Code for the paper Iterated Denoising Energy Matching for Sampling from Boltzmann Densities.
Language:Python47 3 27
sumedh7/CausalCuriosity
Official implementation of Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning at ICML 2021.
Language:Python37 3 18
GilgameshD/GRADER
This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
Language:Python32 2 15
swyoon/Diffusion-by-MaxEntIRL
The official repository for NeurIPS 2024 Oral <Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models>
Language:Python202
zhushy/trustworthyAI-1
trustworthy AI related projects
Language:Python3 1 00
HeyuanMingong/DiffusionQL
Language:Python2 0 01

BetuBin18070

BetuBin18070's Stars

Genesis-Embodied-AI/Genesis

SakanaAI/AI-Scientist

lucidrains/denoising-diffusion-pytorch

zotero-chinese/styles

gerdm/prml

nikhilbarhate99/PPO-PyTorch

opendilab/awesome-model-based-RL

huawei-noah/trustworthyAI

opendilab/awesome-diffusion-model-in-rl

jannerm/diffuser

pranz24/pytorch-soft-actor-critic

jvpoulos/causal-ml

denisyarats/pytorch_sac

jannerm/mbpo

jannerm/trajectory-transformer

TianhongDai/hindsight-experience-replay

lucidrains/classifier-free-guidance-pytorch

AJLoveChina/LoveTree

songshangru/BIT-CS-Learning

Zhendong-Wang/Diffusion-Policies-for-Offline-RL

shibhansh/loss-of-plasticity

spitis/mrl

sail-sg/edp

BellmanTimeHut/DIPO

jarridrb/DEM

sumedh7/CausalCuriosity

GilgameshD/GRADER

swyoon/Diffusion-by-MaxEntIRL

zhushy/trustworthyAI-1

HeyuanMingong/DiffusionQL