Pinned Repositories
CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
AutoCarryRobot
Control robots in battles via a mobile app; robot face recognition and automatic carrying
cat-classification
Twelve-class cat classification based on transfer learning with ResNet50; accuracy 94.58333
Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
ED2
The ED2 implementation
euclid-iclr2023
Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)
Mini-Uni-RLHF
Minimal implementation for easy-to-use RLHF annotation
MiniC-Compiler
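DUT compiler principles course project: defines a subset of the C language and includes lexical analysis, syntax analysis, semantic analysis, interpreted execution, and a corresponding graphical interface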
Novel-Sou-Sou
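NovelSouSou novel search engine: crawls multiple Biquge sites with Scrapy, stores novel information in MongoDB, builds an inverted index for search, and serves a Django-based web service that searches novels across the web and supports one-click download.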
Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
pickxiguapi's Repositories
pickxiguapi/Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
pickxiguapi/euclid-iclr2023
Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)
pickxiguapi/Mini-Uni-RLHF
Minimal implementation for easy-to-use RLHF annotation
pickxiguapi/BabyAI-text
We perform functional grounding of LLMs' knowledge in BabyAI-Text
pickxiguapi/BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
pickxiguapi/Best-README-Template
An awesome README template to jumpstart your projects!
pickxiguapi/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
pickxiguapi/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
pickxiguapi/decision-diffuser
pickxiguapi/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
pickxiguapi/diffusion_reward
Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"
pickxiguapi/dreamerv2
Mastering Atari with Discrete World Models
pickxiguapi/dreamerv3-torch
Implementation of Dreamer v3 in PyTorch.
pickxiguapi/Everything-LLMs-And-Robotics
The world's largest GitHub Repository for LLMs + Robotics
pickxiguapi/learning-from-scratch
The repository of On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline
pickxiguapi/LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
pickxiguapi/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
pickxiguapi/metaworld2gym
pickxiguapi/MV-MWM
pickxiguapi/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
pickxiguapi/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
pickxiguapi/RLHF
RLHF
pickxiguapi/robohive
A unified framework for robot learning
pickxiguapi/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
pickxiguapi/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
pickxiguapi/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
pickxiguapi/text2reward
Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"
pickxiguapi/unstable_baselines
Re-implementations of SOTA RL algorithms.
pickxiguapi/v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations