pickxiguapi

Building Embodied World Now

Pinned Repositories

CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Language: Jupyter Notebook 443 2 1939
AutoCarryRobot
Control battling robots through a mobile app; robot face recognition and automatic carrying
Language: Python 5 1 01
cat-classification
Twelve-class cat classification based on transfer learning, with an accuracy of 94.58333, using ResNet50
Language: Jupyter Notebook 6 1 03
Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language: Python 33 2 32
ED2
the ED2 implementation
Language: Python 0 0 00
euclid-iclr2023
Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)
Language: Python 1 2 00
Mini-Uni-RLHF
Minimal implementation for easy-to-use RLHF annotation
Language: Python 1 1 00
MiniC-Compiler
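DUT Compiler Principles course project: defines a subset of the C language and includes lexical analysis, syntax analysis, semantic analysis, interpreted execution, and a corresponding graphical interface
Language: Python 12 1 01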
Novel-Sou-Sou
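NovelSouSou, a novel search engine: uses Scrapy to crawl several Biquge (笔趣阁) sites, stores novel information in MongoDB, builds an inverted index for searching, and provides a Django-based web service for searching novels across the web with one-click download.
Language: Python 30 1 011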
Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language: Python 32 2 01

pickxiguapi's Repositories

pickxiguapi/Clean-Offline-RLHF
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language: Python 33 2 32
pickxiguapi/Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)
Language: Python 32 2 01
pickxiguapi/euclid-iclr2023
Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)
Language: Python 1 2 00
pickxiguapi/Mini-Uni-RLHF
Minimal implementation for easy-to-use RLHF annotation
Language: Python 1 1 00
pickxiguapi/BabyAI-text
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Language: Python 0 0 00
pickxiguapi/BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
pickxiguapi/Best-README-Template
An awesome README template to jumpstart your projects!
0 0
pickxiguapi/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
pickxiguapi/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python 0 0
pickxiguapi/decision-diffuser
Language: Python 0 0
pickxiguapi/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Language: Python 0 0
pickxiguapi/diffusion_reward
Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"
Language: Python 0 0
pickxiguapi/dreamerv2
Mastering Atari with Discrete World Models
Language: Python 0 0
pickxiguapi/dreamerv3-torch
Implementation of Dreamer v3 in PyTorch.
Language: Python 0 0
pickxiguapi/Everything-LLMs-And-Robotics
The world's largest GitHub Repository for LLMs + Robotics
0 0
pickxiguapi/learning-from-scratch
The repository of On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline
Language: Python 0 0
pickxiguapi/LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
pickxiguapi/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
pickxiguapi/metaworld2gym
pickxiguapi/MV-MWM
Language: Python 0 0
pickxiguapi/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
Language: Python 0 0
pickxiguapi/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Language: Python 0 0
pickxiguapi/RLHF
RLHF
Language: Jupyter Notebook 0 0
pickxiguapi/robohive
A unified framework for robot learning
Language: Python 0 0
pickxiguapi/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
Language: Python 0 0
pickxiguapi/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
Language: Python 0 0
pickxiguapi/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Language: Python 0 0
pickxiguapi/text2reward
Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"
Language: Jupyter Notebook 0 0
pickxiguapi/unstable_baselines
Re-implementations of SOTA RL algorithms.
Language: Python 0 0
pickxiguapi/v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Language: Python 0 0